On the Incommensurability Phenomenon

Journal of Classification - Tập 33 - Trang 185-209 - 2016
Donniell E. Fishkind1, Cencheng Shen1, Youngser Park1, Carey E. Priebe1
1Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, United States

Tóm tắt

Suppose that two large, multi-dimensional data sets are each noisy measurements of the same underlying random process, and principal components analysis is performed separately on the data sets to reduce their dimensionality. In some circumstances it may happen that the two lower-dimensional data sets have an inordinately large Procrustean fitting-error between them. The purpose of this manuscript is to quantify this “incommensurability phenomenon”. In particular, under specified conditions, the square Procrustean fitting-error of the two normalized lower-dimensional data sets is (asymptotically) a convex combination (via a correlation parameter) of the Hausdorff distance between the projection subspaces and the maximum possible value of the square Procrustean fitting-error for normalized data. We show how this gives rise to the incommensurability phenomenon, and we employ illustrative simulations and also use real data to explore how the incommensurability phenomenon may have an appreciable impact.

Tài liệu tham khảo