Outlier identification in high dimensions

Computational Statistics and Data Analysis - Tập 52 Số 3 - Trang 1694-1711 - 2008
Peter Filzmoser, Ricardo A. Maronna1, Mark Werner2
1Department of Mathematics, Faculty of Exact Sciences, National University of La Plata, and C.I.C.P.B.A., La Plata, Argentina
2Department of Mathematics, The American University in Cairo, Egypt

Tóm tắt

Từ khóa


Tài liệu tham khảo

Adrover, 2002, Projection estimates of multivariate location, Ann. Statist., 30, 1760, 10.1214/aos/1043351256

Becker, 1999, The masking breakdown point of multivariate outliers, J. Amer. Statist. Assoc., 94, 947, 10.2307/2670009

Billor, 2000, BACON: blocked adaptive computationally-efficient outlier nominators, Comput. Statist. Data Anal., 34, 279, 10.1016/S0167-9473(99)00101-2

Davies, 1987, Asymptotic behaviour of S-estimates of multivariate location parameters and dispersion matrices, Ann. Statist., 15, 1269, 10.1214/aos/1176350505

Donoho, D., 1982. Breakdown properties of multivariate location estimators. Ph.D. Thesis, Harvard University.

Dudoit, 2002, Comparison of discrimination methods for the classification of tumors using gene expression data, J. Amer. Statist. Assoc., 97, 77, 10.1198/016214502753479248

Filzmoser, 2005, Multivariate outlier detection in exploration geochemistry, Comput. Geosci., 31, 579, 10.1016/j.cageo.2004.11.013

Gnanadesikan, 1972, Robust estimates, residuals, and outlier detection with multiresponse data, Biometrics, 28, 81, 10.2307/2528963

Golub, 1999, Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science, 286, 531, 10.1126/science.286.5439.531

Hall, 2005, Geometric representation of high dimension, low sample size data, J. Roy. Statist. Soc. Ser. B, 67, 427, 10.1111/j.1467-9868.2005.00510.x

Hardin, 2005, The distribution of robust distances, J. Comput. Graphical Statist., 14, 928, 10.1198/106186005X77685

Hössjer, 1995, Generalizing univariate signed rank statistics for testing and estimating a multivariate location parameter, J. Nonparametric Statist., 4, 293, 10.1080/10485259508832620

Huber, 1985, Projection pursuit, Ann. Statist., 13, 435, 10.1214/aos/1176349519

Hubert, 2005, Robpca: a new approach to robust principal components analysis, Technometrics, 47, 64, 10.1198/004017004000000563

Jackson, 1991

Janssens, 1998, Composition of 15–17th century archæological glass vessels excavated in Antwerp, Belgium, Mikrochimica Acta, 15, 253

Johnson, 1998

Lemberge, 2000, Quantitative analysis of 16–17th century archæological glass vessels using PLS regression of EPXMA and μ-XRF data, J. Chemometrics, 14, 751, 10.1002/1099-128X(200009/12)14:5/6<751::AID-CEM622>3.0.CO;2-D

Locantore, 1999, Robust principal components for functional data, Test, 8, 1, 10.1007/BF02595862

Maronna, 1995, The behavior of the Stahel–Donoho robust multivariate estimator, J. Amer. Statist. Assoc., 90, 330, 10.2307/2291158

Maronna, 2002, Robust estimates of location and dispersion for high-dimensional data sets, Technometrics, 44, 307, 10.1198/004017002188618509

Maronna, 2006

Peña, 2001, Multivariate outlier detection and robust covariance matrix estimation, Technometrics, 43, 286, 10.1198/004017001316975899

R Development Core Team, 2005. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, ISBN 3-900051-07-0. URL: 〈http://www.R-project.org〉.

Reimann, C., Äyräs, M., Chekushin, V., Bogatyrev, I., Boyd, R., Caritat, P.de., Dutter, R., Finne, T., Halleraker, J., Jæger, O., Kashulina, G., Lehto, O., Niskavaara, H., Pavlov, V., Räisänen, M., Strand, T., Volden, T., 1998. Environmental Geochemical Atlas of the Central Barents Region. Geological Survey of Norway (NGU), Geological Survey of Finland (GTK), and Central Kola Expedition (CKE), Special Publication, Trondheim, Espoo, Monchegorsk.

Rocke, 1996, Robustness properties of S-estimators of multivariate location and shape in high dimension, Ann. Statist., 24, 1327, 10.1214/aos/1032526972

Rousseeuw, 1985, Multivariate estimation with high breakdown point, vol. B, 283

Rousseeuw, 1999, A fast algorithm for the minimum covariance determinant estimator, Technometrics, 41, 212, 10.2307/1270566

Serneels, 2005, Partial robust M-regression, Chemometrics and Intelligent Laboratory Systems, 79, 55, 10.1016/j.chemolab.2005.04.007

Stahel, W., 1981. Breakdown of covariance estimators. Research Report 31, Fachgruppe für Statistik, E.T.H. Zürich.

Tenenhaus, M., Vinzi, V.E., Chatelin, Y.-M., Lauro, C., 2005. Pls path modeling. Comput. Statist. Data. Anal. 48(1), 159–205.

Woodruff, 1994, Computable robust estimation of multivariate location and shape in high dimension using compound estimators, J. Amer. Statist. Assoc., 89, 888, 10.2307/2290913