Regularized projection pursuit for data with a small sample-to-variable ratio

Siyuan Hou1, Peter D. Wentzell1
1Department of Chemistry, Dalhousie University, PO Box 15000, Halifax, NS, B3H 4R2, Canada

Tóm tắt

Từ khóa


Tài liệu tham khảo

Ametaj, B. N., Zebeli, Q., Saleem, F., Psychogios, N., Lewis, M. J., Dunn, S. M., et al. (2010). Metabolomics reveals unhealthy alterations in rumen metabolism with increase proportion of cereal grain in the diet of dairy cows. Metabolomics, 6, 583–594.

Bishop, C. M. (2006). Pattern recognition and machine learning. New York: Springer Science+Business Media LLC.

Broadhurst, D. I., & Kell, D. B. (2006). Statistical strategies for avoiding false discoveries in metabolomics and related experiments. Metabolomics, 2, 171–196.

Comon, P. (1994). Independent component analysis, a new concept? Signal Processing, 36, 287–314.

Emmanuel, D. G. V., Dunn, S. M., & Ametaj, B. N. (2008). Feeding high proportions of barley grain stimulates an inflammatory response in dairy cows. Journal of Dairy Science, 91, 606–614.

Frank, A., & Asuncion, A. (2010). UCI Machine Learning Repository. School of Information and Computer Science, University of California, Irvine, CA. Retrieved April 10, 2011 from http://archive.ics.uci.edu/ml .

Friedman, J. H. (1987). Exploratory projection pursuit. Journal of the American Statistical Association, 82, 249–266.

Friedman, J. H. (1989). Regularized discriminant analysis. Journal of the American Statistical Association, 84, 165–175.

Friedman, J. H., & Tukey, J. W. (1974). A Projection pursuit algorithm for exploratory data analysis. IEEE Transactions on Computers, 23, 881–890.

Hall, P. (1989). On polynomial-based projection indices for exploratory projection pursuit. The Annals of Statistics, 17, 589–605.

Hoerl, E., & Kennard, R. W. (1970a). Ridge regression: biased estimation for nonorthogonal problems. Technometrics, 12, 55–67.

Hoerl, E., & Kennard, R. W. (1970b). Ridge regression: applications to nonorthogonal problems. Technometrics, 12, 69–82.

Hou, S., & Wentzell, P. D. (2011). Fast and simple methods for the optimization of kurtosis used as a projection pursuit index. Analytica Chimica Acta, 704, 1–15.

Hsiang, T. C. (1975). A Bayesian view on ridge regression. The Statistician, 24, 267–268.

Huber, P. J. (1985). Projection pursuit. The Annals of Statistics, 13, 435–475.

Hubert, M. (2001). Multivariate outlier detection and robust covariance matrix estimation: discussion. Technometrics, 43, 303–306.

Hyvärinen, A., Karhunen, J., & Oja, E. (2001). Independent component analysis. New York: John Wiley and Sons Inc.

Jones, M. C., & Sibson, R. (1987). What is projection pursuit? Journal of the Royal Statistical Society Series A, 150, 1–36.

Magnus, J. R., & Neudecker, H. (1988). Matrix differential calculus with applications in statistics and econometrics. New York: John Wiley & Sons.

Marquardt, D. W., & Snee, R. D. (1975). Ridge regression in practice. The American Statistician, 29, 3–20.

Metaboanalyst: a web service for metabolomic data analysis (2011). Retrieved April 10, 2011 from http://www.metaboanalyst.ca/MetaboAnalyst/faces/Docs/Format.jsp .

Michalski, R. S., & Chilausky, R. L. (1980). Learning by being told and learning from examples: an experimental comparison of the two methods of knowledge acquisition in the context of development an expert system for soybean disease diagnosis. International Journal of Policy Analysis and Information Systems, 4, 125–161.

Montgomery, D. C., & Peck, E. A. (1992). Introduction to linear regression analysis (2nd ed.). New York: John Wiley & Sons Inc.

Morton, S. C. (1989). Interpretable Projection Pursuit. SLAC Report-355, Stanford Linear Accelerator Center, Stanford University, California.

O’Sullivan, F. (1986). A statistical perspective on ill-posed inverse problems. Statistical Science, 1, 502–518.

Peña, D., & Prieto, F. J. (2001a). Cluster identification using projections. Journal of the American Statistics Association, 96, 1433–1445.

Peña, D., & Prieto, F. J. (2001b). Multivariate outlier detection and robust covariance matrix estimation. Technometrics, 43, 286–310.

Petersen, K. B., & Pedersen, M. S. (2008). The Matrix Cookbook (ver. 14 November 2008). Retrieved April 10, 2011 from http://matrixcookbook.com .

Poole, D. (2003). Linear algebra: a modern introduction. Boston: Brooks/Cole.

Posse, C. (1990). An effective two-dimensional projection pursuit algorithm. Communications in Statistics, Simulation and Computation, 19, 1143–1164.

Psihogios, N. G., Kalaitzidis, R. G., Dimou, S., Seferiadis, K. I., Siamopoulos, K. C., & Bairaktari, E. T. (2007). Evaluation of tubulointerstitial lesions’ severity in patients with glomerulonephritides: an NMR-based metabonomic study. Journal of Proteome Research, 6, 3760–3770.

Ragazzi, E., Pucciarelli, S., Seraglia, R., Molin, L., Agostini, M., Lise, M., et al. (2006). Multivariate analysis approach to the plasma protein profile of patients with advanced colorectal cancer. Journal of Mass Spectrometry, 41, 1546–1553.

Sietsma, J., & Dow, R. J. F. (1991). Creating artificial neural networks that generalize. Neural Networks, 4, 67–69.

Stone, V. (2004). Independent component analysis: A tutorial introduction. Cambridge: The MIT Press.

Stout, F., & Kalivas, J. H. (2006). Tikhonov regularization in standard and general form for multivariate calibration with applications towards removing unwanted spectral artifacts. Journal of Chemometrics, 20, 22–33.

Stout, F., Kalivas, J. H., & Heberger, K. (2007). Wavelength selection for multivariate calibration using Tikhonov regularization. Applied Spectroscopy, 61, 85–95.

Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society Series B, 58, 267–288.

Titterington, D. M. (1985). Common structure of smoothing techniques in statistics. International Statistical Review, 53, 141–170.

Tukey, J. W. (1977). Exploratory data analysis. Reading: Addison-Wesley Publishing Company Inc.

Tukey, J. W. (1980). We need both exploratory and confirmatory. The American Statistician, 34, 23–25.

Vapnik, V. N. (1995). The nature of statistical learning theory. New York: Springer-Verlag.

Wahl, P. W., & Kronmal, R. A. (1977). Discriminant functions when covariances are unequal and sample sizes are moderate. Biometrics, 33, 479–484.

Wentzell, P. D., & Lohnes, M. T. (1999). Maximum likelihood principal component analysis with correlated measurement errors: theoretical and practical considerations. Chemometrics and Intelligent Laboratory System, 45, 65–85.

Westerhuis, J. A., Hoefsloot, H. C. J., Smit, S., Vis, D. J., Smilde, A. K., van Velzen, E. J. J., et al. (2008). Assessment of PLSDA cross validation. Metabolomics, 4, 81–89.

Yap, K., Guan, L., & Evans, J. (2001). Blind adaptive detection for CDMA systems based on regularized independent component analysis. IEEE Global Telecommunications Conference (pp. 249–253). New York: IEEE.

Yenyukov, I. S. (1989). Indices for projection pursuit. In E. Diday (Ed.), Data analysis, learning symbolic and numeric knowledge (pp. 181–189). New York: Nova Science Publishers.

Zhang, Z., Dai, G., & Xu, C. (2010). Regularized discriminant analysis, ridge regression and beyond. Journal of Machine Learning Research, 11, 2199–2228.