A chemometrics toolbox based on projections and latent variables

Journal of Chemometrics - Tập 28 Số 5 - Trang 332-346 - 2014
Lennart Eriksson1, Johan Trygg2, Svante Wold2
1MKS Umetrics AB, Umeå, Sweden
2Computational Life Science Cluster, Department of Chemistry Umeå University SE‐901 87 Umeå Sweden

Tóm tắt

A personal view is given about the gradual development of projection methods—also called bilinear, latent variable, and more—and their use in chemometrics. We start with the principal components analysis (PCA) being the basis for more elaborate methods for more complex problems such as soft independent modeling of class analogy, partial least squares (PLS), hierarchical PCA and PLS, PLS‐discriminant analysis, Orthogonal projection to latent structures (OPLS), OPLS‐discriminant analysis and more.From its start around 1970, this development was strongly influenced by Bruce Kowalski and his group in Seattle, and his realization that the multidimensional data profiles emerging from spectrometers, chromatographs, and other electronic instruments, contained interesting information that was not recognized by the current one variable at a time approaches to chemical data analysis.This led to the adoption of what in statistics is called the data analytical approach, often called also the data driven approach, soft modeling, and more. This approach combined with PCA and later PLS, turned out to work very well in the analysis of chemical data. This because of the close correspondence between, on the one hand, the matrix decomposition at the heart of PCA and PLS and, on the other hand, the analogy concept on which so much of chemical theory and experimentation are based. This extends to numerical and conceptual stability and good approximation properties of these models.The development is informally summarized and described and illustrated by a few examples and anecdotes. Copyright © 2014 John Wiley & Sons, Ltd.

Từ khóa


Tài liệu tham khảo

10.1021/ja00771a016

10.1021/ja00784a007

JöreskogKG.Herman Wold ed.s systems under indirect observation parts I and II. Cartigny proceedings North‐Holland Amsterdam 1982.

Mathematics and statistics in chemistry proceedings NATO Adv. Study inst. On Chemometrics Cosenza Italy Sept. 1983. (BR Kowalski Ed.) D.Reidel Dordrecht The Netherlands 1984.

10.1016/0003-2670(86)80028-9

10.1016/S0003-2670(01)85039-X

10.1016/S0262-4079(13)61120-3

10.1007/BFb0062108

10.1021/ac9705733

Horst P, 1965, Factor Analysis of Data Matrices

Wold H, 1966, Multivariate Analysis, 391

10.1002/cem.1248

10.1080/00401706.1978.10489693

10.1002/cem.1086

Martens H, 1989, Multivariate calibration H. Martens, T. Næs, Multivariate Calibration

10.1016/0169-7439(87)80021-7

10.1016/0169-7439(89)80110-8

10.1524/zkri.1966.123.5.388

Wold S, 1974, Chemica Scripta: A theoretical foundation of extrathermodynamic relationships (linear free energy relationships), Chemica Scripta, 5, 97

10.1021/ja01280a022

10.1002/cem.1180090105

10.1021/ac00063a019

10.1002/cem.860

10.1002/cem.1180060404

10.1016/S0169-7439(98)00014-8

Miller P, 1998, Contribution plots: a missing link in multivariate quality control, App. Math. Comp. Sci., 8, 775

10.1016/0031-3203(76)90014-5

10.1007/s00216-005-3394-y

10.1080/00224065.1996.11979699

WiseBM VeltkampDJ DavisB RickerNL KowalskiBR. "Principal components analysis for monitoring the west valley liquid fed ceramic melter " Waste Management '88 Proceedings pps. 811–818 Tucson AZ1988.

10.1016/j.chemolab.2012.05.015

10.1137/S0895479800368354

10.1016/S0003-2670(00)86335-7

10.1080/00401706.1970.10488634

10.1137/0905052

WoldS AlbanoC DunnIIIWJ EsbensenK HellbergS JohanssonE SjöströmM.Pattern recognition: finding and using regularities in multi‐variate data(H.Martens and H Russwurm Jr Ed.s). Proc. IUFOST Conf. Food Research and Data Analysis 1983 pp.147–188.

10.1016/S0003-2670(00)85460-4

10.1002/cem.695

10.1016/0169-7439(93)85002-X

10.1016/j.carbpol.2004.05.015

10.1366/0003702894202201

10.1016/B978-0-444-87877-9.50042-X

10.1002/cem.1006

10.1002/cem.1254

Wold S, 1999, PLS in Chemistry, The Encyclopedia of Computational Chemistry, 2006

Macgregor JF, 1988, Online statistical process control, Chem. Eng. Prog., 84, 21

10.1002/cem.1007

WoldS KettanehN TjessemK.Journal of Chemometrics Hierarchical multiblock PLS and PC models for easier model interpretation and as an alternative to variable selection.1996 10:463–482.

Fisher RA, 1935, The Design of Experiments

Box GEP, 1978, Statistics for Experimenters: An Introduction to Design, Data Analysis, and Model Building

10.1016/S0003-2670(00)86294-7

Carlson R, 1991, Design and Optimization in Organic Synthesis

10.1002/cem.863

Tukey JW, 1977, Exploratory Data Analysis