OPLS discriminant analysis: combining the strengths of PLS‐DA and SIMCA classification

Journal of Chemometrics - Volume 20, Issues 8-10 - Pages 341-351 - 2006
Max Bylesjö (1), Mattias Rantalainen (2), Olivier Cloarec (2), Jeremy K. Nicholson (2), Elaine Holmes (2), Johan Trygg (1)
(1) Research Group for Chemometrics, Department of Chemistry, Umeå University, Umeå, SE-901 87, Sweden
(2) Biological Chemistry, Biomedical Sciences Division, Faculty of Natural Sciences, Imperial College London, South Kensington SW7 2AZ, UK

Abstract

The characteristics of the OPLS method have been investigated for the purpose of discriminant analysis (OPLS‐DA). We demonstrate how class‐orthogonal variation can be exploited to augment classification performance in cases where the individual classes exhibit divergence in within‐class variation, in analogy with soft independent modelling of class analogy (SIMCA) classification. The prediction results will be largely equivalent to traditional supervised classification using PLS‐DA if no such variation is present in the classes. A discriminatory strategy is thus outlined, combining the strengths of PLS‐DA and SIMCA classification within the framework of the OPLS‐DA method. Furthermore, resampling methods have been employed to generate distributions of predicted classification results and subsequently assess classification belief. This enables utilisation of the class‐orthogonal variation in a proper statistical context. The proposed decision rule is compared to common decision rules and is shown to produce comparable or less class‐biased classification results. Copyright © 2007 John Wiley & Sons, Ltd.
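The resampling idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the resampling scheme or the decision rule, so everything below is an assumption made for illustration: a bootstrap over the training set, a one-dimensional predicted score per sample, and a nearest-class-mean classifier standing in for the discriminant model (it is not OPLS-DA). The function names `fit_centroids`, `predict` and `bootstrap_belief` are hypothetical.

```python
import random
from statistics import mean


def fit_centroids(samples):
    """Return the mean score per class from a list of (score, label) pairs."""
    by_label = {}
    for score, label in samples:
        by_label.setdefault(label, []).append(score)
    return {label: mean(scores) for label, scores in by_label.items()}


def predict(centroids, score):
    """Assign the class whose mean score is closest (stand-in classifier)."""
    return min(centroids, key=lambda label: abs(score - centroids[label]))


def bootstrap_belief(train, new_score, n_resamples=500, seed=0):
    """Estimate, per class, the fraction of bootstrap models that assign
    `new_score` to that class. This fraction is used here as a simple,
    assumed proxy for classification belief."""
    rng = random.Random(seed)
    labels = sorted({label for _, label in train})
    votes = {label: 0 for label in labels}
    done = 0
    while done < n_resamples:
        # Draw a bootstrap sample: same size as train, with replacement.
        resample = [rng.choice(train) for _ in train]
        # Skip resamples in which a class is missing entirely.
        if {label for _, label in resample} != set(labels):
            continue
        votes[predict(fit_centroids(resample), new_score)] += 1
        done += 1
    return {label: count / n_resamples for label, count in votes.items()}


# Toy scores for two well-separated classes (made-up numbers).
train = [(0.9, "A"), (1.1, "A"), (1.0, "A"), (1.2, "A"),
         (-1.0, "B"), (-0.8, "B"), (-1.1, "B"), (-0.9, "B")]

belief_clear = bootstrap_belief(train, 0.8)        # deep inside class A
belief_borderline = bootstrap_belief(train, 0.05)  # near the class boundary
print(belief_clear, belief_borderline)
```

A sample far from the class boundary receives the same label in every resample, whereas a borderline sample can split its votes between the classes; the spread of that distribution is what a belief estimate of this kind summarises.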
