Rapid cell population identification in flow cytometry data

Nima Aghaeepour1,2, Radina Nikolic3,2, Holger H. Hoos4, Ryan R. Brinkman5,2
1Department of Bioinformatics, University of British Columbia, British Columbia, Canada
2Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada.
3Department of Statistics, University of Oxford, Oxford, United Kingdom
4Department of Computer Science, University of British Columbia, British Columbia, Canada
5Department of Medical Genetics, University of British Columbia, British Columbia, Canada;

Tóm tắt

Abstract

We have developed flowMeans, a time‐efficient and accurate method for automated identification of cell populations in flow cytometry (FCM) data based on K‐means clustering. Unlike traditional K‐means, flowMeans can identify concave cell populations by modelling a single population with multiple clusters. flowMeans uses a change point detection algorithm to determine the number of sub‐populations, enabling the method to be used in high throughput FCM data analysis pipelines. Our approach compares favorably to manual analysis by human experts and current state‐of‐the‐art automated gating algorithms. flowMeans is freely available as an open source R package through Bioconductor. © 2010 International Society for Advancement of Cytometry

Từ khóa


Tài liệu tham khảo

10.1002/cyto.a.20617

10.1002/cyto.a.20905

10.1002/cyto.a.20901

10.1016/j.leukres.2007.08.022

10.1002/cyto.10084

10.1515/CCLM.2003.052

Hahne F, 2009, Per‐channel basis normalization methods for flow cytometry data, Cytometry Part A, 77, 121

Bashashati A, 2009, A Survey of Flow Cytometry Data Analysis Methods, Adv Bioinformatics, 584603

10.1002/cyto.a.20531

Finak G, 2009, Merging mixture model components for improved cell population identification in high throughput flow cytometry data, Advances in Bioinformatics, 100

BaudryJ RafteryA CeleuxG LoK GottardoR.Combining mixture components for clustering. Journal of Computational and Graphical Statistics 2010;19:332–353.

10.1073/pnas.0903028106

10.1002/cyto.a.20754

10.1186/1471-2105-11-403

Murphy R, 2005, Automated identification of subpopulations in flow cytometric list mode data using cluster analysis, Cytometry Part A, 6, 302, 10.1002/cyto.990060405

Pelleg D, 2000, Proceedings of the Seventeenth International Conference on Machine Learning table of contents, 727

Hamerly G, 2004, Learning the K in k‐means, Advances in Neural Information Processing Systems, 17, 281

10.1002/9780470316801

10.1016/j.csda.2008.02.035

10.1002/9780470316849

RosenbergA HirschbergJ.V‐measure: A conditional entropy‐based external cluster evaluation measure. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP‐CoNLL); Prague Czech Republic.2007. p410–420.

AghaeepourN KhodabakhshiAH BrinkmanRR. An empirical study of cluster evaluation metrics using flow cytometry data. Clustering Theory Workshop Neural Information Processing Systems (NIPS). Whistler British Columbia Canada December 2009.http://clusteringtheory.org/papers/empiricalmetrics.pdf.

10.1016/j.bbmt.2007.02.002

IgelC SuttorpT HansenN.A computational efficient covariance matrix update and a (1+ 1)‐CMA for evolution strategies. In Proceedings of the 8th annual conference on Genetic and Evolutionary Computation GECCO ’︁06. New York: ACM. ISBN 1‐59593‐186‐4; 2006: p 453–460.

AghaeepourN. FlowMeans package at Bioconductor Available at:http://www.bioconductor.org/packages/devel/bioc/html/flowMeans.html.2010.

10.1186/gb-2004-5-10-r80