An active learning approach for clustering single-cell RNA-seq data

Laboratory Investigation - Tập 102 - Trang 227-235 - 2022
Xiang Lin1, Haoran Liu1, Zhi Wei1, Senjuti Basu Roy1, Nan Gao2
1Department of Computer Science, New Jersey Institute of Technology, Newark, NJ, USA
2Department of Biological Sciences, Rutgers University, Newark, NJ, USA

Tài liệu tham khảo

Bacher, 2016, Design and computational analysis of single-cell RNA-sequencing experiments, Genome Biol, 17, 10.1186/s13059-016-0927-y Sun, 2019, Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis, Genome Biol, 20, 1, 10.1186/s13059-019-1898-6 Tian, 2019, Clustering single-cell RNA-seq data with a model-based deep learning approach, Nat. Mach. Intell, 1, 191, 10.1038/s42256-019-0037-0 Wang, 2019, Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data, BMC Bioinform, 20 Talwar, 2018, AutoImpute: Autoencoder based imputation of single-cell RNA-seq data, Sci Rep, 8, 1, 10.1038/s41598-018-34688-x Kiselev, 2019, Challenges in unsupervised clustering of single-cell RNA-seq data, Nat Rev Genet, 20, 273, 10.1038/s41576-018-0088-9 Jaitin, 2014, Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types, Science, 343, 776, 10.1126/science.1247651 Wang, 2020, BREM-SC: a bayesian random effects mixture model for joint clustering single cell multi-omics data, Nucleic Acids Res, 48, 5814, 10.1093/nar/gkaa314 Ringeling, 2021, Linear-time cluster ensembles of large-scale single-cell RNA-seq and multimodal data, Genome Res, 31, 677, 10.1101/gr.267906.120 Ji Z, Ji H. TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis. Nucleic Acids Res. 2016:44;e117–e117. Kiselev, 2017, SC3: consensus clustering of single-cell RNA-seq data, Nat Methods, 14, 483, 10.1038/nmeth.4236 Butler, 2018, Integrating single-cell transcriptomic data across different conditions, technologies, and species, Nat Biotechnol, 36, 411, 10.1038/nbt.4096 Tian, 2021, Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data, Nat Commun, 12, 1, 10.1038/s41467-021-22008-3 Chen, 2021, Single-cell RNA-seq data semi-supervised clustering and annotation via structural regularized domain adaptation, Bioinformatics, 37, 775, 10.1093/bioinformatics/btaa908 Settles B. Active learning literature survey. University of Wisconsin-Madison Department of Computer Sciences; MINDS@UW; 2009. Prince, 2004, Does active learning work? A review of the research, J Eng Educ, 93, 223, 10.1002/j.2168-9830.2004.tb00809.x Hubert, 1985, Comparing partitions, J Classif, 2, 193, 10.1007/BF01908075 Strehl, 2002, Cluster ensembles–a knowledge reuse framework for combining multiple partitions, J. Mach Learn Res, 3, 583 Kuhn, 1955, The Hungarian method for the assignment problem, Nav Res Logist Q, 2, 83, 10.1002/nav.3800020109 Balcan M-F, Broder A, Zhang T. Margin based active learning. In: International Conference on Computational Learning Theory. 35–50. Springer; International Conference on Computational Learning Theory; 2007. Zheng, 2017, Massively parallel digital transcriptional profiling of single cells, Nat Commun, 8, 1, 10.1038/ncomms14049 Cao, 2017, Comprehensive single-cell transcriptional profiling of a multicellular organism, Science, 357, 661, 10.1126/science.aam8940 Young, 2018, Single-cell transcriptomes from human kidneys reveal the cellular identity of renal tumors, Science, 361, 594, 10.1126/science.aat1699 Han, 2018, Mapping the mouse cell atlas by microwell-seq, Cell, 172, 1091, 10.1016/j.cell.2018.02.001 Wei C, Sohn K, Mellina C, Yuille A, Yang, F. Crest: a class-rebalancing self-training framework for imbalanced semi-supervised learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021;10857–66.