Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis

PLoS Genetics - Volume 3, Issue 9 - Page e161
Jeffrey T. Leek1, John D. Storey1,2
1Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
2Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America


References

2006, Assessing stability of gene selection in microarray data analysis., BMC Bioinformatics, 7, 50, 10.1186/1471-2105-7-50

2006, Treating expression levels of different genes as a sample in microarray data analysis: is it worth a risk?, Stat Appl Genet Mol Biol, 5, art9

2000, Analysis of variance for gene expression microarray data., J Comput Biol, 7, 819, 10.1089/10665270050514954

2001, Experimental design for gene expression microarrays., Biostatistics, 2, 183, 10.1093/biostatistics/2.2.183

2000, Fundamental patterns underlying gene expression profiles: simplicity from complexity., Proc Natl Acad Sci U S A, 97, 8409, 10.1073/pnas.150242097

2000, Genomic expression programs in the response of yeast cells to environmental changes., Mol Biol Cell, 11, 4241, 10.1091/mbc.11.12.4241

2004, A transcriptional profile of aging in the human kidney., PLoS Biol, 2, 2191

2005, Significance analysis of time course microarray experiments., Proc Natl Acad Sci U S A, 102, 12837, 10.1073/pnas.0504609102

1997, Exploring the metabolic and genetic control of gene expression on a genomic scale., Science, 278, 680, 10.1126/science.278.5338.680

2002, Genetic dissection of transcriptional regulation in budding yeast., Science, 296, 752, 10.1126/science.1069516

2003, Genetics of gene expression surveyed in maize, mouse and man., Nature, 422, 297, 10.1038/nature01434

2001, Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects., Nucleic Acids Res, 29, 2540

2002, Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation., Nucleic Acids Res, 30, e15, 10.1093/nar/30.4.e15

2005, Correlation between gene expression levels and limitations of the empirical Bayes methodology for finding differentially expressed genes., Stat Appl Genet Mol Biol, 4, art34

2004, Genetic analysis of genome-wide variation in human gene expression., Nature, 430, 743, 10.1038/nature02797

2005, Integrative analysis of the cancer transcriptome., Nat Genet, 37, 31, 10.1038/ng1570

2006, Molecular heterogeneity of inflammatory breast cancer: A hyperproliferative phenotype., Clin Cancer Res, 12, 5047, 10.1158/1078-0432.CCR-05-2248

1999, Fluorescent cDNA microarray hybridization reveals complexity and heterogeneity of cellular genotoxic stress response., Oncogene, 18, 3666, 10.1038/sj.onc.1202676

2006, The connectivity map: using gene-expression signatures to connect small molecules, genes and disease., Science, 313, 1929, 10.1126/science.1132939

2007, A new approach to intensity-dependent normalization of two-channel microarrays., Biostatistics, 8, 128, 10.1093/biostatistics/kxj038

2005, Genetic interactions between polymorphisms that affect gene expression in yeast., Nature, 436, 701, 10.1038/nature03865

2001, Gene-expression profiles in hereditary breast cancer., New Engl J Med, 344, 539, 10.1056/NEJM200102223440801

2003, Statistical significance for genome-wide studies., Proc Natl Acad Sci U S A, 100, 9440, 10.1073/pnas.1530509100

2006, A reanalysis of a published Affymetrix genechip control dataset., Genome Biol, 7, 401, 10.1186/gb-2006-7-3-401

Rice JA, 1995, Mathematical statistics and data analysis. 2nd edition, Belmont (California), Duxbury Press

2002, A direct approach to false discovery rates., J Royal Stat Soc Ser B, 64, 479, 10.1111/1467-9868.00346

1992, Remarks on parallel analysis., Multivariate Behav Res, 27, 509, 10.1207/s15327906mbr2704_2

Lehmann EL, Romano JP, 2005, Testing statistical hypotheses, New York, Springer-Verlag

2005, Variance of the number of false discoveries., J Royal Stat Soc Ser B, 67, 411, 10.1111/j.1467-9868.2005.00509.x

2006, Some comments on instability of false discovery rate estimation., J Bioinform Comput Biol, 4, 1057, 10.1142/S0219720006002338

2004, Large-scale simultaneous hypothesis testing: the choice of a null hypothesis., J Am Stat Assoc, 99, 96, 10.1198/016214504000000089

2007, Correlation and large-scale simultaneous significance testing., J Am Stat Assoc, 102, 93, 10.1198/016214506000001211

2006, Modified Simes' critical values under positive dependence., J Stat Plan Inference, 136, 4129, 10.1016/j.jspi.2005.06.004

2001, The control of the false discovery rate in multiple testing under dependency., Ann Stat, 29, 1165

2006, Estimation of false discovery proportion under general dependence., Bioinformatics, 22, 3025, 10.1093/bioinformatics/btl527

2003, Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors., Nat Genet, 35, 57, 10.1038/ng1222

1998, Cluster analysis and display of genome-wide expression patterns., Proc Natl Acad Sci U S A, 95, 14863, 10.1073/pnas.95.25.14863

2003, Molecular classification of familial non-BRCA1/BRCA2 breast cancer., Proc Natl Acad Sci U S A, 100, 2532, 10.1073/pnas.0533805100

Mardia KV, Kent JT, Bibby JM, 1980, Multivariate analysis, London, Academic Press

2000, Singular value decomposition for genome-wide expression data processing and modeling., Proc Natl Acad Sci U S A, 97, 10101, 10.1073/pnas.97.18.10101

2006, Principal components analysis corrects for stratification in genome-wide association studies., Nat Genet, 38, 904, 10.1038/ng1847

2005, Multiple locus linkage analysis of genomewide expression in yeast., PLoS Biol, 3, 1380

R Development Core Team, 2004, R: a language and environment for statistical computing, Vienna, R Foundation for Statistical Computing

Hastie T, Tibshirani R, 1990, Generalized additive models, New York, Chapman & Hall