Discovering the False Discovery Rate

Yoav Benjamini1
1Tel Aviv University, Israel

Tóm tắt

SummaryI describe the background for the paper ‘Controlling the false discovery rate: a new and powerful approach to multiple comparisons’ by Benjamini and Hochberg that was published in the Journal of the Royal Statistical Society, Series B, in 1995. I review the progress since made on the false discovery rate, as well as the major conceptual developments that followed.

Từ khóa


Tài liệu tham khảo

Abramovich, 1996, Adaptive thresholding of wavelet coefficients, Computnl Statist. Data Anal., 22, 351, 10.1016/0167-9473(96)00003-5

Abramovich, 2006, Adapting to unknown sparsity by controlling the false discovery rate, Ann. Statist., 34, 584, 10.1214/009053606000000074

Benjamini, 2010, Simultaneous and selective inference: current successes and future challenges, Biometr. J., 10.1002/bimj.200900299

Benjamini, 2009, A simple forward selection procedure based on false discovery rate control, Ann. Appl. Statist., 3, 179, 10.1214/08-AOAS194

Benjamini, 2009, Selective inference in complex research, Phil. Trans. R. Soc. Lond. A, 367, 4255

Benjamini, 1995, Controlling the false discovery rate: a practical and powerful approach to multiple testing, J. R. Statist. Soc. B, 57, 289

Benjamini, 2000, On the adaptive control of the false discovery rate in multiple testing with independent statistics, J. Educ. Behav. Statist., 25, 60, 10.2307/1165312

Benjamini, 2006, Adaptive linear step-up procedures that control the false discovery rate, Biometrika, 93, 491, 10.1093/biomet/93.3.491

Benjamini, 2001, The control of the False Discovery Rate in multiple testing under dependency, Ann. Statist., 29, 1165, 10.1214/aos/1013699998

Benjamini, 2005, False discovery rate controlling confidence intervals for selected parameters, J. Am. Statist. Ass., 100, 71, 10.1198/016214504000001907

Blanchard, 2009, Adaptive FDR control under independence and dependence, J. Mach. Learn. Res., 10, 2837

Donoho, 2004, Higher criticism for detecting sparse heterogeneous mixtures, Ann. Statist., 32, 962, 10.1214/009053604000000265

Donoho, 2006, Asymptotic minimaxity of false discovery rate thresholding for sparse exponential data, Ann. Statist., 34, 2980, 10.1214/009053606000000920

Efron, 2008, Microarrays, empirical Bayes and the two groups model, Statist. Sci., 23, 1

Gavrilov, 2009, An adaptive step-down procedure with proven FDR control under independence, Ann. Statist., 37, 619, 10.1214/07-AOS586

Genovese, 2002, Operating characteristics and extensions of the false discovery rate procedure, J. R. Statist. Soc. B, 64, 499, 10.1111/1467-9868.00347

Hochberg, 1990, More powerful procedures for multiple significance testing, Statist. Med., 9, 811, 10.1002/sim.4780090710

Van Der Laan, 2007, Multiple Testing Procedures with Applications to Genomics

Reiner-Benaim, 2007, FDR control by the BH procedure for two-sided correlated tests with implications to gene expression data analysis, Biometr. J., 49, 107, 10.1002/bimj.200510313

Sarkar, 1998, Some probability inequalities for ordered MTP2 random variables: a proof of Sime’s conjecture, Ann. Statist., 26, 494, 10.1214/aos/1028144846

Schweder, 1982, Plots of p-values to evaluate many tests simultaneously, Biometrika, 69, 493, 10.1093/biomet/69.3.493

Soriç, 1989, Statistical ‘‘discoveries’’ and effect size estimation, J. Am. Statist. Ass., 84, 608

Storey, 2002, A direct approach to false discovery rates, J. R. Statist. Soc. B, 64, 479, 10.1111/1467-9868.00346

Storey, 2004, Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach, J. R. Statist. Soc. B, 66, 187, 10.1111/j.1467-9868.2004.00439.x

Storey, 2003, Statistical significance for genome-wide experiments, Proc. Natn. Acad. Sci. USA, 100, 9440, 10.1073/pnas.1530509100

Yekutieli, 2010, Adjusted Bayesian inference for selected parameters, Preprint arXiv: 0801.0499v4

Yekutieli, 1999, Resampling based False Discovery Rate controlling procedure for dependent test statistics, J. Statist. Planng Inf., 82, 171, 10.1016/S0378-3758(99)00041-5

Benjamini, 2007, False discovery rates for spatial signals, J. Am. Statist. Ass., 102, 1272, 10.1198/016214507000000941

Benjamini, Controlling the false discovery rate: a practical and powerful approach to multiple testing, J. R. Statist. Soc. B, 57, 289

Benjamini, 1997, Multiple hypothesis testing with weights, Scand. J. Statist., 24, 407, 10.1111/1467-9469.00072

Benjamini, 2001, The control of the false discovery rate in multiple testing under dependency, Ann. Statist., 29, 1165, 10.1214/aos/1013699998

Berger, 1982, Multiparameter hypothesis testing and acceptance sampling, Technometrics, 24, 295, 10.2307/1267823

Cook, 1996, Multiplicity considerations in the design and analysis of clinical trials, J. R. Statist. Soc. A, 159, 93, 10.2307/2983471

Cox, 1965, A remark on multiple comparison methods, Technometrics, 7, 223, 10.1080/00401706.1965.10490250

Dudoit, 2002, Statistical methods for identifying differentially expressed genes in replicated cdna microarray experiments, Statist. Sin., 12, 111

Genovese, 2002, Thresholding of statistical maps in functional neuroimaging using false discovery rate, Neuroimage, 15, 870, 10.1006/nimg.2001.1037

Genovese, 2006, False discovery control with P-value weighting, Biometrika, 93, 509, 10.1093/biomet/93.3.509

Green, 2008, Statistical methods for the analysis of microarray data

Hu, 2009, Multiple hypothesis testing with groups. Technical Report

Jones, 2008, Control of the false discovery rate accounts for multiple testing in comparisons of healthcare providers, J. Clin. Epidem., 61, 232, 10.1016/j.jclinepi.2007.04.017

Meinshausen, 2008, Hierarchical testing of variable importance, Biometrika, 95, 265, 10.1093/biomet/asn007

O’Brien, 1984, Procedures for comparing samples with multiple endpoints, Biometrics, 40, 1079, 10.2307/2531158

Pacificoa, 2007, Scan clustering: a false discovery approach, J. Multiv. Anal., 98, 1441, 10.1016/j.jmva.2006.11.011

Patti, 2003, Coordinated reduction of genes of oxidative metabolism in humans with insulin resistance and diabetes: potential role of pgc1and nrf1, Proc. Natn. Acad. Sci. USA, 100, 8466, 10.1073/pnas.1032913100

Reiner-Benaim, 2007, Associating quantitative behavioral traits with gene expression in the brain: searching for diamonds in the hay, Bioinformatics, 23, 2239, 10.1093/bioinformatics/btm300

Storey, A direct approach to false discovery rates, J. R. Statist. Soc. B, 64, 479, 10.1111/1467-9868.00346

Storey, 2003, Statistical significance for genomewide studies, Proc. Natn. Acad. Sci. USA, 100, 9440, 10.1073/pnas.1530509100

Tuke, 2008, Gene profiling for determining pluripotent genes in a time course microarray experiment, Biostatistics, 10, 80, 10.1093/biostatistics/kxn017

Tusher, 2001, Significance analysis of microarrays applied to the ionizing radiation response, Proc. Natn. Acad. Sci. USA, 98, 5116, 10.1073/pnas.091062498

Wacholder, 2004, Assessing the probability that a positive report is false: an approach for molecular epidemiology studies, J. Natn. Cancer Inst., 96, 434, 10.1093/jnci/djh075

Weisberg, 2003, Obesity is associated with macrophage accumulation in adipose tissue, J. Clin. Investgn, 112, 1796, 10.1172/JCI200319246

Wellek, 2002, Testing Statistical Hypotheses of Equivalence, 10.1201/9781420035964

Wilkinson, 1999, Statistical methods in psychology journals—guidelines and explanations, Am. Psychol., 54, 594, 10.1037/0003-066X.54.8.594

Yekutieli, 2008, Hierarchical false discovery rate controlling methodology, J. Am. Statist. Ass., 103, 309, 10.1198/016214507000001373

Zehetmayer, 2005, Two-stage designs for experiments with a large number of hypotheses, Bioinformatics, 21, 3771, 10.1093/bioinformatics/bti604