A Direct Approach to False Discovery Rates

John D. Storey1
1Stanford University - USA > > > >

Tóm tắt

SummaryMultiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for a single-hypothesis test, a compound error rate is controlled for multiple-hypothesis tests. For example, controlling the false discovery rate FDR traditionally involves intricate sequential p-value rejection methods based on the observed data. Whereas a sequential p-value method fixes the error rate and estimates its corresponding rejection region, we propose the opposite approach—we fix the rejection region and then estimate its corresponding error rate. This new approach offers increased applicability, accuracy and power. We apply the methodology to both the positive false discovery rate pFDR and FDR, and provide evidence for its benefits. It is shown that pFDR is probably the quantity of interest over FDR. Also discussed is the calculation of the q-value, the pFDR analogue of the p-value, which eliminates the need to set the error rate beforehand as is traditionally done. Some simple numerical examples are presented that show that this new approach can yield an increase of over eight times in power compared with the Benjamini–Hochberg FDR method.

Từ khóa


Tài liệu tham khảo

Benjamini, 1995, Controlling the false discovery rate: a practical and powerful approach to multiple testing, J. R. Statist. Soc., 57, 289

2000, On the adaptive control of the false discovery rate in multiple testing with independent statistics, J. Educ. Behav. Statist., 25, 60, 10.3102/10769986025001060

Benjamini, 1999, A step-down multiple-hypothesis procedure that controls the false discovery rate under independence, J. Statist. Planng Inf., 82, 163, 10.1016/S0378-3758(99)00040-3

Efron, 1993, An Introduction to the Bootstrap, 10.1007/978-1-4899-4541-9

Efron, 2001, Empirical Bayes analysis of a microarray experiment, J. Am. Statist. Ass., 96, 1151, 10.1198/016214501753382129

Friedman, 2001, The role of statistics in the data revolution, Int. Statist. Rev., 69, 5, 10.1111/j.1751-5823.2001.tb00474.x

Storey, 2001, The positive False Discovery Rate: a Bayesian interpretation and the q-value

Storey, 2001, Estimating false discovery rates under dependence, with applications to DNA microarrays

Tusher, 2001, Significance analysis of microarrays applied to transcriptional responses to ionizing radiation, Proc. Natn. Acad. Sci. USA, 98, 5116, 10.1073/pnas.091062498

Yekutieli, 1999, Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics, J. Statist. Planng Inf., 82, 171, 10.1016/S0378-3758(99)00041-5