A Direct Approach to False Discovery Rates
Tóm tắt
Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for a single-hypothesis test, a compound error rate is controlled for multiple-hypothesis tests. For example, controlling the false discovery rate FDR traditionally involves intricate sequential p-value rejection methods based on the observed data. Whereas a sequential p-value method fixes the error rate and estimates its corresponding rejection region, we propose the opposite approach—we fix the rejection region and then estimate its corresponding error rate. This new approach offers increased applicability, accuracy and power. We apply the methodology to both the positive false discovery rate pFDR and FDR, and provide evidence for its benefits. It is shown that pFDR is probably the quantity of interest over FDR. Also discussed is the calculation of the q-value, the pFDR analogue of the p-value, which eliminates the need to set the error rate beforehand as is traditionally done. Some simple numerical examples are presented that show that this new approach can yield an increase of over eight times in power compared with the Benjamini–Hochberg FDR method.
Từ khóa
Tài liệu tham khảo
Benjamini, 1995, Controlling the false discovery rate: a practical and powerful approach to multiple testing, J. R. Statist. Soc., 57, 289
2000, On the adaptive control of the false discovery rate in multiple testing with independent statistics, J. Educ. Behav. Statist., 25, 60, 10.3102/10769986025001060
Benjamini, 1999, A step-down multiple-hypothesis procedure that controls the false discovery rate under independence, J. Statist. Planng Inf., 82, 163, 10.1016/S0378-3758(99)00040-3
Efron, 2001, Empirical Bayes analysis of a microarray experiment, J. Am. Statist. Ass., 96, 1151, 10.1198/016214501753382129
Friedman, 2001, The role of statistics in the data revolution, Int. Statist. Rev., 69, 5, 10.1111/j.1751-5823.2001.tb00474.x
Storey, 2001, The positive False Discovery Rate: a Bayesian interpretation and the q-value
Storey, 2001, Estimating false discovery rates under dependence, with applications to DNA microarrays
Tusher, 2001, Significance analysis of microarrays applied to transcriptional responses to ionizing radiation, Proc. Natn. Acad. Sci. USA, 98, 5116, 10.1073/pnas.091062498