Do multiple outcome measures require p-value adjustment?

Ronald J. Feise
Institute of Evidence-Based Chiropractic, Fort Collins

References

Godfrey K: Statistics in practice. Comparing the means of several groups. N Engl J Med. 1985, 313: 1450-1456.

Feise RJ: Behavioral-graded activity compared with usual care after first-time disk surgery: Considerations of the design of a randomized clinical trial (Letter). J Manipulative Physiol Ther. 2001, 24: 67-68. 10.1067/mmt.2001.112007.

Ostelo RW, de Vet HC: Behavioral-graded activity compared with usual care after first-time disk surgery: Considerations of the design of a randomized clinical trial (Letter). J Manipulative Physiol Ther. 2001, 24: 68. 10.1067/mmt.2001.112008.

Tukey JW: Some thoughts on clinical trials, especially problems of multiplicity. Science. 1977, 198: 679-684.

Bland JM, Altman DG: Multiple significance tests: the Bonferroni method. BMJ. 1995, 310: 170.

Greenhalgh T: Statistics for the non-statistician. I. Different types of data need different statistical tests. BMJ. 1997, 315: 364-366.

Ludbrook J: Multiple comparison procedures updated. Clin Exp Pharmacol Physiol. 1998, 25: 1032-1037.

Ahlbom A: Biostatistics for Epidemiologists. Boca Raton (FL), Lewis Publishers. 1993, 52-53.

Steenland K, Bray I, Greenland S, Boffetta P: Empirical Bayes adjustments for multiple results in hypothesis-generating or surveillance studies. Cancer Epidemiol Biomarkers Prev. 2000, 9: 895-903.

Sidak Z: Rectangular confidence regions for the means of multivariate normal distribution. J Am Statist Assoc. 1967, 62: 626-633.

Williams DA: A test for differences between treatment means when several dose levels are compared with a zero dose control. Biometrics. 1971, 27: 103-117.

Holm S: A simple sequentially rejective multiple test procedure. Scand J Statist. 1979, 6: 65-70.

Mantel N: Assessing laboratory evidence for neoplastic activity. Biometrics. 1980, 36: 381-399.

Stoline MR: The status of multiple comparisons: simultaneous estimation of all pairwise comparisons in one-way ANOVA designs. Am Stat. 1981, 35: 134-141.

Tukey JW, Ciminera JL, Heyse JF: Testing the statistical certainty of a response to increasing doses of a drug. Biometrics. 1985, 41: 295-301.

Shaffer JP: Modified sequentially rejective multiple test procedures. J Amer Stat Assn. 1986, 81: 826-831.

Hochberg Y, Tamhane AC: Multiple comparison procedures. New York, John Wiley. 1987.

Hommel G: A stepwise rejective multiple test procedure based on a modified Bonferroni test. Biometrika. 1988, 75: 383-386.

Westfall PH, Young SS: p-Value adjustments for multiple tests in multivariate binomial models. J Amer Stat Assn. 1989, 84: 780-786.

Tarone RE: A modified Bonferroni method for discrete data. Biometrics. 1990, 46: 515-522.

Turkheimer F, Pettigrew K, Sokoloff L, Smith CB, Schmidt K: Selection of an adaptive test statistic for use with multiple comparison analyses of neuroimaging data. Neuroimage. 2000, 12: 219-229. 10.1006/nimg.2000.0608.

Neyman J, Pearson ES: On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika. 1928, 20A: 175-240.

Perneger TV: What's wrong with Bonferroni adjustments. BMJ. 1998, 316: 1236-1238.

Rothman KJ: No adjustments are needed for multiple comparisons. Epidemiology. 1990, 1: 43-46.

Savitz DA, Olshan AF: Multiple comparisons and related issues in the interpretation of epidemiologic data. Am J Epidemiol. 1995, 142: 904-908.

Thompson JR: Invited commentary: Re: "Multiple comparisons and related issues in the interpretation of epidemiologic data". Am J Epidemiol. 1998, 147: 801-806.

Cole P: The evolving case-control study. J Chronic Dis. 1979, 32: 15-27.

Thomas DC, Siemiatycki J, Dewar R, Robins J, Goldberg M, Armstrong BG: The problem of multiple inference in studies designed to generate hypotheses. Am J Epidemiol. 1985, 122: 1080-1095.

Aickin M, Gensler H: Adjusting for multiple testing when reporting research results: the Bonferroni vs Holm methods. Am J Public Health. 1996, 86: 726-728.

Manor O, Peritz E: Re: "Multiple comparisons and related issues in the interpretation of epidemiologic data". Am J Epidemiol. 1997, 145: 84-85.

O'Brien PC: Procedures for comparing samples with multiple endpoints. Biometrics. 1984, 40: 1079-1087.

Simes RJ: An improved Bonferroni procedure for multiple tests of significance. Biometrika. 1986, 73: 751-754.

Goldsmith CH, Smythe HA, Helewa A: Interpretation and power of a pooled index. J Rheumatol. 1993, 20: 575-578.

Zhang J, Quan H, Ng J, Stepanavage ME: Some statistical methods for multiple endpoints in clinical trials. Control Clin Trials. 1997, 18: 204-221. 10.1016/S0197-2456(96)00129-8.

Walker AM: Reporting the results of epidemiological studies. Am J Public Health. 1986, 76: 556-558.

deGruy F: Significance of multiple inferential tests. J Fam Pract. 1990, 30: 15-16.

Hart AA: The interpretation of multiple P-values. Radiother Oncol. 1994, 33: 177-178.

Voss S, George S: Multiple significance tests. BMJ. 1995, 310: 1073.

Goodman SN: Multiple comparisons, explained. Am J Epidemiol. 1998, 147: 807-815.

Small RD, Schor SS: Bayesian and non-Bayesian methods of inference. Ann Intern Med. 1983, 99: 857-859.