Assessing additive effects of air pollutants on mortality rate in Massachusetts
Tóm tắt
We previously found additive effects of long- and short-term exposures to fine particulate matter (PM2.5), ozone (O3), and nitrogen dioxide (NO2) on all-cause mortality rate using a generalized propensity score (GPS) adjustment approach. The study addressed an important question of how many early deaths were caused by each exposure. However, the study was computationally expensive, did not capture possible interactions and high-order nonlinearities, and omitted potential confounders. We proposed two new methods and reconducted the analysis using the same cohort of Medicare beneficiaries in Massachusetts during 2000–2012, which consisted of 1.5 million individuals with 3.8 billion person-days of follow-up. The first method, weighted least squares (WLS), leveraged large volume of data by aggregating person-days, which gave equivalent results to the linear probability model (LPM) method in the previous analysis but significantly reduced computational burden. The second method, m-out-of-n random forests (moonRF), implemented scaling random forests that captured all possible interactions and nonlinearities in the GPS model. To minimize confounding bias, we additionally controlled relative humidity and health care utilizations that were not included previously. Further, we performed low-level analysis by restricting to person-days with exposure levels below increasingly stringent thresholds. We found consistent results between LPM/WLS and moonRF: all exposures were positively associated with mortality rate, even at low levels. For long-term PM2.5 and O3, the effect estimates became larger at lower levels. Long-term exposure to PM2.5 posed the highest risk: 1 μg/m3 increase in long-term PM2.5 was associated with 1053 (95% confidence interval [CI]: 984, 1122; based on LPM/WLS methods) or 1058 (95% CI: 988, 1127; based on moonRF method) early deaths each year among the Medicare population in Massachusetts. This study provides more rigorous causal evidence between PM2.5, O3, and NO2 exposures and mortality, even at low levels. The largest effect estimate for long-term PM2.5 suggests that reducing PM2.5 could gain the most substantial benefits. The consistency between LPM/WLS and moonRF suggests that there were not many interactions and high-order nonlinearities. In the big data context, the proposed methods will be useful for future scientific work in estimating causality on an additive scale.
Tài liệu tham khảo
Dockery DW, Pope CA, Xu X, Spengler JD, Ware JH, Fay ME, et al. An association between air pollution and mortality in six U.S. cities. N Engl J Med. 1993;329(24):1753–9.
Di Q, Dominici F, Schwartz JD. Air pollution and mortality in the Medicare population. N Engl J Med. 2017;377(15):1498–9.
Di Q, Dai L, Wang Y, Zanobetti A, Choirat C, Schwartz JD, et al. Association of Short-term Exposure to air pollution with mortality in older adults. Jama. 2017;318(24):2446–56.
Wei Y, Wang Y, Di Q, Choirat C, Wang Y, Koutrakis P, et al. Short term exposure to fine particulate matter and hospital admission risks and costs in the Medicare population: time stratified, case crossover study. BMJ. 2019;367:l6258.
Jerrett M, Burnett RT, Pope CA 3rd, Ito K, Thurston G, Krewski D, et al. Long-term ozone exposure and mortality. N Engl J Med. 2009;360(11):1085–95.
Faustini A, Rapp R, Forastiere F. Nitrogen dioxide and mortality: review and meta-analysis of long-term studies. Eur Respir J. 2014;44(3):744–53.
Lilienfeld DE. Definitions of epidemiology. Am J Epidemiol. 1978;107(2):87–90.
Wang Y, Lee M, Liu P, Shi L, Yu Z, Abu Awad Y, et al. Doubly Robust Additive Hazards Models to Estimate Effects of a Continuous Exposure on Survival. Epidemiology (Cambridge, Mass). 2017;28(6):771–9.
Schwartz J, Austin E, Bind MA, Zanobetti A, Koutrakis P. Estimating causal associations of fine particles with daily deaths in Boston. Am J Epidemiol. 2015;182(7):644–50.
Wang Y, Kloog I, Coull BA, Kosheleva A, Zanobetti A, Schwartz JD. Estimating causal effects of Long-term PM2.5 exposure on mortality in New Jersey. Environ Health Perspect. 2016;124(8):1182–8.
Schwartz J, Fong K, Zanobetti A. A National Multicity Analysis of the causal effect of local pollution, [formula: see text], and [formula: see text] on mortality. Environ Health Perspect. 2018;126(8):87004.
Agency USEP. National Ambient Air Quality Standards Table 2016 [Available from: https://www.epa.gov/criteria-air-pollutants/naaqs-table.
Dominici F, Peng RD, Barr CD, Bell ML. Protecting human health from air pollution: shifting from a single-pollutant to a multipollutant approach. Epidemiology (Cambridge, Mass). 2010;21(2):187–94.
Thakrar SK, Balasubramanian S, Adams PJ, IsML A, Muller NZ, Pandis SN, et al. Reducing mortality from air pollution in the United States by targeting specific emission sources. Environmental Science and Technology Letters. 2020;7(9):639–45.
Krzyzanowski M, Cohen A, Anderson R, Group WHOW. Quantification of health effects of exposure to air pollution. Occup Environ Med. 2002;59(12):791–3.
Noordzij M, van Diepen M, Caskey FC, Jager KJ. Relative risk versus absolute risk: one cannot be interpreted without the other. Nephrol Dial Transplant. 2017;32(suppl_2):ii13–i8.
Owens EO, Patel MM, Kirrane E, Long TC, Brown J, Cote I, et al. Framework for assessing causality of air pollution-related health effects for reviews of the National Ambient air Quality Standards. Regul Toxicol Pharmacol. 2017;88:332–7.
De Gonzalez A. Cox D. Interpretation of interaction: A review. 2007;1(2):371–85.
Wei Y, Wang Y, Wu X, Di Q, Shi L, Koutrakis P, et al. Causal effects of air pollution on mortality in Massachusetts. Am J Epidemiol. 2020.
Fleming TR, Harrington DP. Counting Processes and Survival Analysis. 2nd ed: Wiley; 2013.
Di Q, Amini H, Shi L, Kloog I, Silvern R, Kelly J, et al. An ensemble-based model of PM2.5 concentration across the contiguous United States with high spatiotemporal resolution. Environ Int. 2019;130:104909.
Requia WJ, Di Q, Silvern R, Kelly JT, Koutrakis P, Mickley LJ, et al. An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States. Environmental science & technology. 2020.
Di Q, Amini H, Shi L, Kloog I, Silvern RF, Kelly JT, et al. Assessing NO2 concentration and model uncertainty with high spatiotemporal resolution across the contiguous United States using ensemble model averaging. Environmental science & technology. 2019;54(3):1372–84.
Institute ESR. Esri Data & Maps 10. Redlands: An Esri White Paper; 2010.
VanderWeele TJ. Principles of confounder selection. Eur J Epidemiol. 2019;34(3):211–9.
Kalnay E, Kanamitsu M, Kistler R, Collins W, Deaven D, Gandin L, et al. The NCEP/NCAR 40-year reanalysis project. Bull Am Meteorol Soc. 1996;77(3):437–72.
Mapper U. ZIP code to ZCTA crosswalk; 2014.
CDC. Behavioral Risk Factor Surveillance System Survey Questionnaire. Atlanta, Georgia: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention; 2004.
Cronenwett JL, Birkmeyer JD. The Dartmouth atlas of vascular health care. Cardiovasc Surg. 2000;8(6):409–10.
Barreca AI. Climate change, humidity, and mortality in the United States. J Environ Econ Manage. 2012;63(1):19–34.
Lee KC, Sturgeon D, Lipsitz S, Weissman JS, Mitchell S, Cooper Z. Mortality and health care utilization among Medicare patients undergoing emergency general surgery vs those with acute medical conditions. JAMA Surg. 2020;155(3):216–23.
Zhao S, van Dyk DA, Imai K. Propensity score-based methods for causal inference in observational studies with non-binary treatments. Stat Methods Med Res. 2020;29(3):709–27.
Rubin DB. For objective causal inference, design trumps analysis. Ann Appl Stat. 2008;2(3):808–40.
Hirano K, Imbens GW. The Propensity Score with Continuous Treatments. In: Gelman A, Meng XL, editors. Applied Bayesian Modeling and Causal Inference from Incomplete-Data. Hoboken, NJ: John Wiley & Sons, Ltd; 2004. p. 73–84.
Vellaisamy P, Vijay V. Collapsibility of regression coefficients and its extensions. Journal of Statistical Planning and Inference. 2008;138(4):982–94.
Robins JM, Hernan MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology (Cambridge, Mass). 2000;11(5):550–60.
Battey HS, Cox DR, Jackson MV. On the linear in probability model for binary data. R Soc Open Sci. 2019;6(5):190067.
Genuera R, Poggib J-M, Malotc C, Vialaneix NV. Random Forests for Big Data. Big Data Research. 2017;9:28–46.
Bickel PJ, Bickel F. Zwet WRv. Resampling fewer than n. observations: gains, losses, and remedies for losses. Stat Sin. 1997;7(1):1–31.
Dudık M, Langford J, Li L. Doubly robust policy evaluation and learning. The international conference on machine learning. International Machine Learning Society: Bellevue, Washington; 2011.
Team RCD. R: A language and environment for statistical computing. 3.5.1 ed: R Fundation for Statistical Computing; 2010.
Wright M, Wager S, Probst P. Package ‘ranger’: A Fast Implementation of Random Forests. 0.12.1 ed: CRAN; 2020.
Lumley T. Package ‘biglm’: bounded memory linear and generalized linear models. 0.9.1 ed: CRAN; 2020.
Semple DR, Song F, Gao Y. Seasonal characteristics of ambient nitrogen oxides and ground–level ozone in metropolitan northeastern New Jersey. Atmospheric Pollution Research. 2012;3(2):247–57.
Weisskopf MG, Webster TF. Trade-offs of Personal Versus More Proxy Exposure Measures in Environmental Epidemiology. Epidemiology (Cambridge, Mass). 2017;28(5):635–43.
Butland BK, Samoli E, Atkinson RW, Barratt B, Katsouyanni K. Measurement error in a multi-level analysis of air pollution and health: a simulation study. Environmental health : a global access science source. 2019;18(1):13.