Public health utility of cause of death data: applying empirical algorithms to improve data quality

Sarah Johnson1, Matthew Cunningham1, Ilse N. Dippenaar1, Fablina Sharara1, Eve Wool1, Kareha M Agesa1, Chieh Han1, Molly K. Miller-Petrie2, Shadrach Wilson1, John E Fuller1, Shelly Balassyano1, Gregory J Bertolacci1, Nicole Davis Weaver1, Jalal Arabloo, Alaa Badawi, Akshaya Srikanth Bhagavathula, Katrin Burkart, Luis Cámera, Félix Carvalho, Carlos Castañeda-Orjuela, Jee‐Young Jasmine Choi, Dinh‐Toi Chu, Xiaochen Dai, Mostafa Dianatinasab, Sophia Emmons-Bell, Eduarda Fernandes, Florian Fischer, Ahmad Ghashghaee, Mahaveer Golechha, Simon I Hay, Khezar Hayat, Nathaniel J. Henry, Ramesh Holla, Mowafa Househ, Segun Emmanuel Ibitoye, Maryam Keramati, Ejaz Ahmad Khan, Yun Jin Kim, Adnan Kısa, Hamidreza Komaki, Ai Koyanagi, Samantha Leigh Larson, Kate E LeGrand, Xuefeng Li, Azeem Majeed, Reza Malekzadeh, Bahram Mohajer, Abdollah Mohammadian-Hafshejani, Reza Mohammadpourhodki, Shafiu Mohammed, Farnam Mohebi, Ali H Mokdad, Mariam Molokhia, Lorenzo Monasta, Mohammad Ali Moni, Muhammad Naveed, Huong Lan Thi Nguyen, Andrew T Olagunju, Samuel M Ostroff, Fatemeh Pashazadeh Kan, Paula B. Andrade, Hai Quang Pham, Salman Rawaf, David Laith Rawaf, Andre Μ Ν Renzaho, Luca Ronfani, Abdallah M. Samy, Subramanian Senthilkumaran, Sadaf G Sepanlou, Masood Ali Shaikh, David Shaw, Kenji Shibuya, Jasvinder A Singh, Valentin Yurievich Skryabin, Anna Aleksandrovna Skryabina, Emma Elizabeth Spurlock, Eyayou Girma Tadesse, Mohamad-Hani Temsah, Marcos Roberto Tovani‐Palone, Bach Xuan Tran, Gebiyaw Wudie Tsegaye, Pascual Valdéz, Prashant M. Vishwanath, Giang Thu Vu, Yasir Waheed, Naohiro Yonemoto, Rafael Lozano, Alan D. López1,3,4, Christopher J L Murray4,1, Mohsen Naghavi1,4,5
1Institute for Health Metrics and Evaluation, University of Washington, Seattle, USA
2Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, USA
3Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Australia
4Department of Health Metrics Sciences, School of Medicine, University of Washington, Seattle, USA
5Department of Health Metrics Sciences, Director of Subnational Burden of Disease Estimation, Institute for Health Metrics and Evaluation School of Medicine, University of Washington, Seattle, USA

Tóm tắt

Abstract Background Accurate, comprehensive, cause-specific mortality estimates are crucial for informing public health decision making worldwide. Incorrectly or vaguely assigned deaths, defined as garbage-coded deaths, mask the true cause distribution. The Global Burden of Disease (GBD) study has developed methods to create comparable, timely, cause-specific mortality estimates; an impactful data processing method is the reallocation of garbage-coded deaths to a plausible underlying cause of death. We identify the pattern of garbage-coded deaths in the world and present the methods used to determine their redistribution to generate more plausible cause of death assignments. Methods We describe the methods developed for the GBD 2019 study and subsequent iterations to redistribute garbage-coded deaths in vital registration data to plausible underlying causes. These methods include analysis of multiple cause data, negative correlation, impairment, and proportional redistribution. We classify garbage codes into classes according to the level of specificity of the reported cause of death (CoD) and capture trends in the global pattern of proportion of garbage-coded deaths, disaggregated by these classes, and the relationship between this proportion and the Socio-Demographic Index. We examine the relative importance of the top four garbage codes by age and sex and demonstrate the impact of redistribution on the annual GBD CoD rankings. Results The proportion of least-specific (class 1 and 2) garbage-coded deaths ranged from 3.7% of all vital registration deaths to 67.3% in 2015, and the age-standardized proportion had an overall negative association with the Socio-Demographic Index. When broken down by age and sex, the category for unspecified lower respiratory infections was responsible for nearly 30% of garbage-coded deaths in those under 1 year of age for both sexes, representing the largest proportion of garbage codes for that age group. We show how the cause distribution by number of deaths changes before and after redistribution for four countries: Brazil, the United States, Japan, and France, highlighting the necessity of accounting for garbage-coded deaths in the GBD. Conclusions We provide a detailed description of redistribution methods developed for CoD data in the GBD; these methods represent an overall improvement in empiricism compared to past reliance on a priori knowledge.

Từ khóa


Tài liệu tham khảo

Alter GC, Carmichael AG. Classifying the dead: toward a history of the registration of causes of death. J Hist Med Allied Sci. 1999;54(2):114–32.

GBD 2019 Diseases, Injuries, and Impairments Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. in press

Principles and Recommendations for a Vital Statistics System Revision 2 [Internet]. United Nations; 2001 [cited 2020 May 29]. https://unstats.un.org/unsd/publication/SeriesM/SeriesM_19rev2E.pdf

Sibai AM. Mortality certification and cause-of-death reporting in developing countries. Bull World Health Organ. 2004;82(2):83.

Jha P. Reliable direct measurement of causes of death in low- and middle-income countries. BMC Med. 2014;12(1):19.

AbouZahr C, Boerma T. Health information systems: the foundations of public health. Bull World Health Organ. 2005;83(8):578–83.

Ruzicka LT, Lopez AD. The use of cause-of-death statistics for health situation assessment: national and international experiences. World Health Stat Q Rapp Trimest Stat Sanit Mond. 1990;43(4):249–58.

World Health Organization, editor. International statistical classification of diseases and related health problems. 10th revision, 2nd edition. Geneva: World Health Organization; 2004

Barber JB. Improving accuracy of death certificates. J Natl Med Assoc. 1992;84(12):1007–8.

Campos-Outcalt D. Cause-of-death certification: not as easy as it seems. J Fam Pract. 2005;54(2):134–9.

Lakkireddy DR, Basarakodu KR, Vacek JL, Kondur AK, Ramachandruni SK, Esterbrooks DJ, et al. Improving death certificate completion: a trial of two training interventions. J Gen Intern Med. 2007;22(4):544–8.

Naghavi M, Makela S, Foreman K, O’Brien J, Pourmalek F, Lozano R. Algorithms for enhancing public health utility of national causes-of-death data. Popul Health Metr. 2010;8(1):9.

Mathers CD, Fat DM, Inoue M, Rao C, Lopez AD. Counting the dead and what they died from: an assessment of the global status of cause of death data. Bull World Health Organ. 2005;83:171–7.

Rudd K, Johnson S, Agesa K, Shackelford K, Tsoi D, Kievlan D, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017. The Lancet. 2020;395(10219):200–11.

Hernández B, Ramírez-Villalobos D, Romero M, Gómez S, Atkinson C, Lozano R. Assessing quality of medical death certification: Concordance between gold standard diagnosis and underlying cause of death in selected Mexican hospitals. Popul Health Metr. 2011;9(1):38.

Rao C, Lopez AD, Yang G, Begg S, Ma J. Evaluating national cause-of-death statistics: principles and application to the case of China. Bull World Health Organ. 2005;83(8):618–25.

Lu TH, Lee MC, Chou MC. Accuracy of cause-of-death coding in Taiwan: types of miscoding and effects on mortality statistics. Int J Epidemiol. 2000;29(2):336–43.

de Lima RB, Frederes A, Marinho MF, da Cunha CC, Adair T, França EB. Investigation of garbage code deaths to improve the quality of cause-of-death in Brazil: results from a pilot study. Rev Bras Epidemiol. 2019;22:e19004.supl.3.

Ellingsen CL, Ebbing M, Alfsen GC, Vollset SE. Injury death certificates without specification of the circumstances leading to the fatal injury—the Norwegian Cause of Death Registry 2005–2014. Popul Health Metr. 2018;16(1):20.

Metcalf P, Meyer M, Suchindran C, Heiss G. Assessment of a regression method to reclassify deaths attributable to heart failure. Glob J Health Sci. 2016;9(3):p13.

Danilova I, Shkolnikov VM, Jdanov DA, Meslé F, Vallin J. Identifying potential differences in cause-of-death coding practices across Russian regions. Popul Health Metr. 2016;14(1):8.

Qaddumi JAS, Nazzal Z, Yacoub A, Mansour M. Physicians’ knowledge and practice on death certification in the North West Bank, Palestine: across sectional study. BMC Health Serv Res. 2018;18:8.

Madadin M, Alhumam AS, Bushulaybi NA, Alotaibi AR, Aldakhil HA, Alghamdi AY, et al. Common errors in writing the cause of death certificate in the Middle East. J Forensic Leg Med. 2019;68:101864.

Teixeira RA, Naghavi M, Guimarães MDC, Ishitani LH, França EB, Teixeira RA, et al. Quality of cause-of-death data in Brazil: garbage codes among registered deaths in 2000 and 2015. Rev Bras Epidemiol. 2019;22Suppl:e19002.supl.3.

GBD 2019 Risk Factors Collaborators. The unfulfilled promise of prevention: the global burden of 87 risk factors, 1990–2019; a systematic analysis for the Global Burden of Disease Study 2019. Lancet Press.

GBD 2019 Demographics Collaborators. Global, regional, and national age-sex-specific fertility, mortality, and population estimates, 1950–2019: a comprehensive demographic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;in press.

Phillips DE, Lozano R, Naghavi M, Atkinson C, Gonzalez-Medina D, Mikkelsen L, et al. A composite metric for assessing data on mortality and causes of death: the vital statistics performance index. Popul Health Metr. 2014;12:14.

Stevens GA, Alkema L, Black RE, Boerma JT, Collins GS, Ezzati M, et al. Guidelines for accurate and transparent health estimates reporting: the GATHER statement. PLOS Med. 2016;13(6):e1002056.

R Core Team. R: A language and environment for statistical computing. [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2019. http://www.R-project.org/

Python Software Foundation. Python Language Reference, version 3.0. [Internet]. Python.org. [cited 2021 Mar 3]. https://www.python.org/

WHO|Exposing misclassified HIV/AIDS deaths in South Africa [Internet]. WHO. World Health Organization; 2020 [cited 2020 May 29]. http://www.who.int/bulletin/volumes/89/4/11-086280/en/

Groenewald P, Nannan N, Bourne D, Laubscher R, Bradshaw D. Identifying deaths from AIDS in South Africa. AIDS. 2005;19(2):193–201.

GBD 2017 Causes of Death Collaborators. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017—The Lancet. Lancet. 2018;392(10159):1736–88.

Naghavi M, Abajobir AA, Abbafati C, Abbas KM, Abd-Allah F, Abera SF, et al. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390(10100):1151–210.

Naghavi M, Richards N, Chowdhury H, Eynstone-Hinkins J, Franca E, Hegnauer M, et al. Improving the quality of cause of death data for public health policy: are all ‘garbage’ codes equally problematic? BMC Med. 2020;18(1):55.

Global, regional, and national age–sex specific all-cause and cause-specific mortality for 240 causes of death, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. The Lancet. 2015;385(9963):117–71.

Kircher T, Anderson RE. Cause of death. Proper completion of the death certificate. JAMA. 1987;258(3):349–52.

Puffer RR. New approaches for epidemiologic studies of mortality statistics. Bull Pan Am Health Organ. 1989;23(4):365–83.

Foreman KJ, Naghavi M, Ezzati M. Improving the usefulness of US mortality data: new methods for reclassification of underlying cause of death. Popul Health Metr. 2016;14(1):14.

Snyder ML, Love S-A, Sorlie PD, Rosamond WD, Antini C, Metcalf PA, et al. Redistribution of heart failure as the cause of death: the Atherosclerosis Risk in Communities Study. Popul Health Metr. 2014;12(1):10.

Stevens GA, King G, Shibuya K. Deaths from heart failure: using coarsened exact matching to correct cause-of-death statistics. Popul Health Metr. 2010;8(1):6.

Murray CJL, Dias RH, Kulkarni SC, Lozano R, Stevens GA, Ezzati M. Improving the comparability of diabetes mortality statistics in the U.S. and Mexico. Diabetes Care. 2008;31(3):451–8.

Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B Methodol. 1996;58(1):267–88.

Friedman JH, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1–22.

Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.

Fullman N, Yearwood J, Abay SM, Abbafati C, Abd-Allah F, Abdela J, et al. Measuring performance on the Healthcare Access and Quality Index for 195 countries and territories and selected subnational locations: a systematic analysis from the Global Burden of Disease Study 2016. The Lancet. 2018;391(10136):2236–71.

Injury Data and Resources - ICD Injury Matrices [Internet]. 2019 [cited 2020 Jun 18]. https://www.cdc.gov/nchs/injury/injury_matrices.htm

World Health Organization. Injury surveillance guidelines. 2001;(WHO/NMH/VIP/01.02). https://apps.who.int/iris/handle/10665/42451

ICD-10 Version:2019 [Internet]. 2020 [cited 2020 Jun 18]. https://icd.who.int/browse10/2019/en#/Y34

Ahern RM, Lozano R, Naghavi M, Foreman K, Gakidou E, Murray CJ. Improving the public health utility of global cardiovascular mortality data: the rise of ischemic heart disease. Popul Health Metr. 2011;9(1):8.

Suthar AB, Khalifa A, Yin S, Wenz K, Fat DM, Mills SL, et al. Evaluation of approaches to strengthen civil registration and vital statistics systems: A systematic review and synthesis of policies in 25 countries. PLOS Med. 2019;16(9):e1002929.

Hart JD, Sorchik R, Bo KS, Chowdhury HR, Gamage S, Joshi R, et al. Improving medical certification of cause of death: effective strategies and approaches based on experiences from the Data for Health Initiative. BMC Med. 2020;18(1):74.

WHO|Stroke: a global response is needed [Internet]. WHO. World Health Organization; 2020 [cited 2020 Jun 29]. http://www.who.int/bulletin/volumes/94/9/16-181636/en/

Mikkelsen L, Moesgaard K, Hegnauer M, Lopez AD. ANACONDA: a new tool to improve mortality and cause of death data. BMC Med. 2020;18(1):61.