Development and validation of personalised risk prediction models for early detection and diagnosis of primary liver cancer among the English primary care population using the QResearch® database: research protocol and statistical analysis plan
Tóm tắt
The incidence and mortality of liver cancer have been increasing in the UK in recent years. However, liver cancer is still under-studied. The Early Detection of Hepatocellular Liver Cancer (DeLIVER-QResearch) project aims to address the research gap and generate new knowledge to improve early detection and diagnosis of primary liver cancer from general practice and at the population level. There are three research objectives: (1) to understand the current epidemiology of primary liver cancer in England, (2) to identify and quantify the symptoms and comorbidities associated with liver cancer, and (3) to develop and validate prediction models for early detection of liver cancer suitable for implementation in clinical settings. This population-based study uses the QResearch® database (version 46) and includes adult patients aged 25–84 years old and without a diagnosis of liver cancer at the cohort entry (study period: 1 January 2008–30 June 2021). The team conducted a literature review (with additional clinical input) to inform the inclusion of variables for data extraction from the QResearch database. A wide range of statistical techniques will be used for the three research objectives, including descriptive statistics, multiple imputation for missing data, conditional logistic regression to investigate the association between the clinical features (symptoms and comorbidities) and the outcome, fractional polynomial terms to explore the non-linear relationship between continuous variables and the outcome, and Cox/competing risk regression for the prediction model. We have a specific focus on the 1-year, 5-year, and 10-year absolute risks of developing liver cancer, as risks at different time points have different clinical implications. The internal–external cross-validation approach will be used, and the discrimination and calibration of the prediction model will be evaluated. The DeLIVER-QResearch project uses large-scale representative population-based data to address the most relevant research questions for early detection and diagnosis of primary liver cancer in England. This project has great potential to inform the national cancer strategic plan and yield substantial public and societal benefits.
Tài liệu tham khảo
Cancer Research UK. Liver cancer statistics. 2021. Available from: https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/liver-cancer.
Cancer Research UK. Liver cancer deaths climb by around 50% in the last decade. 2019 6 April 2021]; Available from: https://www.cancerresearchuk.org/about-us/cancer-news/press-release/2019-11-01-liver-cancer-deaths-climb-by-around-50-in-the-last-decade.
Burton A, et al. Primary liver cancer in the UK: Incidence, incidence-based mortality, and survival by subtype, sex, and nation. JHEP Rep. 2021;3(2):100232.
Richards MA. The size of the prize for earlier diagnosis of cancer in England. Br J Cancer. 2009;101(Suppl 2):S125–9.
Hiom SC. Diagnosing cancer earlier: reviewing the evidence for improving cancer survival. Br J Cancer. 2015;112(Suppl 1):S1-5.
World Health Organization. Promoting Cancer Early Diagnosis. 2022. Available from: https://www.who.int/activities/promoting-cancer-early-diagnosis.
Yang JD, Heimbach JK. New advances in the diagnosis and management of hepatocellular carcinoma. BMJ. 2020;371:m3544.
Hippisley-Cox J, Coupland C. Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study. BMJ Open. 2015;5(3):e007825.
Hippisley-Cox J, Coupland C. Symptoms and risk factors to identify men with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013;63(606):e1-10.
Hippisley-Cox J, Coupland C. Symptoms and risk factors to identify women with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013;63(606):e11-21.
NHS. NHS Cancer Services for Teenagers & Young Adults. 2015. Available from: https://www.england.nhs.uk/commissioning/wp-content/uploads/sites/12/2015/12/nhs-canc-serv-tya.pdf.
Davis GL, et al. Hepatocellular carcinoma: management of an increasingly common problem. Proc (Bayl Univ Med Cent). 2008;21(3):266–80.
Richardson DB. An incidence density sampling program for nested case-control analyses. Occup Environ Med. 2004;61(12):e59–e59.
Schafer J, Graham J. Missing data: our view of the state of the art. Psychol Methods. 2002;7:147–77.
Group, T.A.M. Academic Medicine: problems and solutions. BMJ. 1989;298:573–9.
Steyerberg EW, van Veen M. Imputation is beneficial for handling missing data in predictive models. J Epidemiol Community Health. 2007;60:979.
Moons KGM, et al. Using the outcome for imputation of missing predictor values was preferred. J Epidemiol Community Health. 2006;59:1092.
Moons KG, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1-73.
Rubin DB. Multiple imputation for non-response in surveys. New York: John Wiley; 1987.
von Elm E, et al. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet. 2007;370(9596):1453–7.
Riley RD, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368:m441.
Cancer Research UK. Liver cancer incidence. 2021. Available from: https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/liver-cancer/incidence.
Royston P, Altman DG. Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling. Appl Stat. 1994;43(3):429–67. https://doi.org/10.2307/2986270. https://www.jstor.org/stable/2986270?origin=crossref#metadata_info_tab_contents.
Royston P, Ambler G, Sauerbrei W. The use of fractional polynomials to model continuous risk variables in epidemiology. Int J Epidemiol. 1999;28(5):964–74.
Hippisley-Cox J, et al. Predicting risk of type 2 diabetes in England and Wales: prospective derivation and validation of QDScore. BMJ. 2009;338:b880-.
Hippisley-Cox J, Coupland C. Predicting risk of osteoporotic fracture in men and women in England and Wales: prospective derivation and validation of QFractureScores. BMJ. 2009;339:b4229.
Hippisley-Cox J, et al. Performance of the QRISK cardiovascular risk prediction algorithm in an independent UK sample of patients from general practice: a validation study. Heart. 2008;94:34–9.
Hippisley-Cox J, Coupland C, Brindle P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. BMJ. 2017;357:j2099.
Hippisley-Cox J, Coupland C. Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study. BMJ. 2017;359:j5019.
Steyerberg EW, Harrell FE Jr. Prediction models need appropriate internal, internal-external, and external validation. J Clin Epidemiol. 2016;69:245–7.
Takada T, et al. Internal-external cross-validation helped to evaluate the generalizability of prediction models in large clustered datasets. J Clin Epidemiol. 2021;137:83–91.
Steyerberg EW. Validation in prediction research: the waste by data splitting. J Clin Epidemiol. 2018;103:131–3.
Hosmer D, Lemeshow S. Applied Logistic Regressopm. New York: John Wiley & Sons Inc.; 1989.
Ma X, et al. Risk prediction models for hepatocellular carcinoma in different populations. Chin J Cancer Res. 2016;28(2):150–60.
Sauerbrei W, et al. State of the art in selection of variables and functional forms in multivariable analysis-outstanding issues. Diagn Progn Res. 2020;4:3.
Hippisley-Cox J, Coupland C, Brindle P. The performance of seven QPrediction risk scores in an independent external sample of patients from general practice: a validation study. BMJ Open. 2014;4(8):e005809.
Royston P. Explained variation for survival models. Stata J. 2006;6:1–14.
Royston P, Sauerbrei W. A new measure of prognostic separation in survival data. Stat Med. 2004;23:723–48.
Brier GW. Verification of forecasts expressed in terms of probability. Mon Weather Rev. 1950;78:1–3.
Harrell F, Lee K, Mark D. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15:361–87.
Wolbers M, et al. Prognostic models with competing risks: methods and application to coronary risk prediction. Epidemiology. 2009;20(4):555–61.
Kamarudin AN, Cox T, Kolamunnage-Dona R. Time-dependent ROC curve analysis in medical research: current methods and applications. BMC Med Res Methodol. 2017;17(1):53.
Newson RB. Comparing the predictive powers of survival models using Harrell’s C or Somers’ D. Stata J. 2010;10(3):339–58.
Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565–74.
The National Institute for Health and Care Excellence (NICE). Suspected cancer: recognition and referral. NICE guideline [NG12]. 2020 23 Feb 2021]; Available from: https://www.nice.org.uk/guidance/ng12.
European Association for the Study of the Liver. Electronic address, e.e.e. and L. European Association for the Study of the, EASL Clinical Practice Guidelines: Management of hepatocellular carcinoma. J Hepatol. 2018;69(1):182–236.
Collins GS, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): The TRIPOD StatementThe TRIPOD Statement. Ann Intern Med. 2015;162(1):55–63.
Collins GS, Altman DG. An independent and external validation of QRISK2 cardiovascular disease risk score: a prospective open cohort study. BMJ. 2010;340:c2442.
Collins GS, Altman DG. External validation of QDSCORE((R)) for predicting the 10-year risk of developing Type 2 diabetes. Diabet Med. 2011;28(5):599–607.
Collins GS, Mallett S, Altman DG. Predicting risk of osteoporotic and hip fracture in the United Kingdom: prospective independent and external validation of QFractureScores. BMJ. 2011;342:d3651.
NHS England. NHS long term plan ambitions for cancer. 2021. Available from: https://www.england.nhs.uk/cancer/strategy/.