Machine Learning for Understanding and Predicting Injuries in Football
Tóm tắt
Attempts to better understand the relationship between training and competition load and injury in football are essential for helping to understand adaptation to training programmes, assessing fatigue and recovery, and minimising the risk of injury and illness. To this end, technological advancements have enabled the collection of multiple points of data for use in analysis and injury prediction. The full breadth of available data has, however, only recently begun to be explored using suitable statistical methods. Advances in automatic and interactive data analysis with the help of machine learning are now being used to better establish the intricacies of the player load and injury relationship. In this article, we examine this recent research, describing the analyses and algorithms used, reporting the key findings, and comparing model fit. To date, the vast array of variables used in analysis as proxy indicators of player load, alongside differences in approach to key aspects of data treatment—such as response to data imbalance, model fitting, and a lack of multi-season data—limit a systematic evaluation of findings and the drawing of a unified conclusion. If, however, the limitations of current studies can be addressed, machine learning has much to offer the field and could in future provide solutions to the training load and injury paradox through enhanced and systematic analysis of athlete data.
Tài liệu tham khảo
De Silva V, Caine M, Skinner J, et al. Player tracking data analytics as a tool for physical performance management in football: a case study from Chelsea football club academy. Sports. 2018;6(4):130.
Rein R, Memmert D. Big data and tactical analysis in elite football: future challenges and opportunities for sports science. SpringerPlus. 2016. https://doi.org/10.1186/s40064-016-3108-2.
Anderson C, Sally D. The numbers game: why everything you know about football is wrong. Choice Rev Online; 2014.
Bourdon PC, Cardinale M, Murray A, et al. Monitoring athlete training loads: consensus statement. Int J Sports Physiol Perform. 2017;12:161–70.
Claudino JG, de Capanema DO, de Souza TV, et al. Current approaches to the use of artificial intelligence for injury risk assessment and performance prediction in team sports: a systematic review. Sports Med Open. 2019. https://doi.org/10.1186/s40798-019-0202-3.
Van Eetvelde H, Mendonça LD, Ley C, et al. Machine learning methods in sport injury prediction and prevention: a systematic review. J Exp Orthop. 2021;8(1):1–15.
Rossi A, Pappalardo L, Cintia P. A narrative review for a machine learning application in sports: an example based on injury forecasting in soccer. Sport. 2022;10:5.
Kalkhoven JT, Watsford ML, Coutts AJ, et al. Training load and injury: causal pathways and future directions. Sport Med. 2021. https://doi.org/10.1007/s40279-020-01413-6.
Halson SL. Monitoring training load to understand fatigue in athletes. Sports Med Springer. 2014;44:139–47.
Gabbett TJ. The training-injury prevention paradox: should athletes be training smarter and harder? Br J Sports Med. 2016;50(5):273–80.
Windt J, Gabbett TJ. How do training and competition workloads relate to injury? The workload - Injury aetiology model. Br J Sports Med. 2017;51(5):428–35.
Drew MK, Cook J, Finch CF. Sports-related workload and injury risk: simply knowing the risks will not prevent injuries: narrative review. Br J Sports Med. 2016;50(21):1306–8.
Soligard T, Schwellnus M, Alonso JM, et al. How much is too much? (Part 1). International Olympic Committee consensus statement on load in sport and risk of injury. Br J Sports Med. 2016;50(17):1030–41.
Calvert TW, Banister EW, Savage MV, et al. A systems model of training for athletic performance. Aust J Sport Med. 1976;6(2):94–102.
Buist I, Bredeweg SW, Lemmink KAPM, et al. The gronorun study: is a graded training program for novice runners effective in preventing running related injuries? Design of a randomized controlled trial. BMC Musculoskelet Disord. 2007. https://doi.org/10.1186/1471-2474-8-24.
Hulin BT, Gabbett TJ, Blanch P, et al. Spikes in acute workload are associated with increased injury risk in elite cricket fast bowlers. Br J Sports Med. 2014;48(8):708–12.
Impellizzeri FM, Woodcock S, Coutts AJ, et al. What role do chronic workloads play in the acute to chronic workload ratio? Time to dismiss ACWR and Its underlying theory. Sport Med. 2020. https://doi.org/10.1007/s40279-020-01378-6.
Foster C. Monitoring training in athletes with reference to overtraining syndrome. Med Sci Sports Exerc. 1998;30(7):1164–8.
Anderson L, Triplett-McBride T, Foster C, et al. Impact of training patterns on incidence of illness and injury during a women’s collegiate basketball season. J Strength Cond Res. 2003;17(4):734–8.
Brink MS, Visscher C, Arends S, et al. Monitoring stress and recovery: new insights for the prevention of injuries and illnesses in elite youth football players. Br J Sports Med. 2010;44(11):809–15.
Buchheit M, Racinais S, Bilsborough JC, et al. Monitoring fitness, fatigue and running performance during a pre-season training camp in elite football players. J Sci Med Sport. 2013;16(6):550–5.
Racinais S, Buchheit M, Bilsborough J, et al. Physiological and performance responses to a training camp in the heat in professional australian football players. Int J Sports Physiol Perform. 2014;9(4):598–603.
Hastie T, Tibshirani R, Friedman J. Springer Series in Statistics. The elements of statistical learning data mining, inference, and prediction; 2009.
Ruddy JD, Cormack SJ, Whiteley R, et al. Modeling the risk of team sport injuries: a narrative review of different statistical approaches. Front Physiol. 2019;10:1–16.
Belle V, Papantonis I. Principles and practice of explainable machine learning. 2020; Available from: https://arxiv.org/abs/2009.11698.
Loyola-Gonzalez O. Black-box vs. white-Box: understanding their advantages and weaknesses from a practical point of view. IEEE Access. 2019;7:154096–113.
Krawczyk B. Learning from imbalanced data: open challenges and future directions. Prog Artif Intell. 2016;5(4):221–32.
Leevy JL, Khoshgoftaar TM, Bauder RA, et al. A survey on addressing high-class imbalance in big data. J Big Data. 2018;5(1):1–30.
Saito T, Rehmsmeier M. The precision-recall plot is more informative than the roc plot when evaluating binary classifiers on imbalanced datasets. PLoS One. 2015;10(3):e0118432.
Kamiri J, Mariga G. Research Methods in Machine Learning: A Content Analysis. Int J Comput Inf Technol. 2021; 30:10(2).
Gibert K, Sanchez-Marre M, Izquierdo SJ. A survey on pre-processing techniques: relevant issues in the context of environmental data mining. AI Commun. 2016;29(6):627–63.
Kotsiantis SB, Kanellopoulos D, Pintelas PE. Data preprocessing for supervised leaning. Int J Comput Inf Engg. 2007; 1(12).
Rossi A, Pappalardo L, Cintia P, et al. Effective injury forecasting in football with GPS training data and machine learning. PLoS One. 2018;13(7):1–15.
He H, Bai Y, Garcia EA, Li S. ADASYN: adaptive synthetic sampling approach for imbalanced learning. In: Proceedings of the International Joint Conference on Neural Networks. 2008; pp. 1322–1328.
Naglah A, Khalifa F, Mahmoud A, et al. Athlete-customized injury prediction using training load statistical records and machine learning. IEEE Int Symp Signal Process Inf Technol (ISSPIT). 2018;2018:459–64.
López-Valenciano A, Ayala F, Puerta Jos M, et al. A preventive model for muscle Injuries: a novel approach based on learning algorithms. Med Sci Sports Exerc. 2018;50(5):915–27.
Ayala F, López-Valenciano A, Gámez Martín JA, et al. A preventive model for hamstring injuries in professional football: learning algorithms. Int J Sports Med. 2019;40(5):344–53.
Chawla NV, Bowyer KW, Hall LO, et al. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–57.
Rommers N, Rössler R, Verhagen E, et al. A machine learning approach to assess injury risk in elite youth football players. Med Sci Sport Exerc. 2020;52(8):1745–51.
Lundberg S, Lee S-I. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst. 2017; 4766–4775.
Oliver JL, Ayala F, De Ste Croix MBA, et al. Using machine learning to improve our understanding of injury risk and prediction in elite male youth football players. J Sci Med Sport. 2020;23(11):1044–8.
Vallance E, Sutton-Charani N, Imoussaten A, et al. Combining internal- and external-training-loads to predict non-contact injuries in football. Appl Sci. 2020;10(15):5261.
Venturelli M, Schena F, Zanolla L, et al. Injury risk factors in young football players detected by a multivariate survival model. J Sci Med Sport. 2011;14(4):293–8.
Kampakis S. Predictive modelling of football injuries. 2016; Available from: http://arxiv.org/abs/1609.07480.
Ribeiro MT, Singh S, Guestrin C. “Why should I trust you?” explaining the predictions of any classifier. arXiv.org. 2016; arXiv: 1602.04938.
Goldstein A, Kapelner A, Bleich J et al. Peeking inside the black box: visualizing statistical learning with plots of individual conditional expectation. arXiv.org. 2014; arXiv: arXiv:1309.6392
Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29(5):1189–232.