Time-to-Injury Forecasting in Elite Female Football: A DeepHit Survival Approach

๐Ÿ“… 2026-01-27
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Current approaches to predicting soccer injuries often rely on static preseason data and binary outcomes, limiting their capacity for dynamic, individualized risk assessment. This study addresses this gap by applying the DeepHit deep survival modelโ€”used here for the first timeโ€”to predict time-to-injury absence among elite female soccer players. Leveraging longitudinal monitoring data, the model provides time-varying risk estimates and incorporates SHAP values to enhance clinical interpretability. Employing a multilayer perceptron architecture, multiple missing-data imputation strategies, and temporal cross-validation, the approach achieves a C-index of 0.762 on the SoccerMon dataset, outperforming baseline methods such as Random Forest and XGBoost. The analysis also identifies key risk factors consistent with established clinical knowledge.

Technology Category

Application Category

๐Ÿ“ Abstract
Injury occurrence in football poses significant challenges for athletes and teams, carrying personal, competitive, and financial consequences. While machine learning has been applied to injury prediction before, existing approaches often rely on static pre-season data and binary outcomes, limiting their real-world utility. This study investigates the feasibility of using a DeepHit neural network to forecast time-to-injury from longitudinal athlete monitoring data, while providing interpretable predictions. The analysis utilised the publicly available SoccerMon dataset, containing two seasons of training, match, and wellness records from elite female footballers. Data was pre-processed through cleaning, feature engineering, and the application of three imputation strategies. Baseline models (Random Forest, XGBoost, Logistic Regression) were optimised via grid search for benchmarking, while the DeepHit model, implemented with a multilayer perceptron backbone, was evaluated using chronological and leave-one-player-out (LOPO) validation. DeepHit achieved a concordance index of 0.762, outperforming baseline models and delivering individualised, time-varying risk estimates. Shapley Additive Explanations (SHAP) identified clinically relevant predictors consistent with established risk factors, enhancing interpretability. Overall, this study provides a novel proof of concept: survival modelling with DeepHit shows strong potential to advance injury forecasting in football, offering accurate, explainable, and actionable insights for injury prevention across competitive levels.
Problem

Research questions and friction points this paper is trying to address.

injury prediction
time-to-injury
longitudinal data
survival modelling
elite female football
Innovation

Methods, ideas, or system contributions that make the work stand out.

DeepHit
time-to-injury forecasting
survival analysis
interpretable machine learning
longitudinal athlete monitoring
๐Ÿ”Ž Similar Papers
No similar papers found.
V
Victoria Catterall
Loughborough University, Epinal Way, Loughborough, UK
Cise Midoglu
Cise Midoglu
Forzasys AS
S
Stephen Lynch
Loughborough University, Epinal Way, Loughborough, UK