A data science approach for early-stage prediction of Patient's susceptibility to acute side effects of advanced radiotherapy
Authors
Aldraimli, M.Soria, D.
Grishchuck, D.
Ingram, Samuel
Lyon, R.
Mistry, A.
Oliveira, J.
Samuel, R.
Shelley, L. E. A.
Osman, S.
Dwek, M. V.
Azria, D.
Chang-Claude, J.
Gutiérrez-Enríquez, S.
De Santis, M. C.
Rosenstein, B. S.
De Ruysscher, D
Sperk, E.
Symonds, R. P.
Stobart, H.
Vega, A.
Veldeman, L.
Webb, A.
Talbot, C. J.
West, Catharine M L
Rattay, T
Chaussalet, T. J.
Affiliation
The Health Innovation Ecosystem, University of Westminster, London, UKIssue Date
2021
Metadata
Show full item recordAbstract
The prediction by classification of side effects incidence in a given medical treatment is a common challenge in medical research. Machine Learning (ML) methods are widely used in the areas of risk prediction and classification. The primary objective of such algorithms is to use several features to predict dichotomous responses (e.g., disease positive/negative). Similar to statistical inference modelling, ML modelling is subject to the class imbalance problem and is affected by the majority class, increasing the false-negative rate. In this study, seventy-nine ML models were built and evaluated to classify approximately 2000 participants from 26 hospitals in eight different countries into two groups of radiotherapy (RT) side effects incidence based on recorded observations from the international study of RT related toxicity "REQUITE". We also examined the effect of sampling techniques and cost-sensitive learning methods on the models when dealing with class imbalance. The combinations of such techniques used had a significant impact on the classification. They resulted in an improvement in incidence status prediction by shifting classifiers' attention to the minority group. The best classification model for RT acute toxicity prediction was identified based on domain experts' success criteria. The Area Under Receiver Operator Characteristic curve of the models tested with an isolated dataset ranged from 0.50 to 0.77. The scale of improved results is promising and will guide further development of models to predict RT acute toxicities. One model was optimised and found to be beneficial to identify patients who are at risk of developing acute RT early-stage toxicities as a result of undergoing breast RT ensuring relevant treatment interventions can be appropriately targeted. The design of the approach presented in this paper resulted in producing a preclinical-valid prediction model. The study was developed by a multi-disciplinary collaboration of data scientists, medical physicists, oncologists and surgeons in the UK Radiotherapy Machine Learning Network.Citation
Aldraimli M, Soria D, Grishchuck D, Ingram S, Lyon R, Mistry A, et al. A data science approach for early-stage prediction of Patient’s susceptibility to acute side effects of advanced radiotherapy. Computers in Biology and Medicine. 2021 Aug;135:104624.Journal
Computers in Biology and MedicineDOI
10.1016/j.compbiomed.2021.104624PubMed ID
34247131Additional Links
https://dx.doi.org/10.1016/j.compbiomed.2021.104624Type
ArticleLanguage
enae974a485f413a2113503eed53cd6c53
10.1016/j.compbiomed.2021.104624
Scopus Count
Collections
Related articles
- Development and Optimization of a Machine-Learning Prediction Model for Acute Desquamation After Breast Radiation Therapy in the Multicenter REQUITE Cohort.
- Authors: Aldraimli M, Osman S, Grishchuck D, Ingram S, Lyon R, Mistry A, Oliveira J, Samuel R, Shelley LEA, Soria D, Dwek MV, Aguado-Barrera ME, Azria D, Chang-Claude J, Dunning A, Giraldo A, Green S, Gutiérrez-Enríquez S, Herskind C, van Hulle H, Lambrecht M, Lozza L, Rancati T, Reyes V, Rosenstein BS, de Ruysscher D, de Santis MC, Seibold P, Sperk E, Symonds RP, Stobart H, Taboada-Valadares B, Talbot CJ, Vakaet VJL, Vega A, Veldeman L, Veldwijk MR, Webb A, Weltens C, West CM, Chaussalet TJ, Rattay T, REQUITE consortium
- Issue date: 2022 May-Jun
- External Validation of a Predictive Model for Acute Skin Radiation Toxicity in the REQUITE Breast Cohort.
- Authors: Rattay T, Seibold P, Aguado-Barrera ME, Altabas M, Azria D, Barnett GC, Bultijnck R, Chang-Claude J, Choudhury A, Coles CE, Dunning AM, Elliott RM, Farcy Jacquet MP, Gutiérrez-Enríquez S, Johnson K, Müller A, Post G, Rancati T, Reyes V, Rosenstein BS, De Ruysscher D, de Santis MC, Sperk E, Stobart H, Symonds RP, Taboada-Valladares B, Vega A, Veldeman L, Webb AJ, West CM, Valdagni R, Talbot CJ, REQUITE consortium
- Issue date: 2020
- Consultation length and no-show prediction for improving appointment scheduling efficiency at a cardiology clinic: A data analytics approach.
- Authors: Srinivas S, Salah H
- Issue date: 2021 Jan
- Early prediction of radiotherapy-induced parotid shrinkage and toxicity based on CT radiomics and fuzzy classification.
- Authors: Pota M, Scalco E, Sanguineti G, Farneti A, Cattaneo GM, Rizzo G, Esposito M
- Issue date: 2017 Sep
- Machine learning algorithms, bull genetic information, and imbalanced datasets used in abortion incidence prediction models for Iranian Holstein dairy cattle.
- Authors: Keshavarzi H, Sadeghi-Sefidmazgi A, Mirzaei A, Ravanifard R
- Issue date: 2020 Feb