Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study
Standard
Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study. / Shouval, Roni; Labopin, Myriam; Bondi, Ori; Mishan-Shamay, Hila; Shimoni, Avichai; Ciceri, Fabio; Esteve, Jordi; Giebel, Sebastian; Gorin, Norbert C; Schmid, Christoph; Polge, Emmanuelle; Aljurf, Mahmoud; Kroger, Nicolaus; Craddock, Charles; Bacigalupo, Andrea; Cornelissen, Jan J; Baron, Frederic; Unger, Ron; Nagler, Arnon; Mohty, Mohamad.
In: J CLIN ONCOL, Vol. 33, No. 28, 01.10.2015, p. 3144-51.Research output: SCORING: Contribution to journal › SCORING: Journal article › Research › peer-review
Harvard
APA
Vancouver
Bibtex
}
RIS
TY - JOUR
T1 - Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study
AU - Shouval, Roni
AU - Labopin, Myriam
AU - Bondi, Ori
AU - Mishan-Shamay, Hila
AU - Shimoni, Avichai
AU - Ciceri, Fabio
AU - Esteve, Jordi
AU - Giebel, Sebastian
AU - Gorin, Norbert C
AU - Schmid, Christoph
AU - Polge, Emmanuelle
AU - Aljurf, Mahmoud
AU - Kroger, Nicolaus
AU - Craddock, Charles
AU - Bacigalupo, Andrea
AU - Cornelissen, Jan J
AU - Baron, Frederic
AU - Unger, Ron
AU - Nagler, Arnon
AU - Mohty, Mohamad
N1 - © 2015 by American Society of Clinical Oncology.
PY - 2015/10/1
Y1 - 2015/10/1
N2 - PURPOSE: Allogeneic hematopoietic stem-cell transplantation (HSCT) is potentially curative for acute leukemia (AL), but carries considerable risk. Machine learning algorithms, which are part of the data mining (DM) approach, may serve for transplantation-related mortality risk prediction.PATIENTS AND METHODS: This work is a retrospective DM study on a cohort of 28,236 adult HSCT recipients from the AL registry of the European Group for Blood and Marrow Transplantation. The primary objective was prediction of overall mortality (OM) at 100 days after HSCT. Secondary objectives were estimation of nonrelapse mortality, leukemia-free survival, and overall survival at 2 years. Donor, recipient, and procedural characteristics were analyzed. The alternating decision tree machine learning algorithm was applied for model development on 70% of the data set and validated on the remaining data.RESULTS: OM prevalence at day 100 was 13.9% (n=3,936). Of the 20 variables considered, 10 were selected by the model for OM prediction, and several interactions were discovered. By using a logistic transformation function, the crude score was transformed into individual probabilities for 100-day OM (range, 3% to 68%). The model's discrimination for the primary objective performed better than the European Group for Blood and Marrow Transplantation score (area under the receiver operating characteristics curve, 0.701 v 0.646; P<.001). Calibration was excellent. Scores assigned were also predictive of secondary objectives.CONCLUSION: The alternating decision tree model provides a robust tool for risk evaluation of patients with AL before HSCT, and is available online (http://bioinfo.lnx.biu.ac.il/∼bondi/web1.html). It is presented as a continuous probabilistic score for the prediction of day 100 OM, extending prediction to 2 years. The DM method has proved useful for clinical prediction in HSCT.
AB - PURPOSE: Allogeneic hematopoietic stem-cell transplantation (HSCT) is potentially curative for acute leukemia (AL), but carries considerable risk. Machine learning algorithms, which are part of the data mining (DM) approach, may serve for transplantation-related mortality risk prediction.PATIENTS AND METHODS: This work is a retrospective DM study on a cohort of 28,236 adult HSCT recipients from the AL registry of the European Group for Blood and Marrow Transplantation. The primary objective was prediction of overall mortality (OM) at 100 days after HSCT. Secondary objectives were estimation of nonrelapse mortality, leukemia-free survival, and overall survival at 2 years. Donor, recipient, and procedural characteristics were analyzed. The alternating decision tree machine learning algorithm was applied for model development on 70% of the data set and validated on the remaining data.RESULTS: OM prevalence at day 100 was 13.9% (n=3,936). Of the 20 variables considered, 10 were selected by the model for OM prediction, and several interactions were discovered. By using a logistic transformation function, the crude score was transformed into individual probabilities for 100-day OM (range, 3% to 68%). The model's discrimination for the primary objective performed better than the European Group for Blood and Marrow Transplantation score (area under the receiver operating characteristics curve, 0.701 v 0.646; P<.001). Calibration was excellent. Scores assigned were also predictive of secondary objectives.CONCLUSION: The alternating decision tree model provides a robust tool for risk evaluation of patients with AL before HSCT, and is available online (http://bioinfo.lnx.biu.ac.il/∼bondi/web1.html). It is presented as a continuous probabilistic score for the prediction of day 100 OM, extending prediction to 2 years. The DM method has proved useful for clinical prediction in HSCT.
KW - Adult
KW - Algorithms
KW - Area Under Curve
KW - Data Mining
KW - Decision Support Techniques
KW - Decision Trees
KW - Disease Progression
KW - Disease-Free Survival
KW - Europe
KW - Female
KW - Hematopoietic Stem Cell Transplantation
KW - Humans
KW - Kaplan-Meier Estimate
KW - Leukemia, Myeloid, Acute
KW - Logistic Models
KW - Machine Learning
KW - Male
KW - Middle Aged
KW - Postoperative Complications
KW - Precursor Cell Lymphoblastic Leukemia-Lymphoma
KW - Predictive Value of Tests
KW - ROC Curve
KW - Registries
KW - Reproducibility of Results
KW - Retrospective Studies
KW - Risk Assessment
KW - Risk Factors
KW - Time Factors
KW - Transplantation, Homologous
KW - Treatment Outcome
U2 - 10.1200/JCO.2014.59.1339
DO - 10.1200/JCO.2014.59.1339
M3 - SCORING: Journal article
C2 - 26240227
VL - 33
SP - 3144
EP - 3151
JO - J CLIN ONCOL
JF - J CLIN ONCOL
SN - 0732-183X
IS - 28
ER -