Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer

Ali Sabbagh; Samuel L Washington; Derya Tilki; Julian C Hong; Jean Feng; Gilmer Valdes; Ming-Hui Chen; Jing Wu; Hartwig Huland; Markus Graefen; Thomas Wiegel; Dirk Böhmer; Janet E Cowan; Matthew Cooperberg; Felix Y Feng; Mack Roach; Bruce J Trock; Alan W Partin; Anthony V D'Amico; Peter R Carroll; Osama Mohamad

doi:10.1016/j.euo.2023.02.006

Standard

Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer. / Sabbagh, Ali; Washington, Samuel L; Tilki, Derya; Hong, Julian C; Feng, Jean; Valdes, Gilmer; Chen, Ming-Hui; Wu, Jing; Huland, Hartwig; Graefen, Markus; Wiegel, Thomas; Böhmer, Dirk; Cowan, Janet E; Cooperberg, Matthew; Feng, Felix Y; Roach, Mack; Trock, Bruce J; Partin, Alan W; D'Amico, Anthony V; Carroll, Peter R; Mohamad, Osama.

In: EUR UROL ONCOL, Vol. 6, No. 5, 10.2023, p. 501-507.

Research output: Contribution to journal › Journal article › Research › peer-review

Harvard

Sabbagh, A, Washington, SL, Tilki, D, Hong, JC, Feng, J, Valdes, G, Chen, M-H, Wu, J, Huland, H, Graefen, M, Wiegel, T, Böhmer, D, Cowan, JE, Cooperberg, M, Feng, FY, Roach, M, Trock, BJ, Partin, AW, D'Amico, AV, Carroll, PR & Mohamad, O 2023, 'Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer', EUR UROL ONCOL, vol. 6, no. 5, pp. 501-507. https://doi.org/10.1016/j.euo.2023.02.006

APA

Sabbagh, A., Washington, S. L., Tilki, D., Hong, J. C., Feng, J., Valdes, G., Chen, M-H., Wu, J., Huland, H., Graefen, M., Wiegel, T., Böhmer, D., Cowan, J. E., Cooperberg, M., Feng, F. Y., Roach, M., Trock, B. J., Partin, A. W., D'Amico, A. V., ... Mohamad, O. (2023). Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer. EUR UROL ONCOL, 6(5), 501-507. https://doi.org/10.1016/j.euo.2023.02.006

Vancouver

Sabbagh A, Washington SL, Tilki D, Hong JC, Feng J, Valdes G et al. Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer. EUR UROL ONCOL. 2023 Oct;6(5):501-507. https://doi.org/10.1016/j.euo.2023.02.006

Bibtex

@article{b9d6bff1c5e34c71b7b5a9dc7680711b,

title = "Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer",

abstract = "BACKGROUND: Pelvic lymph node dissection (PLND) is the gold standard for diagnosis of lymph node involvement (LNI) in patients with prostate cancer. The Roach formula, Memorial Sloan Kettering Cancer Center (MSKCC) calculator, and Briganti 2012 nomogram are elegant and simple traditional tools used to estimate the risk of LNI and select patients for PLND. OBJECTIVE: To determine whether machine learning (ML) can improve patient selection and outperform currently available tools for predicting LNI using similar readily available clinicopathologic variables. DESIGN, SETTING, AND PARTICIPANTS: Retrospective data for patients treated with surgery and PLND between 1990 and 2020 in two academic institutions were used. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: We trained three models (two logistic regression models and one gradient-boosted trees-based model [XGBoost]) on data provided from one institution (n = 20267) with age, prostate-specific antigen (PSA) levels, clinical T stage, percentage positive cores, and Gleason scores as inputs. We externally validated these models using data from another institution (n = 1322) and compared their performance to that of the traditional models using the area under the receiver operating characteristic curve (AUC), calibration, and decision curve analysis (DCA). RESULTS AND LIMITATIONS: LNI was present in 2563 patients (11.9%) overall, and in 119 patients (9%) in the validation data set. XGBoost had the best performance among all the models. On external validation, its AUC outperformed that of the Roach formula by 0.08 (95% confidence interval [CI] 0.042-0.12), the MSKCC nomogram by 0.05 (95% CI 0.016-0.070), and the Briganti nomogram by 0.03 (95% CI 0.0092-0.051; all p < 0.05). It also had better calibration and clinical utility in terms of net benefit on DCA across relevant clinical thresholds. The main limitation of the study is its retrospective design. CONCLUSIONS: Taking all measures of performance together, ML using standard clinicopathologic variables outperforms traditional tools in predicting LNI. PATIENT SUMMARY: Determining the risk of cancer spread to the lymph nodes in patients with prostate cancer allows surgeons to perform lymph node dissection only in patients who need it and avoid the side effects of the procedure in those who do not. In this study, we used machine learning to develop a new calculator to predict the risk of lymph node involvement that outperformed traditional tools currently used by oncologists.",

author = "Ali Sabbagh and Washington, {Samuel L} and Derya Tilki and Hong, {Julian C} and Jean Feng and Gilmer Valdes and Ming-Hui Chen and Jing Wu and Hartwig Huland and Markus Graefen and Thomas Wiegel and Dirk B{\"o}hmer and Cowan, {Janet E} and Matthew Cooperberg and Feng, {Felix Y} and Mack Roach and Trock, {Bruce J} and Partin, {Alan W} and D'Amico, {Anthony V} and Carroll, {Peter R} and Osama Mohamad",

year = "2023",

month = oct,

doi = "10.1016/j.euo.2023.02.006",

language = "English",

volume = "6",

pages = "501--507",

journal = "EUR UROL ONCOL",

issn = "2588-9311",

publisher = "Elsevier",

number = "5",

}

RIS

TY - JOUR

T1 - Development and External Validation of a Machine Learning Model for Prediction of Lymph Node Metastasis in Patients with Prostate Cancer

AU - Sabbagh, Ali

AU - Washington, Samuel L

AU - Tilki, Derya

AU - Hong, Julian C

AU - Feng, Jean

AU - Valdes, Gilmer

AU - Chen, Ming-Hui

AU - Wu, Jing

AU - Huland, Hartwig

AU - Graefen, Markus

AU - Wiegel, Thomas

AU - Böhmer, Dirk

AU - Cowan, Janet E

AU - Cooperberg, Matthew

AU - Feng, Felix Y

AU - Roach, Mack

AU - Trock, Bruce J

AU - Partin, Alan W

AU - D'Amico, Anthony V

AU - Carroll, Peter R

AU - Mohamad, Osama

PY - 2023/10

Y1 - 2023/10

N2 - BACKGROUND: Pelvic lymph node dissection (PLND) is the gold standard for diagnosis of lymph node involvement (LNI) in patients with prostate cancer. The Roach formula, Memorial Sloan Kettering Cancer Center (MSKCC) calculator, and Briganti 2012 nomogram are elegant and simple traditional tools used to estimate the risk of LNI and select patients for PLND. OBJECTIVE: To determine whether machine learning (ML) can improve patient selection and outperform currently available tools for predicting LNI using similar readily available clinicopathologic variables. DESIGN, SETTING, AND PARTICIPANTS: Retrospective data for patients treated with surgery and PLND between 1990 and 2020 in two academic institutions were used. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: We trained three models (two logistic regression models and one gradient-boosted trees-based model [XGBoost]) on data provided from one institution (n = 20267) with age, prostate-specific antigen (PSA) levels, clinical T stage, percentage positive cores, and Gleason scores as inputs. We externally validated these models using data from another institution (n = 1322) and compared their performance to that of the traditional models using the area under the receiver operating characteristic curve (AUC), calibration, and decision curve analysis (DCA). RESULTS AND LIMITATIONS: LNI was present in 2563 patients (11.9%) overall, and in 119 patients (9%) in the validation data set. XGBoost had the best performance among all the models. On external validation, its AUC outperformed that of the Roach formula by 0.08 (95% confidence interval [CI] 0.042-0.12), the MSKCC nomogram by 0.05 (95% CI 0.016-0.070), and the Briganti nomogram by 0.03 (95% CI 0.0092-0.051; all p < 0.05). It also had better calibration and clinical utility in terms of net benefit on DCA across relevant clinical thresholds. The main limitation of the study is its retrospective design. CONCLUSIONS: Taking all measures of performance together, ML using standard clinicopathologic variables outperforms traditional tools in predicting LNI. PATIENT SUMMARY: Determining the risk of cancer spread to the lymph nodes in patients with prostate cancer allows surgeons to perform lymph node dissection only in patients who need it and avoid the side effects of the procedure in those who do not. In this study, we used machine learning to develop a new calculator to predict the risk of lymph node involvement that outperformed traditional tools currently used by oncologists.

U2 - 10.1016/j.euo.2023.02.006

DO - 10.1016/j.euo.2023.02.006

M3 - Journal article

C2 - 36868922

VL - 6

SP - 501

EP - 507

JO - EUR UROL ONCOL

JF - EUR UROL ONCOL

SN - 2588-9311

IS - 5

ER -