Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events

Standard

Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events. / Ojeda, Francisco M; Müller, Christian; Börnigen, Daniela; Trégouët, David-Alexandre; Schillert, Arne; Heinig, Matthias; Zeller, Tanja; Schnabel, Renate B.

In: GENOM PROTEOM BIOINF, Vol. 14, No. 4, 08.2016, p. 235-343.

Research output: SCORING: Contribution to journalSCORING: Journal articleResearchpeer-review

Harvard

APA

Vancouver

Ojeda FM, Müller C, Börnigen D, Trégouët D-A, Schillert A, Heinig M et al. Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events. GENOM PROTEOM BIOINF. 2016 Aug;14(4):235-343. https://doi.org/10.1016/j.gpb.2016.03.006

Bibtex

@article{91baecea00754b9091986e0e2cc67cad,
title = "Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events",
abstract = "Prognostic models based on survival data frequently make use of the Cox proportional hazards model. Developing reliable Cox models with few events relative to the number of predictors can be challenging, even in low-dimensional datasets, with a much larger number of observations than variables. In such a setting we examined the performance of methods used to estimate a Cox model, including (i) full model using all available predictors and estimated by standard techniques, (ii) backward elimination (BE), (iii) ridge regression, (iv) least absolute shrinkage and selection operator (lasso), and (v) elastic net. Based on a prospective cohort of patients with manifest coronary artery disease (CAD), we performed a simulation study to compare the predictive accuracy, calibration, and discrimination of these approaches. Candidate predictors for incident cardiovascular events we used included clinical variables, biomarkers, and a selection of genetic variants associated with CAD. The penalized methods, i.e., ridge, lasso, and elastic net, showed a comparable performance, in terms of predictive accuracy, calibration, and discrimination, and outperformed BE and the full model. Excessive shrinkage was observed in some cases for the penalized methods, mostly on the simulation scenarios having the lowest ratio of a number of events to the number of variables. We conclude that in similar settings, these three penalized methods can be used interchangeably. The full model and backward elimination are not recommended in rare event scenarios.",
keywords = "Biomarkers, Coronary Artery Disease, Genetic Variation, Humans, Prognosis, Proportional Hazards Models, Prospective Studies, Journal Article, Research Support, Non-U.S. Gov't",
author = "Ojeda, {Francisco M} and Christian M{\"u}ller and Daniela B{\"o}rnigen and David-Alexandre Tr{\'e}gou{\"e}t and Arne Schillert and Matthias Heinig and Tanja Zeller and Schnabel, {Renate B}",
note = "Copyright {\textcopyright} 2016 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.",
year = "2016",
month = aug,
doi = "10.1016/j.gpb.2016.03.006",
language = "English",
volume = "14",
pages = "235--343",
journal = "GENOM PROTEOM BIOINF",
issn = "1672-0229",
publisher = "Beijing Genomics Institute",
number = "4",

}

RIS

TY - JOUR

T1 - Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events

AU - Ojeda, Francisco M

AU - Müller, Christian

AU - Börnigen, Daniela

AU - Trégouët, David-Alexandre

AU - Schillert, Arne

AU - Heinig, Matthias

AU - Zeller, Tanja

AU - Schnabel, Renate B

N1 - Copyright © 2016 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

PY - 2016/8

Y1 - 2016/8

N2 - Prognostic models based on survival data frequently make use of the Cox proportional hazards model. Developing reliable Cox models with few events relative to the number of predictors can be challenging, even in low-dimensional datasets, with a much larger number of observations than variables. In such a setting we examined the performance of methods used to estimate a Cox model, including (i) full model using all available predictors and estimated by standard techniques, (ii) backward elimination (BE), (iii) ridge regression, (iv) least absolute shrinkage and selection operator (lasso), and (v) elastic net. Based on a prospective cohort of patients with manifest coronary artery disease (CAD), we performed a simulation study to compare the predictive accuracy, calibration, and discrimination of these approaches. Candidate predictors for incident cardiovascular events we used included clinical variables, biomarkers, and a selection of genetic variants associated with CAD. The penalized methods, i.e., ridge, lasso, and elastic net, showed a comparable performance, in terms of predictive accuracy, calibration, and discrimination, and outperformed BE and the full model. Excessive shrinkage was observed in some cases for the penalized methods, mostly on the simulation scenarios having the lowest ratio of a number of events to the number of variables. We conclude that in similar settings, these three penalized methods can be used interchangeably. The full model and backward elimination are not recommended in rare event scenarios.

AB - Prognostic models based on survival data frequently make use of the Cox proportional hazards model. Developing reliable Cox models with few events relative to the number of predictors can be challenging, even in low-dimensional datasets, with a much larger number of observations than variables. In such a setting we examined the performance of methods used to estimate a Cox model, including (i) full model using all available predictors and estimated by standard techniques, (ii) backward elimination (BE), (iii) ridge regression, (iv) least absolute shrinkage and selection operator (lasso), and (v) elastic net. Based on a prospective cohort of patients with manifest coronary artery disease (CAD), we performed a simulation study to compare the predictive accuracy, calibration, and discrimination of these approaches. Candidate predictors for incident cardiovascular events we used included clinical variables, biomarkers, and a selection of genetic variants associated with CAD. The penalized methods, i.e., ridge, lasso, and elastic net, showed a comparable performance, in terms of predictive accuracy, calibration, and discrimination, and outperformed BE and the full model. Excessive shrinkage was observed in some cases for the penalized methods, mostly on the simulation scenarios having the lowest ratio of a number of events to the number of variables. We conclude that in similar settings, these three penalized methods can be used interchangeably. The full model and backward elimination are not recommended in rare event scenarios.

KW - Biomarkers

KW - Coronary Artery Disease

KW - Genetic Variation

KW - Humans

KW - Prognosis

KW - Proportional Hazards Models

KW - Prospective Studies

KW - Journal Article

KW - Research Support, Non-U.S. Gov't

U2 - 10.1016/j.gpb.2016.03.006

DO - 10.1016/j.gpb.2016.03.006

M3 - SCORING: Journal article

C2 - 27224515

VL - 14

SP - 235

EP - 343

JO - GENOM PROTEOM BIOINF

JF - GENOM PROTEOM BIOINF

SN - 1672-0229

IS - 4

ER -