Random forest-based modelling to detect biomarkers for prostate cancer progression

Standard

Random forest-based modelling to detect biomarkers for prostate cancer progression. / Toth, Reka; Schiffmann, Heiko; Hube-Magg, Claudia; Büscheck, Franziska; Höflmayer, Doris; Weidemann, Sören; Lebok, Patrick; Fraune, Christoph; Minner, Sarah; Schlomm, Thorsten; Sauter, Guido; Plass, Christoph; Assenov, Yassen; Simon, Ronald; Meiners, Jan; Gerhäuser, Clarissa.

In: CLIN EPIGENETICS, Vol. 11, No. 1, 22.10.2019, p. 148.

Research output: SCORING: Contribution to journalSCORING: Journal articleResearchpeer-review

Harvard

APA

Vancouver

Bibtex

@article{315b1762d2df49aaa21b6dd4c872d330,
title = "Random forest-based modelling to detect biomarkers for prostate cancer progression",
abstract = "BACKGROUND: The clinical course of prostate cancer (PCa) is highly variable, demanding an individualized approach to therapy. Overtreatment of indolent PCa cases, which likely do not progress to aggressive stages, may be associated with severe side effects and considerable costs. These could be avoided by utilizing robust prognostic markers to guide treatment decisions.RESULTS: We present a random forest-based classification model to predict aggressive behaviour of prostate cancer. DNA methylation changes between PCa cases with good or poor prognosis (discovery cohort with n = 70) were used as input. DNA was extracted from formalin-fixed tumour tissue, and genome-wide DNA methylation differences between both groups were assessed using Illumina HumanMethylation450 arrays. For the random forest-based modelling, the discovery cohort was randomly split into a training (80%) and a test set (20%). Our methylation-based classifier demonstrated excellent performance in discriminating prognosis subgroups in the test set (Kaplan-Meier survival analyses with log-rank p value < 0.0001). The area under the receiver operating characteristic curve (AUC) for the sensitivity analysis was 95%. Using the ICGC cohort of early- and late-onset prostate cancer (n = 222) and the TCGA PRAD cohort (n = 477) for external validation, AUCs for sensitivity analyses were 77.1% and 68.7%, respectively. Cancer progression-related DNA hypomethylation was frequently located in 'partially methylated domains' (PMDs)-large-scale genomic areas with progressive loss of DNA methylation linked to mitotic cell division. We selected several candidate genes with differential methylation in gene promoter regions for additional validation at the protein expression level by immunohistochemistry in > 12,000 tissue micro-arrayed PCa cases. Loss of ZIC2 protein expression was associated with poor prognosis and correlated with significantly shorter time to biochemical recurrence. The prognostic value of ZIC2 proved to be independent from established clinicopathological variables including Gleason grade, tumour stage, nodal stage and prostate-specific-antigen.CONCLUSIONS: Our results highlight the prognostic relevance of methylation loss in PMD regions, as well as of several candidate genes not previously associated with PCa progression. Our robust and externally validated PCa classification model either directly or via protein expression analyses of the identified top-ranked candidate genes will support the clinical management of prostate cancer.",
author = "Reka Toth and Heiko Schiffmann and Claudia Hube-Magg and Franziska B{\"u}scheck and Doris H{\"o}flmayer and S{\"o}ren Weidemann and Patrick Lebok and Christoph Fraune and Sarah Minner and Thorsten Schlomm and Guido Sauter and Christoph Plass and Yassen Assenov and Ronald Simon and Jan Meiners and Clarissa Gerh{\"a}user",
year = "2019",
month = oct,
day = "22",
doi = "10.1186/s13148-019-0736-8",
language = "English",
volume = "11",
pages = "148",
journal = "CLIN EPIGENETICS",
issn = "1868-7075",
publisher = "Springer",
number = "1",

}

RIS

TY - JOUR

T1 - Random forest-based modelling to detect biomarkers for prostate cancer progression

AU - Toth, Reka

AU - Schiffmann, Heiko

AU - Hube-Magg, Claudia

AU - Büscheck, Franziska

AU - Höflmayer, Doris

AU - Weidemann, Sören

AU - Lebok, Patrick

AU - Fraune, Christoph

AU - Minner, Sarah

AU - Schlomm, Thorsten

AU - Sauter, Guido

AU - Plass, Christoph

AU - Assenov, Yassen

AU - Simon, Ronald

AU - Meiners, Jan

AU - Gerhäuser, Clarissa

PY - 2019/10/22

Y1 - 2019/10/22

N2 - BACKGROUND: The clinical course of prostate cancer (PCa) is highly variable, demanding an individualized approach to therapy. Overtreatment of indolent PCa cases, which likely do not progress to aggressive stages, may be associated with severe side effects and considerable costs. These could be avoided by utilizing robust prognostic markers to guide treatment decisions.RESULTS: We present a random forest-based classification model to predict aggressive behaviour of prostate cancer. DNA methylation changes between PCa cases with good or poor prognosis (discovery cohort with n = 70) were used as input. DNA was extracted from formalin-fixed tumour tissue, and genome-wide DNA methylation differences between both groups were assessed using Illumina HumanMethylation450 arrays. For the random forest-based modelling, the discovery cohort was randomly split into a training (80%) and a test set (20%). Our methylation-based classifier demonstrated excellent performance in discriminating prognosis subgroups in the test set (Kaplan-Meier survival analyses with log-rank p value < 0.0001). The area under the receiver operating characteristic curve (AUC) for the sensitivity analysis was 95%. Using the ICGC cohort of early- and late-onset prostate cancer (n = 222) and the TCGA PRAD cohort (n = 477) for external validation, AUCs for sensitivity analyses were 77.1% and 68.7%, respectively. Cancer progression-related DNA hypomethylation was frequently located in 'partially methylated domains' (PMDs)-large-scale genomic areas with progressive loss of DNA methylation linked to mitotic cell division. We selected several candidate genes with differential methylation in gene promoter regions for additional validation at the protein expression level by immunohistochemistry in > 12,000 tissue micro-arrayed PCa cases. Loss of ZIC2 protein expression was associated with poor prognosis and correlated with significantly shorter time to biochemical recurrence. The prognostic value of ZIC2 proved to be independent from established clinicopathological variables including Gleason grade, tumour stage, nodal stage and prostate-specific-antigen.CONCLUSIONS: Our results highlight the prognostic relevance of methylation loss in PMD regions, as well as of several candidate genes not previously associated with PCa progression. Our robust and externally validated PCa classification model either directly or via protein expression analyses of the identified top-ranked candidate genes will support the clinical management of prostate cancer.

AB - BACKGROUND: The clinical course of prostate cancer (PCa) is highly variable, demanding an individualized approach to therapy. Overtreatment of indolent PCa cases, which likely do not progress to aggressive stages, may be associated with severe side effects and considerable costs. These could be avoided by utilizing robust prognostic markers to guide treatment decisions.RESULTS: We present a random forest-based classification model to predict aggressive behaviour of prostate cancer. DNA methylation changes between PCa cases with good or poor prognosis (discovery cohort with n = 70) were used as input. DNA was extracted from formalin-fixed tumour tissue, and genome-wide DNA methylation differences between both groups were assessed using Illumina HumanMethylation450 arrays. For the random forest-based modelling, the discovery cohort was randomly split into a training (80%) and a test set (20%). Our methylation-based classifier demonstrated excellent performance in discriminating prognosis subgroups in the test set (Kaplan-Meier survival analyses with log-rank p value < 0.0001). The area under the receiver operating characteristic curve (AUC) for the sensitivity analysis was 95%. Using the ICGC cohort of early- and late-onset prostate cancer (n = 222) and the TCGA PRAD cohort (n = 477) for external validation, AUCs for sensitivity analyses were 77.1% and 68.7%, respectively. Cancer progression-related DNA hypomethylation was frequently located in 'partially methylated domains' (PMDs)-large-scale genomic areas with progressive loss of DNA methylation linked to mitotic cell division. We selected several candidate genes with differential methylation in gene promoter regions for additional validation at the protein expression level by immunohistochemistry in > 12,000 tissue micro-arrayed PCa cases. Loss of ZIC2 protein expression was associated with poor prognosis and correlated with significantly shorter time to biochemical recurrence. The prognostic value of ZIC2 proved to be independent from established clinicopathological variables including Gleason grade, tumour stage, nodal stage and prostate-specific-antigen.CONCLUSIONS: Our results highlight the prognostic relevance of methylation loss in PMD regions, as well as of several candidate genes not previously associated with PCa progression. Our robust and externally validated PCa classification model either directly or via protein expression analyses of the identified top-ranked candidate genes will support the clinical management of prostate cancer.

U2 - 10.1186/s13148-019-0736-8

DO - 10.1186/s13148-019-0736-8

M3 - SCORING: Journal article

C2 - 31640781

VL - 11

SP - 148

JO - CLIN EPIGENETICS

JF - CLIN EPIGENETICS

SN - 1868-7075

IS - 1

ER -