DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection

Standard

DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection. / Hausmann, Fabian; Ergen, Can; Khatri, Robin; Marouf, Mohamed; Hänzelmann, Sonja; Gagliani, Nicola; Huber, Samuel; Machart, Pierre; Bonn, Stefan.

in: GENOME BIOL, Jahrgang 24, Nr. 1, 20.09.2023, S. 212.

Publikationen: SCORING: Beitrag in Fachzeitschrift/ZeitungSCORING: ZeitschriftenaufsatzForschungBegutachtung

Harvard

APA

Vancouver

Bibtex

@article{f558daa9d8c14208b69fe30565cc0d27,
title = "DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection",
abstract = "BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification.RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4+ and CD8+ Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow.CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms.",
keywords = "Humans, COVID-19/genetics, Algorithms, Cell Cycle, Cell Differentiation, Cluster Analysis",
author = "Fabian Hausmann and Can Ergen and Robin Khatri and Mohamed Marouf and Sonja H{\"a}nzelmann and Nicola Gagliani and Samuel Huber and Pierre Machart and Stefan Bonn",
note = "{\textcopyright} 2023. BioMed Central Ltd., part of Springer Nature.",
year = "2023",
month = sep,
day = "20",
doi = "10.1186/s13059-023-03049-x",
language = "English",
volume = "24",
pages = "212",
journal = "GENOME BIOL",
issn = "1474-760X",
number = "1",

}

RIS

TY - JOUR

T1 - DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection

AU - Hausmann, Fabian

AU - Ergen, Can

AU - Khatri, Robin

AU - Marouf, Mohamed

AU - Hänzelmann, Sonja

AU - Gagliani, Nicola

AU - Huber, Samuel

AU - Machart, Pierre

AU - Bonn, Stefan

N1 - © 2023. BioMed Central Ltd., part of Springer Nature.

PY - 2023/9/20

Y1 - 2023/9/20

N2 - BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification.RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4+ and CD8+ Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow.CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms.

AB - BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification.RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4+ and CD8+ Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow.CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms.

KW - Humans

KW - COVID-19/genetics

KW - Algorithms

KW - Cell Cycle

KW - Cell Differentiation

KW - Cluster Analysis

U2 - 10.1186/s13059-023-03049-x

DO - 10.1186/s13059-023-03049-x

M3 - SCORING: Journal article

C2 - 37730638

VL - 24

SP - 212

JO - GENOME BIOL

JF - GENOME BIOL

SN - 1474-760X

IS - 1

ER -