An inter‑laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities

Standard

An inter‑laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities. / O’Sullivan, Denise; Doyle, Ronan; Temisak, Sasithon; Redshaw, Nicholas; Whale, Alexandra; Logan, Grace; Huang, Jiabin; Fischer, Nicole; Amos, Gregory; Preston, Mark; Marchesi, Julian; Wagner, Josef; Parkhill, Julian; Motro, Yair; Denise, Hubert; Finn, Robert; Kay, Gemma; O'Grady, Justin; Ransom-Jones, Emma; Wu, Huihai; Laing, Emma; Benavente, Ernest; Phelan, Jody; Clark, Taane; Moran-Gilad, Jacob; Huggett, Jim.

In: SCI REP-UK, Vol. 11, No. 1, 19 May 2021, p. 10590.

Publications: SCORING: Contribution to journal/newspaper, SCORING: Journal article, Research, Peer-reviewed

Harvard

O’Sullivan, D, Doyle, R, Temisak, S, Redshaw, N, Whale, A, Logan, G, Huang, J, Fischer, N, Amos, G, Preston, M, Marchesi, J, Wagner, J, Parkhill, J, Motro, Y, Denise, H, Finn, R, Kay, G, O'Grady, J, Ransom-Jones, E, Wu, H, Laing, E, Benavente, E, Phelan, J, Clark, T, Moran-Gilad, J & Huggett, J 2021, 'An inter‑laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities', SCI REP-UK, vol. 11, no. 1, p. 10590. https://doi.org/10.1038/s41598-021-89881-2

APA

O’Sullivan, D., Doyle, R., Temisak, S., Redshaw, N., Whale, A., Logan, G., Huang, J., Fischer, N., Amos, G., Preston, M., Marchesi, J., Wagner, J., Parkhill, J., Motro, Y., Denise, H., Finn, R., Kay, G., O'Grady, J., Ransom-Jones, E., ... Huggett, J. (2021). An inter‑laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities. SCI REP-UK, 11(1), 10590. https://doi.org/10.1038/s41598-021-89881-2

Bibtex

@article{302a9b6b270e4d01936fadadb58b7495,
title = "An inter‑laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities",
abstract = "Despite the advent of whole genome metagenomics, targeted approaches (such as 16S rRNA gene amplicon sequencing) continue to be valuable for determining the microbial composition of samples. Amplicon microbiome sequencing can be performed on clinical samples from a normally sterile site to determine the aetiology of an infection (usually single pathogen identification) or samples from more complex niches such as human mucosa or environmental samples where multiple microorganisms need to be identified. The methodologies are frequently applied to determine both presence of micro-organisms and their quantity or relative abundance. There are a number of technical steps required to perform microbial community profiling, many of which may have appreciable precision and bias that impacts final results. In order for these methods to be applied with the greatest accuracy, comparative studies across different laboratories are warranted. In this study we explored the impact of the bioinformatic approaches taken in different laboratories on microbiome assessment using 16S rRNA gene amplicon sequencing results. Data were generated from two mock microbial community samples which were amplified using primer sets spanning five different variable regions of 16S rRNA genes. The PCR-sequencing analysis included three technical repeats of the process to determine the repeatability of their methods. Thirteen laboratories participated in the study, and each analysed the same FASTQ files using their choice of pipeline. This study captured the methods used and the resulting sequence annotation and relative abundance output from bioinformatic analyses. Results were compared to digital PCR assessment of the absolute abundance of each target representing each organism in the mock microbial community samples and also to analyses of shotgun metagenome sequence data. 
This ring trial demonstrates that the choice of bioinformatic analysis pipeline alone can result in different estimations of the composition of the microbiome when using 16S rRNA gene amplicon sequencing data. The study observed differences in terms of both presence and abundance of organisms and provides a resource for ensuring reproducible pipeline development and application. The observed differences were especially prevalent when using custom databases and applying high stringency operational taxonomic unit (OTU) cut-off limits. In order to apply sequencing approaches with greater accuracy, the impact of different analytical steps needs to be clearly delineated and solutions devised to harmonise microbiome analysis results.",
author = "Denise O{\textquoteright}Sullivan and Ronan Doyle and Sasithon Temisak and Nicholas Redshaw and Alexandra Whale and Grace Logan and Jiabin Huang and Nicole Fischer and Gregory Amos and Mark Preston and Julian Marchesi and Josef Wagner and Julian Parkhill and Yair Motro and Hubert Denise and Robert Finn and Gemma Kay and Justin O'Grady and Emma Ransom-Jones and Huihai Wu and Emma Laing and Ernest Benavente and Jody Phelan and Taane Clark and Jacob Moran-Gilad and Jim Huggett",
year = "2021",
month = may,
day = "19",
doi = "10.1038/s41598-021-89881-2",
language = "English",
volume = "11",
pages = "10590",
journal = "SCI REP-UK",
issn = "2045-2322",
publisher = "NATURE PUBLISHING GROUP",
number = "1",

}

RIS

TY - JOUR

T1 - An inter‑laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities

AU - O’Sullivan, Denise

AU - Doyle, Ronan

AU - Temisak, Sasithon

AU - Redshaw, Nicholas

AU - Whale, Alexandra

AU - Logan, Grace

AU - Huang, Jiabin

AU - Fischer, Nicole

AU - Amos, Gregory

AU - Preston, Mark

AU - Marchesi, Julian

AU - Wagner, Josef

AU - Parkhill, Julian

AU - Motro, Yair

AU - Denise, Hubert

AU - Finn, Robert

AU - Kay, Gemma

AU - O'Grady, Justin

AU - Ransom-Jones, Emma

AU - Wu, Huihai

AU - Laing, Emma

AU - Benavente, Ernest

AU - Phelan, Jody

AU - Clark, Taane

AU - Moran-Gilad, Jacob

AU - Huggett, Jim

PY - 2021/5/19

Y1 - 2021/5/19

N2 - Despite the advent of whole genome metagenomics, targeted approaches (such as 16S rRNA gene amplicon sequencing) continue to be valuable for determining the microbial composition of samples. Amplicon microbiome sequencing can be performed on clinical samples from a normally sterile site to determine the aetiology of an infection (usually single pathogen identification) or samples from more complex niches such as human mucosa or environmental samples where multiple microorganisms need to be identified. The methodologies are frequently applied to determine both presence of micro-organisms and their quantity or relative abundance. There are a number of technical steps required to perform microbial community profiling, many of which may have appreciable precision and bias that impacts final results. In order for these methods to be applied with the greatest accuracy, comparative studies across different laboratories are warranted. In this study we explored the impact of the bioinformatic approaches taken in different laboratories on microbiome assessment using 16S rRNA gene amplicon sequencing results. Data were generated from two mock microbial community samples which were amplified using primer sets spanning five different variable regions of 16S rRNA genes. The PCR-sequencing analysis included three technical repeats of the process to determine the repeatability of their methods. Thirteen laboratories participated in the study, and each analysed the same FASTQ files using their choice of pipeline. This study captured the methods used and the resulting sequence annotation and relative abundance output from bioinformatic analyses. Results were compared to digital PCR assessment of the absolute abundance of each target representing each organism in the mock microbial community samples and also to analyses of shotgun metagenome sequence data. 
This ring trial demonstrates that the choice of bioinformatic analysis pipeline alone can result in different estimations of the composition of the microbiome when using 16S rRNA gene amplicon sequencing data. The study observed differences in terms of both presence and abundance of organisms and provides a resource for ensuring reproducible pipeline development and application. The observed differences were especially prevalent when using custom databases and applying high stringency operational taxonomic unit (OTU) cut-off limits. In order to apply sequencing approaches with greater accuracy, the impact of different analytical steps needs to be clearly delineated and solutions devised to harmonise microbiome analysis results.

AB - Despite the advent of whole genome metagenomics, targeted approaches (such as 16S rRNA gene amplicon sequencing) continue to be valuable for determining the microbial composition of samples. Amplicon microbiome sequencing can be performed on clinical samples from a normally sterile site to determine the aetiology of an infection (usually single pathogen identification) or samples from more complex niches such as human mucosa or environmental samples where multiple microorganisms need to be identified. The methodologies are frequently applied to determine both presence of micro-organisms and their quantity or relative abundance. There are a number of technical steps required to perform microbial community profiling, many of which may have appreciable precision and bias that impacts final results. In order for these methods to be applied with the greatest accuracy, comparative studies across different laboratories are warranted. In this study we explored the impact of the bioinformatic approaches taken in different laboratories on microbiome assessment using 16S rRNA gene amplicon sequencing results. Data were generated from two mock microbial community samples which were amplified using primer sets spanning five different variable regions of 16S rRNA genes. The PCR-sequencing analysis included three technical repeats of the process to determine the repeatability of their methods. Thirteen laboratories participated in the study, and each analysed the same FASTQ files using their choice of pipeline. This study captured the methods used and the resulting sequence annotation and relative abundance output from bioinformatic analyses. Results were compared to digital PCR assessment of the absolute abundance of each target representing each organism in the mock microbial community samples and also to analyses of shotgun metagenome sequence data. 
This ring trial demonstrates that the choice of bioinformatic analysis pipeline alone can result in different estimations of the composition of the microbiome when using 16S rRNA gene amplicon sequencing data. The study observed differences in terms of both presence and abundance of organisms and provides a resource for ensuring reproducible pipeline development and application. The observed differences were especially prevalent when using custom databases and applying high stringency operational taxonomic unit (OTU) cut-off limits. In order to apply sequencing approaches with greater accuracy, the impact of different analytical steps needs to be clearly delineated and solutions devised to harmonise microbiome analysis results.

U2 - 10.1038/s41598-021-89881-2

DO - 10.1038/s41598-021-89881-2

M3 - SCORING: Journal article

VL - 11

SP - 10590

JO - SCI REP-UK

JF - SCI REP-UK

SN - 2045-2322

IS - 1

ER -