Document Type
Article
Original Publication Date
2015
Journal/Book/Conference Title
BMC Microbiology
Volume
15
Issue
66
DOI of Original Publication
10.1186/s12866-015-0351-6
Date of Submission
April 2016
Abstract
Background
Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the “true” composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits.
Results
We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions.
Conclusions
Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples.
Rights
© Brooks et al.; licensee BioMed Central. 2015
Is Part Of
VCU Study of Biological Complexity Publications
additional file 3.pdf (6 kB)
additional file 4.jpeg (73 kB)
additional file 5.jpeg (73 kB)
additional file 6.zip (1 kB)
additional file 7.pdf (196 kB)
additional file 8.txt (580 kB)
additional file 9.zip (1 kB)
additional file 10.csv (49 kB)
additional file 11.csv (49 kB)
additional file 12.txt (6 kB)
additional file 13.txt (6 kB)
additional file 14.txt (6 kB)
additional file 15.zip (18037 kB)
Comments
Originally published at: http://dx.doi.org/10.1186/s12866-015-0351-6