Assembling the human gut virome, finding the right tools for the job!

Published in Microbiology

Assembling the human gut virome, finding the right tools for the job!
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

BioMed Central
BioMed Central BioMed Central

Choice of assembly software has a critical impact on virome characterisation - Microbiome

Background The viral component of microbial communities plays a vital role in driving bacterial diversity, facilitating nutrient turnover and shaping community composition. Despite their importance, the vast majority of viral sequences are poorly annotated and share little or no homology to reference databases. As a result, investigation of the viral metagenome (virome) relies heavily on de novo assembly of short sequencing reads to recover compositional and functional information. Metagenomic assembly is particularly challenging for virome data, often resulting in fragmented assemblies and poor recovery of viral community members. Despite the essential role of assembly in virome analysis and difficulties posed by these data, current assembly comparisons have been limited to subsections of virome studies or bacterial datasets. Design This study presents the most comprehensive virome assembly comparison to date, featuring 16 metagenomic assembly approaches which have featured in human virome studies. Assemblers were assessed using four independent virome datasets, namely, simulated reads, two mock communities, viromes spiked with a known phage and human gut viromes. Results Assembly performance varied significantly across all test datasets, with SPAdes (meta) performing consistently well. Performance of MIRA and VICUNA varied, highlighting the importance of using a range of datasets when comparing assembly programs. It was also found that while some assemblers addressed the challenges of virome data better than others, all assemblers had limitations. Low read coverage and genomic repeats resulted in assemblies with poor genome recovery, high degrees of fragmentation and low-accuracy contigs across all assemblers. These limitations must be considered when setting thresholds for downstream analysis and when drawing conclusions from virome data.

The human gut virome consists primarily of bacteriophage which may both play a crucial role in regulating and shaping microbial communities of the gut and facilitate horizontal gene transfer and microbial evolution. With 90% of viral sequencing reads sharing little to no homology to reference databases, the make-up of these  viral communities also represents one of the biggest gaps in our understanding of the human microbiome. As the hosts of the majority of these viruses are also unknown, the virome research community relies heavily on sequencing and computational approaches.

When we developed our virome analysis pipeline we realised that if we were to make sense of this unknown majority we would need to move towards In silico methods and away from database dependent approaches. Our first port of call was a crucial stage in all reference-independent pipelines, the assembly step, at which short sequence reads are used to recreate the genome sequences of community members. Looking at previous studies we realised that there was no single assembly method used across all virome studies, nor had there been an extensive assembly comparison dedicated to the virome, which led us to this study.

Metagenomic assembly, or reconstructing the genome sequences of community members, is a common but challenging computational task due to the complexity of microbial communities and large amounts of sequencing data required to represent them in a meaningful way. Unfortunately for virome scientists, assembly challenges of viromes are more difficult – perhaps even by orders of magnitude. The ability of the assembler to overcome these challenges is of significant importance to a virome analysis pipeline, which is essentially built around this crucial step.

By testing 16 assembly approaches on a combination of 4 different virome datasets including both synthetic and human viromes, we observed significant variation in assemblers’ ability to overcome assembly challenges. Most assemblers failed to properly reconstruct phage genomes that we knew to be there, which was a worrying outcome. In most cases the assemblers resulted in only small proportions of the genomes being recovered and assemblies being short and fragmented. These findings have serious implications for virome analysis pipelines, as not only does the choice of assembly program used in a study directly impact which members of a viral community can be recovered, but certain viral genomes appear to challenge all current assembly approaches. We observed that extremes in abundance were responsible for aspects of poor assembly, as were the proportion of genomic repeat regions in each community member. However, these challenges did not explain the full variation in poor genome recovery, highlighting a continued need to improve and develop virome analysis approaches as well was important considerations when setting downstream analysis parameters and making final conclusions.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Microbiology
Life Sciences > Biological Sciences > Microbiology
  • Microbiome Microbiome

    This journal hopes to integrate researchers with common scientific objectives across a broad cross-section of sub-disciplines within microbial ecology. It covers studies of microbiomes colonizing humans, animals, plants or the environment, both built and natural or manipulated, as in agriculture.

Related Collections

With collections, you can get published faster and increase your visibility.

JPL's Biotechnology and Planetary Protection Group: Special Collection

In this special series in Microbiome and Environmental Microbiome, we highlight articles that explore the microbiome of aeronautics and space studies conducted by the Jet Propulsion Laboratory, Biotechnology and Planetary Protection Group. Characterizing aeronautics-associated microbiomes and microgravity compatible technologies for exploring the microbiomes of spacecraft, spacesuits and astronauts provides insights on the utilization of novel technologies and microbiota driving factors. The articles in this collection are also included in our Aeronautics and space microbiomes series.

Publishing Model: Open Access

Deadline: Ongoing