TY - JOUR
T1 - Viral dark matter and virus–host interactions resolved from publicly available microbial genomes
AU - Roux, Simon
AU - Hallam, Steven J.
AU - Woyke, Tanja
AU - Sullivan, Matthew B.
N1 - Publisher Copyright:
© 2015, Roux et al.
PY - 2015/7/22
Y1 - 2015/7/22
N2 - The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.
AB - The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.
UR - http://www.scopus.com/inward/record.url?scp=84939526110&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84939526110&partnerID=8YFLogxK
U2 - 10.7554/eLife.08490.001
DO - 10.7554/eLife.08490.001
M3 - Article
C2 - 26200428
AN - SCOPUS:84939526110
SN - 2050-084X
VL - 4
JO - eLife
JF - eLife
IS - JULY2015
M1 - e08490
ER -