TY - JOUR
T1 - Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks
AU - Bin Jang, Ho
AU - Bolduc, Benjamin
AU - Zablocki, Olivier
AU - Kuhn, Jens H.
AU - Roux, Simon
AU - Adriaenssens, Evelien M.
AU - Brister, J. Rodney
AU - Kropinski, Andrew M.
AU - Krupovic, Mart
AU - Lavigne, Rob
AU - Turner, Dann
AU - Sullivan, Matthew B.
N1 - Publisher Copyright:
© 2019, The Author(s), under exclusive licence to Springer Nature America, Inc.
PY - 2019/6/1
Y1 - 2019/6/1
N2 - Microbiomes from every environment contain a myriad of uncultivated archaeal and bacterial viruses, but studying these viruses is hampered by the lack of a universal, scalable taxonomic framework. We present vConTACT v.2.0, a network-based application utilizing whole genome gene-sharing profiles for virus taxonomy that integrates distance-based hierarchical clustering and confidence scores for all taxonomic predictions. We report near-identical (96%) replication of existing genus-level viral taxonomy assignments from the International Committee on Taxonomy of Viruses for National Center for Biotechnology Information virus RefSeq. Application of vConTACT v.2.0 to 1,364 previously unclassified viruses deposited in virus RefSeq as reference genomes produced automatic, high-confidence genus assignments for 820 of the 1,364. We applied vConTACT v.2.0 to analyze 15,280 Global Ocean Virome genome fragments and were able to provide taxonomic assignments for 31% of these data, which shows that our algorithm is scalable to very large metagenomic datasets. Our taxonomy tool can be automated and applied to metagenomes from any environment for virus classification.
AB - Microbiomes from every environment contain a myriad of uncultivated archaeal and bacterial viruses, but studying these viruses is hampered by the lack of a universal, scalable taxonomic framework. We present vConTACT v.2.0, a network-based application utilizing whole genome gene-sharing profiles for virus taxonomy that integrates distance-based hierarchical clustering and confidence scores for all taxonomic predictions. We report near-identical (96%) replication of existing genus-level viral taxonomy assignments from the International Committee on Taxonomy of Viruses for National Center for Biotechnology Information virus RefSeq. Application of vConTACT v.2.0 to 1,364 previously unclassified viruses deposited in virus RefSeq as reference genomes produced automatic, high-confidence genus assignments for 820 of the 1,364. We applied vConTACT v.2.0 to analyze 15,280 Global Ocean Virome genome fragments and were able to provide taxonomic assignments for 31% of these data, which shows that our algorithm is scalable to very large metagenomic datasets. Our taxonomy tool can be automated and applied to metagenomes from any environment for virus classification.
UR - http://www.scopus.com/inward/record.url?scp=85065314281&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85065314281&partnerID=8YFLogxK
U2 - 10.1038/s41587-019-0100-8
DO - 10.1038/s41587-019-0100-8
M3 - Article
C2 - 31061483
AN - SCOPUS:85065314281
SN - 1087-0156
VL - 37
SP - 632
EP - 639
JO - Nature biotechnology
JF - Nature biotechnology
IS - 6
ER -