TY - JOUR
T1 - Illuminating structural proteins in viral "dark matter" with metaproteomics
AU - Brum, Jennifer R.
AU - Ignacio-Espinoza, J. Cesar
AU - Kim, Eun Hae
AU - Trubl, Gareth
AU - Jones, Robert M.
AU - Roux, Simon
AU - VerBerkmoes, Nathan C.
AU - Rich, Virginia I.
AU - Sullivan, Matthew B.
N1 - Funding Information:
We thank Bonnie Poulos for preparing viral concentrates, Genoscope for viral metagenomic sequencing, members of Tucson Marine Phage Lab for comments on the manuscript, and University Information Technology Services Research Computing Group and the Arizona Research Laboratories Biotechnology Computing for High-Performance Computing Cluster access and support. We thank Kristen Corrier and Manesh Shah of University of Tennessee/Oak Ridge National Laboratory for efforts in filter-aided sample preparation (FASP) preparation of viral samples and MS analyses, and aspects of proteome informatics, respectively. The four viral concentrates were collected as part of exceptional commitment by scientists and sponsors who made the Tara Oceans expedition possible [full list in Brum et al. (6)]. Funding specific to this project was provided by a Ford Foundation Postdoctoral Fellowship (to E.-H.K.), the Gordon and Betty Moore Foundation through Grants GBMF2631 and GBMF3790 (to M.B.S.), and a grant to the UA Ecosystem Genomics Institute through the UA Technology and Research Initiative Fund and the Water, Environmental and Energy Solutions Initiative (to M.B.S. and V.I.R.). This article is contribution 35 of the Tara Oceans Expedition 2009–2012.
PY - 2016/3/1
Y1 - 2016/3/1
N2 - Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over onehalf of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.
AB - Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over onehalf of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.
KW - Marine
KW - Proteins
KW - Viruses
UR - http://www.scopus.com/inward/record.url?scp=84959474801&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959474801&partnerID=8YFLogxK
U2 - 10.1073/pnas.1525139113
DO - 10.1073/pnas.1525139113
M3 - Article
C2 - 26884177
AN - SCOPUS:84959474801
SN - 0027-8424
VL - 113
SP - 2436
EP - 2441
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 9
ER -