TY - JOUR
T1 - Genome size evolution in the diverse insect order Trichoptera
AU - Heckenhauer, Jacqueline
AU - Frandsen, Paul B.
AU - Sproul, John S.
AU - Li, Zheng
AU - Paule, Juraj
AU - Larracuente, Amanda M.
AU - Maughan, Peter J.
AU - Barker, Michael S.
AU - Schneider, Julio V.
AU - Stewart, Russell J.
AU - Pauls, Steffen U.
N1 - Funding Information:
J.P. J.S.S. was supported by an NSF Postdoctoral Research Fellowship in Biology (DBI-1811930) and an NIH General Medical Sciences award (R35GM119515).
Publisher Copyright:
© 2022 The Author(s) 2022. Published by Oxford University Press GigaScience.
PY - 2022
Y1 - 2022
N2 - Background: Genome size is implicated in the form, function, and ecological success of a species. Two principally different mechanisms are proposed as major drivers of eukaryotic genome evolution and diversity: polyploidy (i.e., whole-genome duplication) or smaller duplication events and bursts in the activity of repetitive elements. Here, we generated de novo genome assemblies of 17 caddisflies covering all major lineages of Trichoptera. Using these and previously sequenced genomes, we use caddisflies as a model for understanding genome size evolution in diverse insect lineages. Results: We detect a ∼14-fold variation in genome size across the order Trichoptera. We find strong evidence that repetitive element expansions, particularly those of transposable elements (TEs), are important drivers of large caddisfly genome sizes. Using an innovative method to examine TEs associated with universal single-copy orthologs (i.e., BUSCO genes), we find that TE expansions have a major impact on protein-coding gene regions, with TE-gene associations showing a linear relationship with increasing genome size. Intriguingly, we find that expanded genomes preferentially evolved in caddisfly clades with a higher ecological diversity (i.e., various feeding modes, diversification in variable, less stable environments). Conclusion: Our findings provide a platform to test hypotheses about the potential evolutionary roles of TE activity and TE-gene associations, particularly in groups with high species, ecological, and functional diversities.
AB - Background: Genome size is implicated in the form, function, and ecological success of a species. Two principally different mechanisms are proposed as major drivers of eukaryotic genome evolution and diversity: polyploidy (i.e., whole-genome duplication) or smaller duplication events and bursts in the activity of repetitive elements. Here, we generated de novo genome assemblies of 17 caddisflies covering all major lineages of Trichoptera. Using these and previously sequenced genomes, we use caddisflies as a model for understanding genome size evolution in diverse insect lineages. Results: We detect a ∼14-fold variation in genome size across the order Trichoptera. We find strong evidence that repetitive element expansions, particularly those of transposable elements (TEs), are important drivers of large caddisfly genome sizes. Using an innovative method to examine TEs associated with universal single-copy orthologs (i.e., BUSCO genes), we find that TE expansions have a major impact on protein-coding gene regions, with TE-gene associations showing a linear relationship with increasing genome size. Intriguingly, we find that expanded genomes preferentially evolved in caddisfly clades with a higher ecological diversity (i.e., various feeding modes, diversification in variable, less stable environments). Conclusion: Our findings provide a platform to test hypotheses about the potential evolutionary roles of TE activity and TE-gene associations, particularly in groups with high species, ecological, and functional diversities.
KW - Trichoptera
KW - biodiversity; de novo genome assembly
KW - genome duplication
KW - genome size evolution
KW - genomic diversity
KW - genomics
KW - insects
KW - repetitive elements
KW - transposable elements
UR - http://www.scopus.com/inward/record.url?scp=85125551499&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125551499&partnerID=8YFLogxK
U2 - 10.1093/gigascience/giac011
DO - 10.1093/gigascience/giac011
M3 - Article
C2 - 35217860
AN - SCOPUS:85125551499
SN - 2047-217X
VL - 11
JO - GigaScience
JF - GigaScience
M1 - giac011
ER -