Supporting data for "Inferring putative ancient whole genome duplications in the 1000 Plants (1KP) initiative: access to gene family phylogenies and age distributions"



Polyploidy or whole genome duplications (WGDs) repeatedly occurred during green plant evolution. To examine the evolutionary history of green plants in a phylogenomic framework, the 1KP project sequenced over 1000 transcriptomes across the Viridiplantae. The 1KP project provided a unique opportunity to study the distribution and occurrence of WGDs across the green plants. As an accompaniment to the capstone publication, this paper provides expanded methodological details, results validation, and descriptions of newly released data sets that will aid researchers that wish to use the extended data generated by the 1KP project. In the 1KP capstone analyses, we used a total evidence approach that combined inferences of WGDs from Ks and phylogenomic methods to infer and place 244 putative ancient WGDs across the Viridiplantae. Here, we provide an expanded explanation of our approach by describing our methodology and walkthrough examples. We also evaluated the consistency of our WGD inferences by comparing them to evidence from published syntenic analyses of plant genome assemblies. We find that our inferences are consistent with whole genome synteny analyses and our total evidence approach may minimize the false positive rate throughout the data set. Given these resources will be useful for many future analyses on gene and genome evolution in green plants, we release 383,679 nuclear gene family phylogenies and 2,306 gene age distributions with Ks plots from the 1KP capstone paper.
Date made available2020
PublisherGigaScience Database

Cite this