TY - JOUR
T1 - The Implications of Lineage-Specific Rates for Divergence Time Estimation
AU - Carruthers, Tom
AU - Sanderson, Michael J.
AU - Scotland, Robert W.
AU - Alfaro, Michael
N1 - Funding Information:
FUNDING This work was supported by a Natural Environment Research Council scholarship granted through the Environmental Research DTP programme to T.C.
Publisher Copyright:
© 2019 The Author(s) 2019. Published by Oxford University Press on behalf of the Society of Systematic Biologists.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - Rate variation adds considerable complexity to divergence time estimation in molecular phylogenies. Here, we evaluate the impact of lineage-specific rates - which we define as among-branch-rate-variation that acts consistently across the entire genome. We compare its impact to residual rates - defined as among-branch-rate-variation that shows a different pattern of rate variation at each sampled locus, and gene-specific rates - defined as variation in the average rate across all branches at each sampled locus. We show that lineage-specific rates lead to erroneous divergence time estimates, regardless of how many loci are sampled. Further, we show that stronger lineage-specific rates lead to increasing error. This contrasts to residual rates and gene-specific rates, where sampling more loci significantly reduces error. If divergence times are inferred in a Bayesian framework, we highlight that error caused by lineage-specific rates significantly reduces the probability that the 95% highest posterior density includes the correct value, and leads to sensitivity to the prior. Use of a more complex rate prior - which has recently been proposed to model rate variation more accurately - does not affect these conclusions. Finally, we show that the scale of lineage-specific rates used in our simulation experiments is comparable to that of an empirical data set for the angiosperm genus Ipomoea. Taken together, our findings demonstrate that lineage-specific rates cause error in divergence time estimates, and that this error is not overcome by analyzing genomic scale multilocus data sets. [Divergence time estimation; error; rate variation.].
AB - Rate variation adds considerable complexity to divergence time estimation in molecular phylogenies. Here, we evaluate the impact of lineage-specific rates - which we define as among-branch-rate-variation that acts consistently across the entire genome. We compare its impact to residual rates - defined as among-branch-rate-variation that shows a different pattern of rate variation at each sampled locus, and gene-specific rates - defined as variation in the average rate across all branches at each sampled locus. We show that lineage-specific rates lead to erroneous divergence time estimates, regardless of how many loci are sampled. Further, we show that stronger lineage-specific rates lead to increasing error. This contrasts to residual rates and gene-specific rates, where sampling more loci significantly reduces error. If divergence times are inferred in a Bayesian framework, we highlight that error caused by lineage-specific rates significantly reduces the probability that the 95% highest posterior density includes the correct value, and leads to sensitivity to the prior. Use of a more complex rate prior - which has recently been proposed to model rate variation more accurately - does not affect these conclusions. Finally, we show that the scale of lineage-specific rates used in our simulation experiments is comparable to that of an empirical data set for the angiosperm genus Ipomoea. Taken together, our findings demonstrate that lineage-specific rates cause error in divergence time estimates, and that this error is not overcome by analyzing genomic scale multilocus data sets. [Divergence time estimation; error; rate variation.].
UR - http://www.scopus.com/inward/record.url?scp=85086749178&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85086749178&partnerID=8YFLogxK
U2 - 10.1093/sysbio/syz080
DO - 10.1093/sysbio/syz080
M3 - Article
C2 - 31808929
AN - SCOPUS:85086749178
SN - 1063-5157
VL - 69
SP - 660
EP - 670
JO - Systematic Biology
JF - Systematic Biology
IS - 4
ER -