TY - JOUR
T1 - Reconstruction of organismal and gene phylogenies from data on multigene families
T2 - Concerted evolution, homoplasy, and confidence
AU - Sanderson, Michael J.
AU - Doyle, Jeff J.
N1 - Funding Information:
For useful criticism or reviews, we thank E. A. Zimmer, John Freudenstein, an anonymous reviewer, and especially M. Goodman, who provided exceptionally detailed and thorough comments on the manuscript. J. B. Walsh and W. Fitch provided important advice on modeling issues and concepts of orthology/paralogy, respectively. This work was supported by a Sloan Foundation Postdoctoral Fellowship to M.J.S. and NSF grant BSR-8805630 to J.J.D.
PY - 1992/3
Y1 - 1992/3
N2 - The reliability of phylogenies reconstructed from data on multigene families is investigated via simulation. The evolutionary scenario used is a character-based model of a two-gene family in four species in which clocklike divergence is postulated but neither convergence nor reversal is allowed except as a result of recombination and gene conversion. Thus, any homoplasy emerging from parsimony reconstructions from the simulated data matrices can be attributed to concerted evolution. The probabilities of correctly reconstructing two standard trees are estimated by replicate runs of the simulation. One standard tree (the OP or “orthology/paralogy” tree) reflects the true gene genealogy in the absence of concerted evolution; the other (the CE or “concerted evolution” tree) depicts gene relationships under complete homogenization of the gene family. The probability of correct reconstruction of the OP tree declines quickly as concerted evolution increases, but above an intermediate level of concerted evolution the probability of correctly inferring the CE tree increases rapidly. Trees similar but not identical to the correct trees can be reconstructed above or below the critical intermediate level of concerted evolution. Levels of homoplasy and numbers of equally parsimonious minimal trees are maximized, and bootstrap confidence levels are minimized, near this intermediate level of concerted evolution. When reconstructing the correct gene tree is the goal, both consistency indices and bootstrap levels will show misleadingly high values when concerted evolution is high. However, because the correct species tree can be inferred from either the OP or CE tree (in the absence of homoplasy from sources other than concerted evolution), these same measures correlate well with fidelity of reconstructing the species tree. [Concerted evolution; phylogeny; multigene family; parsimony; homoplasy.]
AB - The reliability of phylogenies reconstructed from data on multigene families is investigated via simulation. The evolutionary scenario used is a character-based model of a two-gene family in four species in which clocklike divergence is postulated but neither convergence nor reversal is allowed except as a result of recombination and gene conversion. Thus, any homoplasy emerging from parsimony reconstructions from the simulated data matrices can be attributed to concerted evolution. The probabilities of correctly reconstructing two standard trees are estimated by replicate runs of the simulation. One standard tree (the OP or “orthology/paralogy” tree) reflects the true gene genealogy in the absence of concerted evolution; the other (the CE or “concerted evolution” tree) depicts gene relationships under complete homogenization of the gene family. The probability of correct reconstruction of the OP tree declines quickly as concerted evolution increases, but above an intermediate level of concerted evolution the probability of correctly inferring the CE tree increases rapidly. Trees similar but not identical to the correct trees can be reconstructed above or below the critical intermediate level of concerted evolution. Levels of homoplasy and numbers of equally parsimonious minimal trees are maximized, and bootstrap confidence levels are minimized, near this intermediate level of concerted evolution. When reconstructing the correct gene tree is the goal, both consistency indices and bootstrap levels will show misleadingly high values when concerted evolution is high. However, because the correct species tree can be inferred from either the OP or CE tree (in the absence of homoplasy from sources other than concerted evolution), these same measures correlate well with fidelity of reconstructing the species tree. [Concerted evolution; phylogeny; multigene family; parsimony; homoplasy.]
UR - http://www.scopus.com/inward/record.url?scp=11944255719&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=11944255719&partnerID=8YFLogxK
U2 - 10.1093/sysbio/41.1.4
DO - 10.1093/sysbio/41.1.4
M3 - Article
AN - SCOPUS:11944255719
SN - 1063-5157
VL - 41
SP - 4
EP - 17
JO - Systematic Biology
JF - Systematic Biology
IS - 1
ER -