Statistical binning leads to profound model violation due to gene tree error incurred by trying to avoid gene tree error
- Award ID(s):
- 1655571
- PAR ID:
- 10090086
- Date Published:
- Journal Name:
- Molecular Phylogenetics and Evolution
- Volume:
- 134
- Issue:
- C
- ISSN:
- 1055-7903
- Page Range / eLocation ID:
- 164 to 171
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Stochastic simulation can make the molecular processes of cellular control more vivid than the traditional differential equation approach by generating typical system histories, instead of just statistical measures such as the mean and variance of a population. Simple simulations are now easy for students to construct from scratch—that is, without recourse to black-box packages. In some cases, their results can also be compared directly with single-molecule experimental data. After introducing the stochastic simulation algorithm, this article gives two case studies involving gene expression and error correction, respectively. For gene expression, stochastic simulation results are compared with experimental data, an important research exercise for biophysics students. For error correction, several proofreading models are compared to find the minimal components necessary for sufficient accuracy in translation. Animations of the stochastic error correction models provide insight into the proofreading mechanisms. Code samples and resulting animations showing results are given in the online Supplemental Material .more » « less
-
Holland, Barbara (Ed.)Abstract The evolutionary histories of individual loci in a genome can be estimated independently, but this approach is error-prone due to the limited amount of sequence data available for each gene, which has led to the development of a diverse array of gene tree error correction methods which reduce the distance to the species tree. We investigate the performance of two representatives of these methods: TRACTION and TreeFix. We found that gene tree error correction frequently increases the level of error in gene tree topologies by “correcting” them to be closer to the species tree, even when the true gene and species trees are discordant. We confirm that full Bayesian inference of the gene trees under the multispecies coalescent model is more accurate than independent inference. Future gene tree correction approaches and methods should incorporate an adequately realistic model of evolution instead of relying on oversimplified heuristics.more » « less
An official website of the United States government

