skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Protocols for the Molecular Evolutionary Analysis of Membrane Protein Gene Duplicates
Gene duplication is an important process in the evolution of gene content in eukaryotic genomes. Understanding when gene duplicates contribute new molecular functions to genomes through molecular adaptation is one important goal in comparative genomics. In large gene families, however, characterizing adaptation and neofunctionalization across species is challenging, as models have traditionally quantified the timing of duplications without considering underlying gene trees. This protocol combines multiple approaches to detect adaptation in protein duplicates at a phylogenetic scale. We include a description of models for gene tree-species tree reconciliation that enable different types of inference, as well as a practical guide to their use. Although simulation-based approaches successfully detect shifts in the rate of duplica- tion/retention, the conflation between the duplication and retention processes, the distinct trajectories of duplicates under non-, sub-, and neofunctionalization, as well as dosage effects offer hitherto unexplored analytical avenues. We introduce mathematical descriptions of these probabilities and offer a road map to computational implementation whose starting point is parsimony reconciliation. Sequence evolution information based on the ratio of nonsynonymous to synonymous nucleotide substitution rates (dN/dS) can be combined with duplicate survival probabilities to better predict the emergence of new molecular functions in retained duplicates. Together, these methods enable characterization of potentially adaptive candidate duplicates whose neofunctionalization may contribute to phenotypic divergence across species.  more » « less
Award ID(s):
1442142 1701414 1854469
PAR ID:
10200600
Author(s) / Creator(s):
; ; ;
Editor(s):
Sikosek, Tobias
Date Published:
Journal Name:
Computational Methods in Protein Evolution
Volume:
1851
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. BackgroundAnalyses of microbial evolution often use reconciliation methods. However, the standard duplication-transfer-loss (DTL) model does not account for the fact that species trees are often not fully sampled and thus, from the perspective of reconciliation, a gene family may enter the species tree from the outside. Moreover, within the genome, genes are often rearranged, causing them to move to new syntenic regions. ResultsWe extend the DTL model to account for two events that commonly arise in the evolution of microbes:originof a gene from outside the sampled species tree andrearrangementof gene syntenic regions. We describe an efficient algorithm for maximum parsimony reconciliation in this new DTLOR model and then show how it can be extended to account for non-binary gene trees to handle uncertainty in gene tree topologies. Finally, we describe preliminary experimental results from the integration of our algorithm into the existing xenoGI tool for reconstructing the histories of genomic islands in closely related bacteria. ConclusionsReconciliation in the DTLOR model can offer new insights into the evolution of microbes that is not currently possible under the DTL model. 
    more » « less
  2. Synopsis Gene duplicates, or paralogs, serve as a major source of new genetic material and comprise seeds for evolutionary innovation. While originally thought to be quickly lost or nonfunctionalized following duplication, now a vast number of paralogs are known to be retained in a functional state. Daughter paralogs can provide robustness through redundancy, specialize via sub-functionalization, or neo-functionalize to play new roles. Indeed, the duplication and divergence of developmental genes have played a monumental role in the evolution of animal forms (e.g., Hox genes). Still, despite their prevalence and evolutionary importance, the precise detection of gene duplicates in newly sequenced genomes remains technically challenging and often overlooked. This presents an especially pertinent problem for evolutionary developmental biology, where hypothesis testing requires accurate detection of changes in gene expression and function, often in nontraditional model species. Frequently, these analyses rely on molecular reagents designed within coding sequences that may be highly similar in recently duplicated paralogs, leading to cross-reactivity and spurious results. Thus, care is needed to avoid erroneously assigning diverged functions of paralogs to a single gene, and potentially misinterpreting evolutionary history. This perspective aims to overview the prevalence and importance of paralogs and to shed light on the difficulty of their detection and analysis while offering potential solutions. 
    more » « less
  3. Understanding how morphology evolves requires identifying the types of mutations that contribute to changes in development. We integrated comparative genomics and transcriptomics to reconstruct the evolution and regulation offollistatinparalogs in relation to the evolution of aphid winged and wingless morphs. We find that different pea aphidfollistatinduplicates play an essential molecular role in both the male and female wing dimorphisms, linking the genetic and environmental control of morph determination in each sex, respectively. We also find that an ancestralfollistatingene likely had multiple promoters and that thefollistatinduplicates that evolved wingless-specific expression retained only the ancestral wingless-specific promoter. Our work provides a roadmap for how alternative promoter usage and subsequent gene duplication can enable the evolution of animal form. 
    more » « less
  4. Synopsis The proliferation of genomic resources for Chelicerata in the past 10 years has revealed that the evolution of chelicerate genomes is more dynamic than previously thought, with multiple waves of ancient whole genome duplications affecting separate lineages. Such duplication events are fascinating from the perspective of evolutionary history because the burst of new gene copies associated with genome duplications facilitates the acquisition of new gene functions (neofunctionalization), which may in turn lead to morphological novelties and spur net diversification. While neofunctionalization has been invoked in several contexts with respect to the success and diversity of spiders, the overall impact of whole genome duplications on chelicerate evolution and development remains imperfectly understood. The purpose of this review is to examine critically the role of whole genome duplication on the diversification of the extant arachnid orders, as well as assess functional datasets for evidence of subfunctionalization or neofunctionalization in chelicerates. This examination focuses on functional data from two focal model taxa: the spider Parasteatoda tepidariorum, which exhibits evidence for an ancient duplication, and the harvestman Phalangium opilio, which exhibits an unduplicated genome. I show that there is no evidence that taxa with genome duplications are more successful than taxa with unduplicated genomes. I contend that evidence for sub- or neofunctionalization of duplicated developmental patterning genes in spiders is indirect or fragmentary at present, despite the appeal of this postulate for explaining the success of groups like spiders. Available expression data suggest that the condition of duplicated Hox modules may have played a role in promoting body plan disparity in the posterior tagma of some orders, such as spiders and scorpions, but functional data substantiating this postulate are critically missing. Spatiotemporal dynamics of duplicated transcription factors in spiders may represent cases of developmental system drift, rather than neofunctionalization. Developmental system drift may represent an important, but overlooked, null hypothesis for studies of paralogs in chelicerate developmental biology. To distinguish between subfunctionalization, neofunctionalization, and developmental system drift, concomitant establishment of comparative functional datasets from taxa exhibiting the genome duplication, as well as those that lack the paralogy, is sorely needed. 
    more » « less
  5. Abstract To understand genome evolution in a group of microbes, we need to know the timing of events such as duplications, deletions and horizontal transfers. A common approach is to perform a gene-tree / species-tree reconciliation. While a number of software packages perform this type of analysis, none are geared toward a complete reconstruction for all families in an entire clade. Here we describe an update to the xenoGI software package which allows users to perform such an analysis using the newly developed DTLOR (duplication-transfer-loss-origin-rearrangement) reconciliation model starting from genome sequences as input. 
    more » « less