PremiseLarge genomic data sets offer the promise of resolving historically recalcitrant species relationships. However, different methodologies can yield conflicting results, especially when clades have experienced ancient, rapid diversification. Here, we analyzed the ancient radiation of Ericales and explored sources of uncertainty related to species tree inference, conflicting gene tree signal, and the inferred placement of gene and genome duplications. MethodsWe used a hierarchical clustering approach, with tree‐based homology and orthology detection, to generate six filtered phylogenomic matrices consisting of data from 97 transcriptomes and genomes. Support for species relationships was inferred from multiple lines of evidence including shared gene duplications, gene tree conflict, gene‐wise edge‐based analyses, concatenation, and coalescent‐based methods, and is summarized in a consensus framework. ResultsOur consensus approach supported a topology largely concordant with previous studies, but suggests that the data are not capable of resolving several ancient relationships because of lack of informative characters, sensitivity to methodology, and extensive gene tree conflict correlated with paleopolyploidy. We found evidence of a whole‐genome duplication before the radiation of all or most ericalean families, and demonstrate that tree topology and heterogeneous evolutionary rates affect the inferred placement of genome duplications. ConclusionsWe provide several hypotheses regarding the history of Ericales, and confidently resolve most nodes, but demonstrate that a series of ancient divergences are unresolvable with these data. Whether paleopolyploidy is a major source of the observed phylogenetic conflict warrants further investigation.
more »
« less
The roles of gene duplications in the dynamics of evolutionary conflicts
Evolutionary conflicts occur when there is antagonistic selection between different individuals of the same or different species, life stages or between levels of biological organization. Remarkably, conflicts can occur within species or within genomes. In the dynamics of evolutionary conflicts, gene duplications can play a major role because they can bring very specific changes to the genome: changes in protein dose, the generation of novel paralogues with different functions or expression patterns or the evolution of small antisense RNAs. As we describe here, by having those effects, gene duplication might spark evolutionary conflict or fuel arms race dynamics that takes place during conflicts. Interestingly, gene duplication can also contribute to the resolution of a within-locus evolutionary conflict by partitioning the functions of the gene that is under an evolutionary trade-off. In this review, we focus on intraspecific conflicts, including sexual conflict and illustrate the various roles of gene duplications with a compilation of examples. These examples reveal the level of complexity and the differences in the patterns of gene duplications within genomes under different conflicts. These examples also reveal the gene ontologies involved in conflict and the genomic location of the elements of the conflict. The examples provide a blueprint for the direct study of these conflicts or the exploration of the presence of similar conflicts in other lineages.
more »
« less
- Award ID(s):
- 1818017
- PAR ID:
- 10528817
- Publisher / Repository:
- The Royal Society Publishing
- Date Published:
- Journal Name:
- Proceedings of the Royal Society B: Biological Sciences
- Volume:
- 291
- Issue:
- 2024
- ISSN:
- 1471-2954
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Piganeau, Gwenael (Ed.)Abstract Numerous factors shape the evolution of protein-coding genes, including shifts in the strength or type of selection following gene duplications or changes in the environment. Diatoms and other silicifying organisms use a family of silicon transporters (SITs) to import dissolved silicon from the environment. Freshwaters contain higher silicon levels than oceans, and marine diatoms have more efficient uptake kinetics and less silicon in their cell walls, making them better competitors for a scarce resource. We compiled SITs from 37 diatom genomes to characterize shifts in selection following gene duplications and marine–freshwater transitions. A deep gene duplication, which coincided with a whole-genome duplication, gave rise to two gene lineages. One of them (SIT1–2) is present in multiple copies in most species and is known to actively import silicon. These SITs have evolved under strong purifying selection that was relaxed in freshwater taxa. Episodic diversifying selection was detected but not associated with gene duplications or habitat shifts. In contrast, genes in the second SIT lineage (SIT3) were present in just half the species, the result of multiple losses. Despite conservation of SIT3 in some lineages for the past 90–100 million years, repeated losses, relaxed selection, and low expression highlighted the dispensability of SIT3, consistent with a model of deterioration and eventual loss due to relaxed selection on SIT3 expression. The extensive but relatively balanced history of duplications and losses, together with paralog-specific expression patterns, suggest diatoms continuously balance gene dosage and expression dynamics to optimize silicon transport across major environmental gradients.more » « less
-
Abstract We aim to enable the accurate and efficient transfer of knowledge about gene function gained fromArabidopsis thalianaand other model organisms to other plant species. This knowledge transfer is frequently challenging in plants due to duplications of individual genes and whole genomes in plant lineages. Such duplications result in complex evolutionary relationships between related genes, which may have similar sequences but highly divergent functions. In such cases, functional inference requires more than a simple sequence similarity calculation. We have developed an online resource, PhyloGenes (phylogenes.org), that displays precomputed phylogenetic trees for plant gene families along with experimentally validated function information for individual genes within the families. A total of 40 plant genomes and 10 non‐plant model organisms are represented in over 8,000 gene families. Evolutionary events such as speciation and duplication are clearly labeled on gene trees to distinguish orthologs from paralogs. Nearly 6,000 families have at least one member with an experimentally supported annotation to a Gene Ontology (GO) molecular function or biological process term. By displaying experimentally validated gene functions associated to individual genes within a tree, PhyloGenes enables functional inference for genes of uncharacterized function, based on their evolutionary relationships to experimentally studied genes, in a visually traceable manner. For the many families containing genes that have evolved to perform different functions, PhyloGenes facilitates the use of evolutionary history to determine the most likely function of genes that have not been experimentally characterized. Future work will enrich the resource by incorporating additional gene function datasets such as plant gene expression atlas data.more » « less
-
Sikosek, Tobias (Ed.)Gene duplication is an important process in the evolution of gene content in eukaryotic genomes. Understanding when gene duplicates contribute new molecular functions to genomes through molecular adaptation is one important goal in comparative genomics. In large gene families, however, characterizing adaptation and neofunctionalization across species is challenging, as models have traditionally quantified the timing of duplications without considering underlying gene trees. This protocol combines multiple approaches to detect adaptation in protein duplicates at a phylogenetic scale. We include a description of models for gene tree-species tree reconciliation that enable different types of inference, as well as a practical guide to their use. Although simulation-based approaches successfully detect shifts in the rate of duplica- tion/retention, the conflation between the duplication and retention processes, the distinct trajectories of duplicates under non-, sub-, and neofunctionalization, as well as dosage effects offer hitherto unexplored analytical avenues. We introduce mathematical descriptions of these probabilities and offer a road map to computational implementation whose starting point is parsimony reconciliation. Sequence evolution information based on the ratio of nonsynonymous to synonymous nucleotide substitution rates (dN/dS) can be combined with duplicate survival probabilities to better predict the emergence of new molecular functions in retained duplicates. Together, these methods enable characterization of potentially adaptive candidate duplicates whose neofunctionalization may contribute to phenotypic divergence across species.more » « less
-
Abstract Gene duplication is a fundamental part of evolutionary innovation. While single-gene duplications frequently exhibit asymmetric evolutionary rates between paralogs, the extent to which this applies to multi-gene duplications remains unclear. In this study, we investigate the role of genetic context in shaping evolutionary divergence within multi-gene duplications, leveraging microsynteny to differentiate source and target copies. Using a dataset of 193 mammalian genome assemblies and a bird outgroup, we systematically analyze patterns of sequence divergence between duplicated genes and reference orthologs. We find that target copies, those relocated to new genomic environments, exhibit elevated evolutionary rates compared to source copies in the ancestral location. This asymmetry is influenced by the distance between copies and the size of the target copy. We also demonstrate that the polarization of rate asymmetry in paralogs, the “choice” of the slowly evolving copy, is biased towards collective, block-wise polarization in multi-gene duplications. Our findings highlight the importance of genetic context in modulating post-duplication divergence, where differences in cis-regulatory elements and co-expressed gene clusters between source and target copies may be responsible. This study presents a large-scale test of asymmetric evolution in multi-gene duplications, offering new insight into how genome architecture shapes functional diversification of paralogs. Significance statementAfter a gene is duplicated, reduced selective constraints can lead the two copies to rapidly diverge, with one copy often evolving faster and occasionally gaining a new function. We quantify the influence of genetic context in choosing which copy of a duplicated gene has an elevated substitution rate. In a representative dataset of 193 mammalian genomes, we found strong evidence that gene copies pasted into new genomic locations tend to evolve faster than the corresponding copies in ancestral locations, suggesting an important role for the regulatory environment. The asymmetry in evolutionary rates of duplicated genes persists even for very large multigenic duplications, up to the scale of megabases, indicating that regulatory interactions frequently reach farther than previously thought.more » « less
An official website of the United States government

