skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Multimerization variants as potential drivers of neofunctionalization
Whole-genome duplications are common during evolution, creating genetic redundancy that can enable cellular innovations. Novel protein-protein interactions provide a route to diversified gene functions, but, at present, there is limited proteome-scale knowledge on the extent to which variability in protein complex formation drives neofunctionalization. Here, we used protein correlation profiling to test for variability in apparent mass among thousands of orthologous proteins isolated from diverse species and cell types. Variants in protein complex size were unexpectedly common, in some cases appearing after relatively recent whole-genome duplications or an allopolyploidy event. In other instances, variants such as those in the carbonic anhydrase orthologous group reflected the neofunctionalization of ancient paralogs that have been preserved in extant species. Our results demonstrate that homo- and heteromer formation have the potential to drive neofunctionalization in diverse classes of enzymes, signaling, and structural proteins.  more » « less
Award ID(s):
1951819
PAR ID:
10230885
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Science Advances
Volume:
7
Issue:
13
ISSN:
2375-2548
Page Range / eLocation ID:
eabf0984
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Synopsis The proliferation of genomic resources for Chelicerata in the past 10 years has revealed that the evolution of chelicerate genomes is more dynamic than previously thought, with multiple waves of ancient whole genome duplications affecting separate lineages. Such duplication events are fascinating from the perspective of evolutionary history because the burst of new gene copies associated with genome duplications facilitates the acquisition of new gene functions (neofunctionalization), which may in turn lead to morphological novelties and spur net diversification. While neofunctionalization has been invoked in several contexts with respect to the success and diversity of spiders, the overall impact of whole genome duplications on chelicerate evolution and development remains imperfectly understood. The purpose of this review is to examine critically the role of whole genome duplication on the diversification of the extant arachnid orders, as well as assess functional datasets for evidence of subfunctionalization or neofunctionalization in chelicerates. This examination focuses on functional data from two focal model taxa: the spider Parasteatoda tepidariorum, which exhibits evidence for an ancient duplication, and the harvestman Phalangium opilio, which exhibits an unduplicated genome. I show that there is no evidence that taxa with genome duplications are more successful than taxa with unduplicated genomes. I contend that evidence for sub- or neofunctionalization of duplicated developmental patterning genes in spiders is indirect or fragmentary at present, despite the appeal of this postulate for explaining the success of groups like spiders. Available expression data suggest that the condition of duplicated Hox modules may have played a role in promoting body plan disparity in the posterior tagma of some orders, such as spiders and scorpions, but functional data substantiating this postulate are critically missing. Spatiotemporal dynamics of duplicated transcription factors in spiders may represent cases of developmental system drift, rather than neofunctionalization. Developmental system drift may represent an important, but overlooked, null hypothesis for studies of paralogs in chelicerate developmental biology. To distinguish between subfunctionalization, neofunctionalization, and developmental system drift, concomitant establishment of comparative functional datasets from taxa exhibiting the genome duplication, as well as those that lack the paralogy, is sorely needed. 
    more » « less
  2. Abstract Butterfly eyes are complex organs that are composed of a diversity of proteins and they play a central role in visual signaling and ultimately, speciation, and adaptation. Here, we utilized the whole eye transcriptome to obtain a more holistic view of the evolution of the butterfly eye while accounting for speciation events that co-occur with ancient hybridization. We sequenced and assembled transcriptomes from adult female eyes of eight species representing all major clades of the Heliconius genus and an additional outgroup species, Dryas iulia. We identified 4,042 orthologous genes shared across all transcriptome data sets and constructed a transcriptome-wide phylogeny, which revealed topological discordance with the mitochondrial phylogenetic tree in the Heliconius pupal mating clade. We then estimated introgression among lineages using additional genome data and found evidence for ancient hybridization leading to the common ancestor of Heliconius hortense and Heliconius clysonymus. We estimated the Ka/Ks ratio for each orthologous cluster and performed further tests to demonstrate genes showing evidence of adaptive protein evolution. Furthermore, we characterized patterns of expression for a subset of these positively selected orthologs using qRT-PCR. Taken together, we identified candidate eye genes that show signatures of adaptive molecular evolution and provide evidence of their expression divergence between species, tissues, and sexes. Our results demonstrate: 1) greater evolutionary changes in younger Heliconius lineages, that is, more positively selected genes in the cydno–melpomene–hecale group as opposed to the sara–hortense–erato group, and 2) suggest an ancient hybridization leading to speciation among Heliconius pupal-mating species. 
    more » « less
  3. Corneous proteins are an important component of the tetrapod integument. Duplication and diversification of keratins and associated proteins are linked with the origin of most novel integumentary structures like mammalian hair, avian feathers, and scutes covering turtle shells. Accordingly, the loss of integumentary structures often coincides with the loss of genes encoding keratin and associated proteins. For example, many hair keratins in dolphins and whales have become pseudogenes. The adhesive setae of geckos and anoles are composed of both intermediate filament keratins (IF-keratins, formerly known as alpha-keratins) and corneous beta-proteins (CBPs, formerly known as beta-keratins) and recent whole genome assemblies of two gecko species and an anole uncovered duplications in seta-specific CBPs in each of these lineages. While anoles evolved adhesive toepads just once, there are two competing hypotheses about the origin(s) of digital adhesion in geckos involving either a single origin or multiple origins. Using data from three published gecko genomes, I examine CBP gene evolution in geckos and find support for a hypothesis where CBP gene duplications are associated with the repeated evolution of digital adhesion. Although these results are preliminary, I discuss how additional gecko genome assemblies, combined with phylogenies of keratin and associated protein genes and gene duplication models, can provide rigorous tests of several hypotheses related to gecko CBP evolution. This includes a taxon sampling strategy for sequencing and assembly of gecko genomes that could help resolve competing hypotheses surrounding the origin(s) of digital adhesion. 
    more » « less
  4. Rogers, Rebekah (Ed.)
    Abstract Whole-genome duplications (WGDs) have shaped the gene repertoire of many eukaryotic lineages. The redundancy created by WGDs typically results in a phase of massive gene loss. However, some WGD–derived paralogs are maintained over long evolutionary periods, and the relative contributions of different selective pressures to their maintenance are still debated. Previous studies have revealed a history of three successive WGDs in the lineage of the ciliate Paramecium tetraurelia and two of its sister species from the Paramecium aurelia complex. Here, we report the genome sequence and analysis of 10 additional P. aurelia species and 1 additional out group, revealing aspects of post-WGD evolution in 13 species sharing a common ancestral WGD. Contrary to the morphological radiation of vertebrates that putatively followed two WGD events, members of the cryptic P. aurelia complex have remained morphologically indistinguishable after hundreds of millions of years. Biases in gene retention compatible with dosage constraints appear to play a major role opposing post-WGD gene loss across all 13 species. In addition, post-WGD gene loss has been slower in Paramecium than in other species having experienced genome duplication, suggesting that the selective pressures against post-WGD gene loss are especially strong in Paramecium. A near complete lack of recent single-gene duplications in Paramecium provides additional evidence for strong selective pressures against gene dosage changes. This exceptional data set of 13 species sharing an ancestral WGD and 2 closely related out group species will be a useful resource for future studies on Paramecium as a major model organism in the evolutionary cell biology. 
    more » « less
  5. Stamatakis, Alexandros (Ed.)
    Abstract Motivation Comparative genome analysis of two or more whole-genome sequenced (WGS) samples is at the core of most applications in genomics. These include the discovery of genomic differences segregating in populations, case-control analysis in common diseases and diagnosing rare disorders. With the current progress of accurate long-read sequencing technologies (e.g. circular consensus sequencing from PacBio sequencers), we can dive into studying repeat regions of the genome (e.g. segmental duplications) and hard-to-detect variants (e.g. complex structural variants). Results We propose a novel framework for comparative genome analysis through the discovery of strings that are specific to one genome (‘samples-specific’ strings). We have developed a novel, accurate and efficient computational method for the discovery of sample-specific strings between two groups of WGS samples. The proposed approach will give us the ability to perform comparative genome analysis without the need to map the reads and is not hindered by shortcomings of the reference genome and mapping algorithms. We show that the proposed approach is capable of accurately finding sample-specific strings representing nearly all variation (>98%) reported across pairs or trios of WGS samples using accurate long reads (e.g. PacBio HiFi data). Availability and implementation Data, code and instructions for reproducing the results presented in this manuscript are publicly available at https://github.com/Parsoa/PingPong. Supplementary information Supplementary data are available at Bioinformatics Advances online. 
    more » « less