Title: Diversifying evolution of the ubiquitin-26S proteasome system in Brassicaceae and Poaceae
Genome amplification and sequence divergence provide the raw material for organismal adaptation. This is exemplified by the large expansion of the ubiquitin-26S proteasome system (UPS) in land plants, which rely primarily on intracellular signaling and biochemical metabolism to combat biotic and abiotic stresses. While a handful of functional genomic studies have demonstrated the adaptive role of the UPS in plant growth and development, many UPS members remain uncharacterized. In this work, we used a comparative genomic approach to address the functional divergence of the UPS at a systematic level. We first used a closing-target-trimming annotation approach to identify most, if not all, UPS members in six species from each of two evolutionarily distant plant families, Brassicaceae and Poaceae. To reduce age-related errors, the two groups of species were selected based on their similar chronological order of speciation. Through size comparison, chronological expansion inference, evolutionary selection analyses, duplication mechanism prediction, and functional domain enrichment assays, we discovered significant diversity within the UPS between Brassicaceae and Poaceae, particularly among members of its three largest ubiquitin ligase gene families: the F-box (FBX), the Really Interesting New Gene (RING), and the Bric-a-Brac/Tramtrack/Broad Complex (BTB) families. The discovery, through a comprehensive phylogenetic analysis, of independent Arabidopsis and Oryza genus-specific subclades of the 26S proteasome subunits further supported a diversifying evolutionary model of the UPS in these two genera, confirming its role in plant adaptation.
Award ID(s):
1750361
NSF-PAR ID:
10101158
Journal Name:
International journal of molecular sciences
Volume:
20
ISSN:
1422-0067
Page Range / eLocation ID:
3226
Sponsoring Org:
National Science Foundation
More Like this
  1. Protein degradation through the Ubiquitin (Ub)-26S Proteasome System (UPS) is a major gene expression regulatory pathway in plants. In this pathway, the 76-amino acid Ub proteins are covalently linked onto a large array of UPS substrates with the help of three enzymes (E1 activating, E2 conjugating, and E3 ligating enzymes), directing them for turnover in the 26S proteasome complex. The S-phase Kinase-associated Protein 1 (Skp1), CUL1, F-box (FBX) protein (SCF) complexes have been identified as the largest E3 ligase group in plants due to the dramatic expansion in the number of FBX genes in plant genomes. Since it is the FBX proteins that recognize SCF substrates and thereby determine substrate specificity, much effort has been devoted to characterizing their genomic, physiological, and biochemical roles over the past two decades of functional genomic studies. However, the sheer size and high sequence diversity of the FBX gene family demand new approaches to uncover unknown functions. In this work, we first identified 82 known FBX members that have been functionally characterized to date in Arabidopsis thaliana. By comparing the genomic structure, evolutionary selection, expression patterns, domain compositions, and functional activities of known and unknown FBX gene members, we developed a neural network machine learning approach to predict whether an unknown FBX member is likely to be functionally active in Arabidopsis, thereby facilitating its future functional characterization.
  2. Genome sequencing has uncovered tremendous sequence variation within and between species. In plants, in addition to large variations in genome size, a great deal of sequence polymorphism is also evident in several large multi-gene families, including those involved in the ubiquitin-26S proteasome protein degradation system. However, the biological function of this sequence variation is not yet clear. In this work, we demonstrated a single origin of retroposed Arabidopsis Skp1-Like (ASK) genes using an improved phylogenetic analysis. Taking advantage of the 1,001 genomes project, we provide several lines of polymorphism evidence showing both adaptive and degenerative evolutionary processes in ASK genes. Yeast two-hybrid quantitative interaction assays further suggested that recent neutral changes in the ASK2 coding sequence weakened its interactions with some F-box proteins. The trend that highly polymorphic upstream regions of ASK1 yield high levels of expression implied negative regulation of ASK1 expression by an as-yet-unknown transcriptional suppression mechanism, which may contribute to the polymorphic roles of Skp1-CUL1-F-box complexes. Taken together, this study provides new evolutionary evidence to guide future functional genomic studies of SCF-mediated protein ubiquitylation.
  3. A signaling complex comprising members of the LORELEI (LRE)-LIKE GPI-anchored protein (LLG) and Catharanthus roseus RECEPTOR-LIKE KINASE 1-LIKE (CrRLK1L) families perceives RAPID ALKALINIZATION FACTOR (RALF) peptides and regulates growth, reproduction, immunity, and stress responses in Arabidopsis (Arabidopsis thaliana). Genes encoding these proteins are members of multigene families in most angiosperms and could generate thousands of signaling complex variants. However, the links between expansion of these gene families and the functional diversification of this critical signaling complex, as well as the evolutionary factors underlying the maintenance of gene duplicates, remain unknown. Here, we investigated LLG gene family evolution by sampling land plant genomes and explored the function and expression of angiosperm LLGs. We found that LLG diversity within major land plant lineages is primarily due to lineage-specific duplication events, and that these duplications occurred both early in the history of these lineages and more recently. Our complementation and expression analyses showed that expression divergence (i.e. regulatory subfunctionalization), rather than functional divergence, explains the retention of LLG paralogs. Interestingly, all but one monocot and all eudicot species examined had an LLG copy with preferential expression in male reproductive tissues, while the other duplicate copies showed their highest levels of expression in female or vegetative tissues. The single LLG copy in Amborella trichopoda is expressed at much higher levels in male reproductive tissues than in female reproductive or vegetative tissues. We propose that expression divergence plays an important role in the retention of LLG duplicates in angiosperms.
  4. Ubiquitin is a 76-amino acid polypeptide common to all eukaryotic organisms. It functions as a post-translational modification mark covalently linked to a large cohort of as yet poorly defined protein substrates. The resulting ubiquitylated proteins can rapidly change their activities or cellular localization, or be turned over by the 26S proteasome if they are no longer needed or are abnormal. Such selective modification is essential to many signal transduction pathways, particularly those related to stress responses, because it rapidly enhances or quenches output. Hence, this modification system, the so-called ubiquitin-26S proteasome system (UPS), has attracted attention in the plant research community over the last two decades for its roles in plant abiotic and biotic stress responses. Through direct or indirect mediation of plant hormones, the UPS selectively degrades key components in stress signaling to either negatively or positively regulate the plant response to a given stimulus. As a result, this tightly regulated signaling network has drawn much interest over the years. Ongoing changes in the global climate require both the development of new crops that can cope with rapidly changing environments and new knowledge for surveying ecosystem dynamics. This review examines how ubiquitin can switch and tune plant stress responses and proposes potential avenues to further explore this system.
  5. The contemporary capacity of genome sequence analysis significantly lags behind the rapidly evolving sequencing technologies. Retrieving biologically meaningful information from an ever-increasing amount of genome data would be significantly beneficial for functional genomic studies. For example, the duplication, organization, evolution, and function of superfamily genes are arguably important in many aspects of life. However, the incompleteness of annotations in many sequenced genomes often results in biased conclusions in comparative genomic studies of superfamilies. Here, we present a Perl program, called Closing Target Trimming (CTT), for automatically identifying most, if not all, members of a gene family in any sequenced genome on the CentOS 7 platform. To support broader application on other operating systems, we also created a Docker application package, CTTdocker. Our test data on the F-box gene superfamily showed gene finding accuracies of 78.2% and 79% in two well-annotated plant genomes, Arabidopsis thaliana and rice, respectively. To further demonstrate the effectiveness of this program, we ran it on 18 plant genomes and five non-plant genomes to compare the expansion of the F-box and the BTB superfamilies. The program discovered that on average 12.7% and 9.3% of the total F-box and BTB members, respectively, are new loci in plant genomes, while it found only a small number of new members in vertebrate genomes. Therefore, different evolutionary and regulatory mechanisms of cullin-RING ubiquitin ligases may be present in plants and animals. We also annotated and compared the Pkinase family members across a wide range of organisms, including 10 fungi, 10 metazoa, 10 vertebrates, and 10 additional plants, which were randomly selected from the Ensembl database. Our CTT annotation recovered on average 14% more loci, including pseudogenes, of the Pkinase superfamily in these 40 genomes, demonstrating its robust replicability and scalability in annotating superfamily members in any genome.