skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Massively parallel identification of sequence motifs triggering ribosome-associated mRNA quality control
Abstract Decay of mRNAs can be triggered by ribosome slowdown at stretches of rare codons or positively charged amino acids. However, the full diversity of sequences that trigger co-translational mRNA decay is poorly understood. To comprehensively identify sequence motifs that trigger mRNA decay, we use a massively parallel reporter assay to measure the effect of all possible combinations of codon pairs on mRNA levels in S. cerevisiae. In addition to known mRNA-destabilizing sequences, we identify several dipeptide repeats whose translation reduces mRNA levels. These include combinations of positively charged and bulky residues, as well as proline-glycine and proline-aspartate dipeptide repeats. Genetic deletion of the ribosome collision sensor Hel2 rescues the mRNA effects of these motifs, suggesting that they trigger ribosome slowdown and activate the ribosome-associated quality control (RQC) pathway. Deep mutational scanning of an mRNA-destabilizing dipeptide repeat reveals a complex interplay between the charge, bulkiness, and location of amino acid residues in conferring mRNA instability. Finally, we show that the mRNA effects of codon pairs are predictive of the effects of endogenous sequences. Our work highlights the complexity of sequence motifs driving co-translational mRNA decay in eukaryotes, and presents a high throughput approach to dissect their requirements at the codon level.  more » « less
Award ID(s):
1846521
PAR ID:
10501807
Author(s) / Creator(s):
; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
52
Issue:
12
ISSN:
0305-1048
Format(s):
Medium: X Size: p. 7171-7187
Size(s):
p. 7171-7187
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Stability of eukaryotic mRNAs is associated with their codon, amino acid, and GC content. Yet, coding sequence motifs that predictably alter mRNA stability in human cells remain poorly defined. Here, we develop a massively parallel assay to measure mRNA effects of thousands of synthetic and endogenous coding sequence motifs in human cells. We identify several families of simple dipeptide repeats whose translation triggers mRNA destabilization. Rather than individual amino acids, specific combinations of bulky and positively charged amino acids are critical for the destabilizing effects of dipeptide repeats. Remarkably, dipeptide sequences that form extended β strands in silico and in vitro slowdown ribosomes and reduce mRNA levels in vivo. The resulting nascent peptide code underlies the mRNA effects of hundreds of endogenous peptide sequences in the human proteome. Our work suggests an intrinsic role for the ribosome as a selectivity filter against the synthesis of bulky and aggregation-prone peptides. 
    more » « less
  2. Abstract Various messenger RNA (mRNA) decay mechanisms play major roles in controlling mRNA quality and quantity in eukaryotic organisms under different conditions. While it is known that the recently discovered co‐translational mRNA decay (CTRD), the mechanism that allows mRNAs to be degraded while still being actively translated, is prevalent in yeast, humans, and various angiosperms, the regulation of this decay mechanism is less well studied. Moreover, it is still unclear whether this decay mechanism plays any role in the regulation of specific physiological processes in eukaryotes. Here, by re‐analyzing the publicly available polysome profiling or ribosome footprinting and degradome sequencing datasets, we discovered that highly translated mRNAs tend to have lower co‐translational decay levels. Based on this finding, we then identified Pelota and Hbs1, the translation‐related ribosome rescue factors, as suppressors of co‐translational mRNA decay in Arabidopsis. Furthermore, we found that Pelota and Hbs1 null mutants have lower germination rates compared to the wild‐type plants, implying that proper regulation of co‐translational mRNA decay is essential for normal developmental processes. In total, our study provides further insights into the regulation of CTRD in Arabidopsis and demonstrates that this decay mechanism does play important roles in Arabidopsis physiological processes. 
    more » « less
  3. Levy, Yaakov Koby (Ed.)
    Co-assembling peptides can be crafted into supramolecular biomaterials for use in biotechnological applications, such as cell culture scaffolds, drug delivery, biosensors, and tissue engineering. Peptide co-assembly refers to the spontaneous organization of two different peptides into a supramolecular architecture. Here we use molecular dynamics simulations to quantify the effect of anionic amino acid type on co-assembly dynamics and nanofiber structure in binary CATCH(+/-) peptide systems. CATCH peptide sequences follow a general pattern: CQCFCFCFCQC, where all C’s are either a positively charged or a negatively charged amino acid. Specifically, we investigate the effect of substituting aspartic acid residues for the glutamic acid residues in the established CATCH(6E-) molecule, while keeping CATCH(6K+) unchanged. Our results show that structures consisting of CATCH(6K+) and CATCH(6D-) form flatter β-sheets, have stronger interactions between charged residues on opposing β-sheet faces, and have slower co-assembly kinetics than structures consisting of CATCH(6K+) and CATCH(6E-). Knowledge of the effect of sidechain type on assembly dynamics and fibrillar structure can help guide the development of advanced biomaterials and grant insight into sequence-to-structure relationships. 
    more » « less
  4. Abstract MotivationThe mapping from codon to amino acid is surjective due to codon degeneracy, suggesting that codon space might harbor higher information content. Embeddings from the codon language model have recently demonstrated success in various protein downstream tasks. However, predictive models for residue-level tasks such as phosphorylation sites, arguably the most studied Post-Translational Modification (PTM), and PTM sites prediction in general, have predominantly relied on representations in amino acid space. ResultsWe introduce a novel approach for predicting phosphorylation sites by utilizing codon-level information through embeddings from the codon adaptation language model (CaLM), trained on protein-coding DNA sequences. Protein sequences are first reverse-translated into reliable coding sequences by mapping UniProt sequences to their corresponding NCBI reference sequences and extracting the exact coding sequences from their GenBank format using a dynamic programming-based global pairwise alignment. The resulting coding sequences are encoded using the CaLM encoder to generate codon-aware embeddings, which are subsequently integrated with amino acid-aware embeddings obtained from a protein language model, through an early fusion strategy. Next, a window-level representation of the site of interest, retaining the full sequence context, is constructed from the fused embeddings. A ConvBiGRU network extracts feature maps that capture spatiotemporal correlations between proximal residues within the window. This is followed by a prediction head based on a Kolmogorov-Arnold network (KAN) using the derivative of gaussian wavelet transform to generate the inference for the site. The overall model, dubbed CaLMPhosKAN, performs better than the existing approaches across multiple datasets. Availability and implementationCaLMPhosKAN is publicly available at https://github.com/KCLabMTU/CaLMPhosKAN. 
    more » « less
  5. Abstract RNA turnover is essential in maintaining messenger RNA (mRNA) homeostasis during various developmental stages and stress responses. Co‐translational mRNA decay (CTRD), a process in which mRNAs are degraded while still associated with translating ribosomes, has recently been discovered to function in yeast and three angiosperm transcriptomes. However, it is still unclear how prevalent CTRD across the plant lineage. Moreover, the sequence features of co‐translationally decayed mRNAs have not been well‐studied. Here, utilizing a collection of publicly available degradome sequencing datasets for another seven angiosperm transcriptomes, we have confirmed that CTRD is functioning in at least 10 angiosperms and likely throughout the plant lineage. Additionally, we have identified sequence features shared by the co‐translationally decayed mRNAs in these species, implying a possible conserved triggering mechanism for this pathway. Given that degradome sequencing datasets can also be used to identify actively translating upstream open reading frames (uORFs), which are quite understudied in plants, we have identified numerous actively translating uORFs in the same 10 angiosperms. These findings reveal that actively translating uORFs are prevalent in plant transcriptomes, some of which are conserved across this lineage. We have also observed conserved sequence features in the regions flanking these uORFs' stop codons that might contribute to ribosome stalling at these sequences. Finally, we discovered that there were very few overlaps between the mRNAs harboring actively translating uORFs and those sorted into the co‐translational decay pathway in the majority of the studied angiosperms, suggesting that these two processes might be nearly mutually exclusive in those species. In total, our findings provide the identification of CTRD and actively translating uORFs across a broad collection of plants and provide novel insights into the important sequence features associated with these collections of mRNAs and regulatory elements, respectively. 
    more » « less