Title: The evolutionary history of small RNAs in Solanaceae
Abstract: The Solanaceae or “nightshade” family is an economically important group with remarkable diversity. To gain a better understanding of how the unique biology of the Solanaceae relates to the family’s small RNA (sRNA) genomic landscape, we downloaded over 255 publicly available sRNA data sets that comprise over 2.6 billion reads of sequence data. We applied a suite of computational tools to predict and annotate two major sRNA classes: (1) microRNAs (miRNAs), typically 20- to 22-nucleotide (nt) RNAs generated from a hairpin precursor and functioning in gene silencing and (2) short interfering RNAs (siRNAs), including 24-nt heterochromatic siRNAs typically functioning to repress repetitive regions of the genome via RNA-directed DNA methylation, as well as secondary phased siRNAs and trans-acting siRNAs generated via miRNA-directed cleavage of a polymerase II-derived RNA precursor. Our analyses described thousands of sRNA loci, including poorly understood clusters of 22-nt siRNAs that accumulate during viral infection. The birth, death, expansion, and contraction of these sRNA loci are dynamic evolutionary processes that characterize the Solanaceae family. These analyses indicate that individuals within the same genus share similar sRNA landscapes, whereas comparisons between distinct genera within the Solanaceae reveal relatively few commonalities.
Award ID(s):
1842698 1942437 1754097
NSF-PAR ID:
10345716
Journal Name:
Plant Physiology
ISSN:
0032-0889
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Small RNAs are abundant in plant reproductive tissues, especially 24-nucleotide (nt) small interfering RNAs (siRNAs). Most 24-nt siRNAs are dependent on RNA Pol IV and RNA-DEPENDENT RNA POLYMERASE 2 (RDR2) and establish DNA methylation at thousands of genomic loci in a process called RNA-directed DNA methylation (RdDM). In Brassica rapa, RdDM is required in the maternal sporophyte for successful seed development. Here, we demonstrate that a small number of siRNA loci account for over 90% of siRNA expression during B. rapa seed development. These loci exhibit unique characteristics with regard to their copy number and association with genomic features, but they resemble canonical 24-nt siRNA loci in their dependence on RNA Pol IV/RDR2 and role in RdDM. These loci are expressed in ovules before fertilization and in the seed coat, embryo, and endosperm following fertilization. We observed a similar pattern of 24-nt siRNA expression in diverse angiosperms despite rapid sequence evolution at siren loci. In the endosperm, siren siRNAs show a marked maternal bias, and siren expression in maternal sporophytic tissues is required for siren siRNA accumulation. Together, these results demonstrate that seed development occurs under the influence of abundant maternal siRNAs that might be transported to, and function in, filial tissues.
  2. RNA silencing pathways control eukaryotic gene expression transcriptionally or posttranscriptionally in a sequence-specific manner. In RNA silencing, the production of double-stranded RNA (dsRNA) gives rise to various classes of 20–24 nucleotide (nt) small RNAs (smRNAs). In Arabidopsis thaliana, smRNAs are often derived from long dsRNA molecules synthesized by one of the six genomically encoded RNA-dependent RNA Polymerase (RDR) proteins. However, the full complement of the RDR-dependent smRNAs and functions that these proteins and their RNA-binding cofactors play in plant RNA silencing has not been fully uncovered. To address this gap, we performed a global genomic analysis of all six RDRs and two of their cofactors to find new substrates for RDRs and targets of the resulting RDR-derived siRNAs to uncover new functions for these proteins in plants. Based on these analyses, we identified substrates for the three RDRγ clade proteins (RDR3–5), which had not been well-characterized previously. We also identified new substrates for the other three RDRs (RDR1, RDR2, and RDR6) as well as the RDR2 cofactor RNA-directed DNA methylation 12 (RDM12) and the RDR6 cofactor suppressor of gene silencing 3 (SGS3). These findings revealed that the target substrates of SGS3 are not limited to those solely utilized by RDR6, but that this protein seems to be a more general cofactor for the RDR family of proteins. Additionally, we found that RDR6 and SGS3 are involved in the production of smRNAs that target transcripts related to abiotic stresses, including water deprivation, salt stress, and ABA response, and as expected the levels of these mRNAs are increased in rdr6 and sgs3 mutant plants. Correspondingly, plants that lack these proteins (rdr6 and sgs3 mutants) are hypersensitive to ABA treatment, tolerant to high levels of PEG8000, and have a higher survival rate under salt treatment in comparison to wild-type plants. In total, our analyses have provided an extremely data-rich resource for uncovering new functions of RDR-dependent RNA silencing in plants, while also revealing a previously unexplored link between the RDR6/SGS3-dependent pathway and plant abiotic stress responses.
  3. In monocots other than maize (Zea mays) and rice (Oryza sativa), the repertoire and diversity of microRNAs (miRNAs) and the populations of phased, secondary, small interfering RNAs (phasiRNAs) are poorly characterized. To remedy this, we sequenced small RNAs (sRNAs) from vegetative and dissected inflorescence tissue in 28 phylogenetically diverse monocots and from several early-diverging angiosperm lineages, and also analyzed publicly available data from 10 additional monocot species. We annotated miRNAs, small interfering RNAs (siRNAs) and phasiRNAs across the monocot phylogeny, identifying miRNAs apparently lost or gained in the grasses relative to other monocot families, as well as a number of transfer RNA fragments misannotated as miRNAs. Using our miRNA database cleaned of these misannotations, we identified conservation at the 8th, 9th, 19th, and 3′-end positions that we hypothesize are signatures of selection for processing, targeting, or Argonaute sorting. We show that 21-nucleotide (nt) reproductive phasiRNAs are far more numerous in grass genomes than in other monocots. Based on sequenced monocot genomes and transcriptomes, DICER-LIKE5, important to 24-nt phasiRNA biogenesis, likely originated via gene duplication before the diversification of the grasses. This curated database of phylogenetically diverse monocot miRNAs, siRNAs, and phasiRNAs represents a large collection of data that should facilitate continued exploration of sRNA diversification in flowering plants.
  4. Several protein families participate in the biogenesis and function of small RNAs (sRNAs) in plants. Those with primary roles include Dicer-like (DCL), RNA-dependent RNA polymerase (RDR), and Argonaute (AGO) proteins. Protein families such as double-stranded RNA-binding (DRB), SERRATE (SE), and SUPPRESSOR OF GENE SILENCING 3 (SGS3) act as partners of DCL or RDR proteins. Here, we present curated annotations and phylogenetic analyses of seven sRNA pathway protein families performed on 196 species in the Viridiplantae (green plants) lineage. Our results suggest that the RDR3 proteins emerged earlier than RDR1/2/6. RDR6 is found in filamentous green algae and all land plants, suggesting that the evolution of RDR6 proteins coincides with the evolution of phased small interfering RNAs (siRNAs). We traced the origin of the 24-nt reproductive phased siRNA-associated DCL5 protein back to the American sweet flag (Acorus americanus), the earliest diverged, extant monocot species. Our analyses of AGOs identified multiple duplication events of AGO genes that were lost, retained, or further duplicated in subgroups, indicating that the evolution of AGOs is complex in monocots. The results also refine the evolution of several clades of AGO proteins, such as AGO4, AGO6, AGO17, and AGO18. Analyses of nuclear localization signal sequences and catalytic triads of AGO proteins shed light on the regulatory roles of diverse AGOs. Collectively, this work generates a curated and evolutionarily coherent annotation for gene families involved in plant sRNA biogenesis/function and provides insights into the evolution of major sRNA pathways.
  5. Twenty-four-nucleotide (nt) small interfering RNAs (siRNAs) maintain asymmetric DNA methylation at thousands of euchromatic transposable elements in plant genomes in a process called RNA-directed DNA methylation (RdDM). RdDM is dispensable for growth and development in Arabidopsis thaliana, but is required for reproduction in other plants, such as Brassica rapa. The 24-nt siRNAs are abundant in maternal reproductive tissue, due largely to overwhelming expression from a few loci in the ovule and developing seed coat, termed siren loci. A recent study showed that 24-nt siRNAs produced in the anther tapetal tissue can methylate male meiocyte genes in trans. Here we show that in B. rapa, a similar process takes place in female tissue. siRNAs are produced from gene fragments embedded in some siren loci, and these siRNAs can trigger methylation in trans at related protein-coding genes. This trans-methylation is associated with silencing of some target genes and may be responsible for seed abortion in RdDM mutants. Furthermore, we demonstrate that a consensus sequence in at least two families of DNA transposons is associated with abundant siren expression, most likely through recruitment of CLASSY3, a putative chromatin remodeler. This research describes a mechanism whereby RdDM influences gene expression and sheds light on the role of RdDM during plant reproduction.