skip to main content

Title: Comprehensive database and evolutionary dynamics of U12-type introns
Abstract During nuclear maturation of most eukaryotic pre-messenger RNAs and long non-coding RNAs, introns are removed through the process of RNA splicing. Different classes of introns are excised by the U2-type or the U12-type spliceosomes, large complexes of small nuclear ribonucleoprotein particles and associated proteins. We created intronIC, a program for assigning intron class to all introns in a given genome, and used it on 24 eukaryotic genomes to create the Intron Annotation and Orthology Database (IAOD). We then used the data in the IAOD to revisit several hypotheses concerning the evolution of the two classes of spliceosomal introns, finding support for the class conversion model explaining the low abundance of U12-type introns in modern genomes.  more » « less
Award ID(s):
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Nucleic Acids Research
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Ouangraoua, Aida (Ed.)
    Abstract Previous evolutionary reconstructions have concluded that early eukaryotic ancestors including both the last common ancestor of eukaryotes and of all fungi had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors differ from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungi species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression. 
    more » « less
  2. Abstract

    The oceanic igneous crust is a vast reservoir for microbial life, dominated by diverse and active bacteria, archaea, and fungi. Archaeal and bacterial viruses were previously detected in oceanic crustal fluids at the Juan de Fuca Ridge (JdFR). Here we report the discovery of two eukaryotic Nucleocytoviricota genomes from the same crustal fluids by sorting and sequencing single virions. Both genomes have a tRNATyrgene with an intron (20 bps) at the canonical position between nucleotide 37 and 38, a common feature in eukaryotic and archaeal tRNA genes with short introns (<100 bps), and fungal genes acquired through horizontal gene transfer (HGT) events. The dominance ofAscomycotafungi as the main eukaryotes in crustal fluids and the evidence for HGT point to these fungi as the putative hosts, making these the first putative fungi-Nucleocytoviricota specific association. Our study suggests active host-viral dynamics for the only eukaryotic group found in the subsurface oceanic crust and raises important questions about the impact of viral infection on the productivity and biogeochemical cycling in this ecosystem.

    more » « less
  3. Circular RNAs (circRNAs) are a recently discovered class of RNAs derived from protein-coding genes that have important biological and pathological roles. They are formed through backsplicing during co-transcriptional alternative splicing; however, the unified mechanism that accounts for backsplicing decisions remains unclear. Factors that regulate the transcriptional timing and spatial organization of pre-mRNA, including RNAPII kinetics, the availability of splicing factors, and features of gene architecture, have been shown to influence backsplicing decisions. Poly (ADP-ribose) polymerase I (PARP1) regulates alternative splicing through both its presence on chromatin as well as its PARylation activity. However, no studies have investigated PARP1’s possible role in regulating circRNA biogenesis. Here, we hypothesized that PARP1’s role in splicing extends to circRNA biogenesis. Our results identify many unique circRNAs in PARP1 depletion and PARylation-inhibited conditions compared to the wild type. We found that while all genes producing circRNAs share gene architecture features common to circRNA host genes, genes producing circRNAs in PARP1 knockdown conditions had longer upstream introns than downstream introns, whereas flanking introns in wild type host genes were symmetrical. Interestingly, we found that the behavior of PARP1 in regulating RNAPII pausing is distinct between these two classes of host genes. We conclude that the PARP1 pausing of RNAPII works within the context of gene architecture to regulate transcriptional kinetics, and therefore circRNA biogenesis. Furthermore, this regulation of PARP1 within host genes acts to fine tune their transcriptional output with implications in gene function. 
    more » « less
  4. null (Ed.)
    Abstract The enormous sequence heterogeneity of telomerase RNA (TR) subunits has thus far complicated their characterization in a wider phylogenetic range. Our recent finding that land plant TRs are, similarly to known ciliate TRs, transcribed by RNA polymerase III and under the control of the type-3 promoter, allowed us to design a novel strategy to characterize TRs in early diverging Viridiplantae taxa, as well as in ciliates and other Diaphoretickes lineages. Starting with the characterization of the upstream sequence element of the type 3 promoter that is conserved in a number of small nuclear RNAs, and the expected minimum TR template region as search features, we identified candidate TRs in selected Diaphoretickes genomes. Homologous TRs were then used to build covariance models to identify TRs in more distant species. Transcripts of the identified TRs were confirmed by transcriptomic data, RT-PCR and Northern hybridization. A templating role for one of our candidates was validated in Physcomitrium patens. Analysis of secondary structure demonstrated a deep conservation of motifs (pseudoknot and template boundary element) observed in all published TRs. These results elucidate the evolution of the earliest eukaryotic TRs, linking the common origin of TRs across Diaphoretickes, and underlying evolutionary transitions in telomere repeats. 
    more » « less
  5. null (Ed.)
    Even in well-characterized genomes, many transcripts are considered noncoding RNAs (ncRNAs) simply due to the absence of large open reading frames (ORFs). However, it is now becoming clear that many small ORFs (smORFs) produce peptides with important biological functions. In the process of characterizing the ribosome-bound transcriptome of an important cell type of the seminal fluid-producing accessory gland of Drosophila melanogaster , we detected an RNA, previously thought to be noncoding, called male-specific abdominal ( msa ). Notably, msa is nested in the HOX gene cluster of the Bithorax complex and is known to contain a micro-RNA within one of its introns. We find that this RNA encodes a “micropeptide” (9 or 20 amino acids, MSAmiP) that is expressed exclusively in the secondary cells of the male accessory gland, where it seems to accumulate in nuclei. Importantly, loss of function of this micropeptide causes defects in sperm competition. In addition to bringing insights into the biology of a rare cell type, this work underlines the importance of small peptides, a class of molecules that is now emerging as important actors in complex biological processes. 
    more » « less