Title: Identification of a micropeptide and multiple secondary cell genes that modulate Drosophila male reproductive success
Even in well-characterized genomes, many transcripts are considered noncoding RNAs (ncRNAs) simply due to the absence of large open reading frames (ORFs). However, it is now becoming clear that many small ORFs (smORFs) produce peptides with important biological functions. In the process of characterizing the ribosome-bound transcriptome of an important cell type of the seminal fluid-producing accessory gland of Drosophila melanogaster, we detected an RNA, previously thought to be noncoding, called male-specific abdominal (msa). Notably, msa is nested in the HOX gene cluster of the Bithorax complex and is known to contain a micro-RNA within one of its introns. We find that this RNA encodes a “micropeptide” (9 or 20 amino acids, MSAmiP) that is expressed exclusively in the secondary cells of the male accessory gland, where it seems to accumulate in nuclei. Importantly, loss of function of this micropeptide causes defects in sperm competition. In addition to bringing insights into the biology of a rare cell type, this work underlines the importance of small peptides, a class of molecules that is now emerging as important actors in complex biological processes.
Award ID(s):
1659534
PAR ID:
10225170
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
118
Issue:
15
ISSN:
0027-8424
Page Range / eLocation ID:
e2001897118
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Key message: Arabidopsis pollen transcriptome analysis revealed new intergenic transcripts of unknown function, many of which are long non-coding RNAs, that may function in pollen-specific processes, including the heat stress response.
    Abstract The male gametophyte is the most heat sensitive of all plant tissues. In recent years, long noncoding RNAs (lncRNAs) have emerged as important components of cellular regulatory networks involved in most biological processes, including response to stress. While examining RNAseq datasets of developing and germinating Arabidopsis thaliana pollen exposed to heat stress (HS), we identified 66 novel and 246 recently annotated intergenic expressed loci (XLOCs) of unknown function, with the majority encoding lncRNAs. Comparison with HS in cauline leaves and other RNAseq experiments indicated that 74% of the 312 XLOCs are pollen-specific, and at least 42% are HS-responsive. Phylogenetic analysis revealed that 96% of the genes evolved recently in Brassicaceae. We found that 50 genes are putative targets of microRNAs and that 30% of the XLOCs contain small open reading frames (ORFs) with homology to protein sequences. Finally, RNAseq of ribosome-protected RNA fragments together with predictions of periodic footprint of the ribosome P-sites indicated that 23 of these ORFs are likely to be translated. Our findings indicate that many of the 312 unknown genes might be functional and play a significant role in pollen biology, including the HS response.
  2. Hughes, T (Ed.)
    Abstract The germline-soma divide is a fundamental distinction in developmental biology, and different genes are expressed in germline and somatic cells throughout metazoan life cycles. Ciliates, a group of microbial eukaryotes, exhibit germline-somatic nuclear dimorphism within a single cell with two different genomes. The ciliate Oxytricha trifallax undergoes massive RNA-guided DNA elimination and genome rearrangement to produce a new somatic macronucleus (MAC) from a copy of the germline micronucleus (MIC). This process eliminates noncoding DNA sequences that interrupt genes and also deletes hundreds of germline-limited open reading frames (ORFs) that are transcribed during genome rearrangement. Here, we update the set of transcribed germline-limited ORFs (TGLOs) in O. trifallax. We show that TGLOs tend to be expressed during nuclear development and then are absent from the somatic MAC. We also demonstrate that exposure to synthetic RNA can reprogram TGLO retention in the somatic MAC and that TGLO retention leads to transcription outside the normal developmental program. These data suggest that TGLOs represent a group of developmentally regulated protein-coding sequences whose gene expression is terminated by DNA elimination. 
  3. Abstract How the noncoding genome affects cellular functions is a key biological question. A particular challenge is to distinguish the effects of noncoding DNA elements from long noncoding RNAs (lncRNAs) that coincide at the same loci. Here, we identified the flowering‐associated intergenic lncRNA (FLAIL) in Arabidopsis through early flowering flail mutants. Expression of FLAIL RNA from a different chromosomal location in combination with strand‐specific RNA knockdown characterized FLAIL as a trans‐acting RNA molecule. FLAIL directly binds to differentially expressed target genes that control flowering via RNA–DNA interactions through conserved sequence motifs. FLAIL interacts with protein and RNA components of the spliceosome to affect target mRNA expression through co‐transcriptional alternative splicing (AS) and linked chromatin regulation. In the absence of FLAIL, splicing defects at the direct FLAIL target flowering gene LACCASE 8 (LAC8) correlated with reduced mRNA expression. Double mutant analyses support a model where FLAIL‐mediated splicing of LAC8 promotes its mRNA expression and represses flowering. Our study suggests lncRNAs as accessory components of the spliceosome that regulate AS and gene expression to impact organismal development.
  4. Abstract How recently originated gene copies become stable genomic components remains uncertain as high sequence similarity of young duplicates precludes their functional characterization. The tandem multigene family Sdic is specific to Drosophila melanogaster and has been annotated across multiple reference-quality genome assemblies. Here we show the existence of a positive correlation between Sdic copy number and total expression, plus vast intrastrain differences in mRNA abundance among paralogs, using RNA-sequencing from testis of four strains with variable paralog composition. Single cell and nucleus RNA-sequencing data expose paralog expression differentiation in meiotic cell types within testis from third instar larva and adults. Additional RNA-sequencing across synthetic strains only differing in their Y chromosomes reveal a tissue-dependent trans-regulatory effect on Sdic: upregulation in testis and downregulation in male accessory gland. By leveraging paralog-specific expression information from tissue- and cell-specific data, our results elucidate the intraspecific functional diversification of a recently expanded tandem gene family.
  5. Abstract MicroRNAs (miRNAs) are a group of small noncoding RNAs that regulate gene expression during important biological processes including development and pathogen defense in most living organisms. Presently, no miRNAs have been identified in the mosquito Culex tarsalis (Diptera: Culicidae), one of the most important vectors of West Nile virus (WNV) in North America. We used small RNA sequencing data and in vitro and in vivo experiments to identify and validate a repertoire of miRNAs in Cx. tarsalis mosquitoes. Using bioinformatic approaches we analyzed small RNA sequences from the Cx. tarsalis CT embryonic cell line to discover orthologs for 86 miRNAs. Consistent with other mosquitoes such as Aedes albopictus and Culex quinquefasciatus, miR-184 was found to be the most abundant miRNA in Cx. tarsalis. We also identified 20 novel miRNAs from the recently sequenced Cx. tarsalis genome, for a total of 106 miRNAs identified in this study. The presence of selected miRNAs was biologically validated in both the CT cell line and in adult Cx. tarsalis mosquitoes using RT–qPCR and sequencing. These results will open new avenues of research into the role of miRNAs in Cx. tarsalis biology, including development, metabolism, immunity, and pathogen infection. 