Title: Identification of a micropeptide and multiple secondary cell genes that modulate Drosophila male reproductive success
Even in well-characterized genomes, many transcripts are considered noncoding RNAs (ncRNAs) simply due to the absence of large open reading frames (ORFs). However, it is now becoming clear that many small ORFs (smORFs) produce peptides with important biological functions. In the process of characterizing the ribosome-bound transcriptome of an important cell type of the seminal fluid-producing accessory gland of Drosophila melanogaster, we detected an RNA, previously thought to be noncoding, called male-specific abdominal (msa). Notably, msa is nested in the HOX gene cluster of the Bithorax complex and is known to contain a micro-RNA within one of its introns. We find that this RNA encodes a “micropeptide” (9 or 20 amino acids, MSAmiP) that is expressed exclusively in the secondary cells of the male accessory gland, where it seems to accumulate in nuclei. Importantly, loss of function of this micropeptide causes defects in sperm competition. In addition to bringing insights into the biology of a rare cell type, this work underlines the importance of small peptides, a class of molecules now emerging as important actors in complex biological processes.
Award ID(s):
1659534
NSF-PAR ID:
10225170
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
118
Issue:
15
ISSN:
0027-8424
Page Range / eLocation ID:
e2001897118
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Key message: Arabidopsis pollen transcriptome analysis revealed new intergenic transcripts of unknown function, many of which are long non-coding RNAs, that may function in pollen-specific processes, including the heat stress response.

    The male gametophyte is the most heat sensitive of all plant tissues. In recent years, long noncoding RNAs (lncRNAs) have emerged as important components of cellular regulatory networks involved in most biological processes, including response to stress. While examining RNAseq datasets of developing and germinating Arabidopsis thaliana pollen exposed to heat stress (HS), we identified 66 novel and 246 recently annotated intergenic expressed loci (XLOCs) of unknown function, with the majority encoding lncRNAs. Comparison with HS in cauline leaves and other RNAseq experiments indicated that 74% of the 312 XLOCs are pollen-specific, and at least 42% are HS-responsive. Phylogenetic analysis revealed that 96% of the genes evolved recently in Brassicaceae. We found that 50 genes are putative targets of microRNAs and that 30% of the XLOCs contain small open reading frames (ORFs) with homology to protein sequences. Finally, RNAseq of ribosome-protected RNA fragments together with predictions of periodic footprint of the ribosome P-sites indicated that 23 of these ORFs are likely to be translated. Our findings indicate that many of the 312 unknown genes might be functional and play a significant role in pollen biology, including the HS response.
  2. Abstract

    Early lineage diversification is central to understanding what mutational events drive species divergence. Particularly, gene misregulation in interspecific hybrids can inform about what genes and pathways underlie hybrid dysfunction. In Drosophila hybrids, how regulatory evolution impacts different reproductive tissues remains understudied. Here, we generate a new genome assembly and annotation in Drosophila willistoni and analyse the patterns of transcriptome divergence between two allopatrically evolved D. willistoni subspecies and their male sterile and female fertile hybrid progeny across testis, male accessory gland, and ovary. Patterns of transcriptome divergence and modes of regulatory evolution were tissue‐specific. Despite no indication for cell‐type differences in hybrid testis, this tissue exhibited the largest magnitude of expression differentiation between subspecies and between parentals and hybrids. No evidence for anomalous dosage compensation in hybrid male tissues was detected, nor was a differential role for the neo‐ and the ancestral arms of the D. willistoni X chromosome. Compared to the autosomes, the X chromosome appeared enriched for transgressively expressed genes in testis despite being the least differentiated in expression between subspecies. Evidence for fine genome clustering of transgressively expressed genes suggests a role of chromatin structure on hybrid gene misregulation. Lastly, transgressively expressed genes in the testis of the sterile male progeny were enriched for GO terms not typically associated with sperm function, instead hinting at anomalous development of the reproductive tissue. Our thorough tissue‐level portrait of transcriptome differentiation between recently diverged D. willistoni subspecies and their hybrids provides a more nuanced view of early regulatory changes during speciation.

     
    more » « less
  3. Hughes, T (Ed.)
    Abstract The germline-soma divide is a fundamental distinction in developmental biology, and different genes are expressed in germline and somatic cells throughout metazoan life cycles. Ciliates, a group of microbial eukaryotes, exhibit germline-somatic nuclear dimorphism within a single cell with two different genomes. The ciliate Oxytricha trifallax undergoes massive RNA-guided DNA elimination and genome rearrangement to produce a new somatic macronucleus (MAC) from a copy of the germline micronucleus (MIC). This process eliminates noncoding DNA sequences that interrupt genes and also deletes hundreds of germline-limited open reading frames (ORFs) that are transcribed during genome rearrangement. Here, we update the set of transcribed germline-limited ORFs (TGLOs) in O. trifallax. We show that TGLOs tend to be expressed during nuclear development and then are absent from the somatic MAC. We also demonstrate that exposure to synthetic RNA can reprogram TGLO retention in the somatic MAC and that TGLO retention leads to transcription outside the normal developmental program. These data suggest that TGLOs represent a group of developmentally regulated protein-coding sequences whose gene expression is terminated by DNA elimination. 
  4. Abstract

    MicroRNAs (miRNAs) are a group of small noncoding RNAs that regulate gene expression during important biological processes including development and pathogen defense in most living organisms. Presently, no miRNAs have been identified in the mosquito Culex tarsalis (Diptera: Culicidae), one of the most important vectors of West Nile virus (WNV) in North America. We used small RNA sequencing data and in vitro and in vivo experiments to identify and validate a repertoire of miRNAs in Cx. tarsalis mosquitoes. Using bioinformatic approaches we analyzed small RNA sequences from the Cx. tarsalis CT embryonic cell line to discover orthologs for 86 miRNAs. Consistent with other mosquitoes such as Aedes albopictus and Culex quinquefasciatus, miR-184 was found to be the most abundant miRNA in Cx. tarsalis. We also identified 20 novel miRNAs from the recently sequenced Cx. tarsalis genome, for a total of 106 miRNAs identified in this study. The presence of selected miRNAs was biologically validated in both the CT cell line and in adult Cx. tarsalis mosquitoes using RT–qPCR and sequencing. These results will open new avenues of research into the role of miRNAs in Cx. tarsalis biology, including development, metabolism, immunity, and pathogen infection.

     
    more » « less
  5. Abstract

    How recently originated gene copies become stable genomic components remains uncertain as high sequence similarity of young duplicates precludes their functional characterization. The tandem multigene family Sdic is specific to Drosophila melanogaster and has been annotated across multiple reference-quality genome assemblies. Here we show the existence of a positive correlation between Sdic copy number and total expression, plus vast intrastrain differences in mRNA abundance among paralogs, using RNA-sequencing from testis of four strains with variable paralog composition. Single cell and nucleus RNA-sequencing data expose paralog expression differentiation in meiotic cell types within testis from third instar larva and adults. Additional RNA-sequencing across synthetic strains only differing in their Y chromosomes reveals a tissue-dependent trans-regulatory effect on Sdic: upregulation in testis and downregulation in male accessory gland. By leveraging paralog-specific expression information from tissue- and cell-specific data, our results elucidate the intraspecific functional diversification of a recently expanded tandem gene family.

     
    more » « less