skip to main content


Title: Structural and functional characterization of a putative de novo gene in Drosophila
Abstract Comparative genomic studies have repeatedly shown that new protein-coding genes can emerge de novo from noncoding DNA. Still unknown is how and when the structures of encoded de novo proteins emerge and evolve. Combining biochemical, genetic and evolutionary analyses, we elucidate the function and structure of goddard , a gene which appears to have evolved de novo at least 50 million years ago within the Drosophila genus. Previous studies found that goddard is required for male fertility. Here, we show that Goddard protein localizes to elongating sperm axonemes and that in its absence, elongated spermatids fail to undergo individualization. Combining modelling, NMR and circular dichroism (CD) data, we show that Goddard protein contains a large central α -helix, but is otherwise partially disordered. We find similar results for Goddard’s orthologs from divergent fly species and their reconstructed ancestral sequences. Accordingly, Goddard’s structure appears to have been maintained with only minor changes over millions of years.  more » « less
Award ID(s):
1652013
NSF-PAR ID:
10217199
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
Nature Communications
Volume:
12
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important func- tional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila mela- nogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. 
    more » « less
  2. Phillips, Margaret (Ed.)
    ABSTRACT Trypanosoma brucei , the causative agent of human and animal African trypanosomiasis, cycles between a mammalian host and a tsetse fly vector. The parasite undergoes huge changes in morphology and metabolism during adaptation to each host environment. These changes are reflected in the different transcriptomes of parasites living in each host. However, it remains unclear whether chromatin-interacting proteins help mediate these changes. Bromodomain proteins localize to transcription start sites in bloodstream parasites, but whether the localization of bromodomain proteins changes as parasites differentiate from bloodstream to insect stages remains unknown. To address this question, we performed cleavage under target and release using nuclease (CUT&RUN) against bromodomain protein 3 (Bdf3) in parasites differentiating from bloodstream to insect forms. We found that Bdf3 occupancy at most loci increased at 3 h following onset of differentiation and decreased thereafter. A number of sites with increased bromodomain protein occupancy lie proximal to genes with altered transcript levels during differentiation, such as procyclins, procyclin-associated genes, and invariant surface glycoproteins. Most Bdf3-occupied sites are observed throughout differentiation. However, one site appears de novo during differentiation and lies proximal to the procyclin gene locus housing genes essential for remodeling surface proteins following transition to the insect stage. These studies indicate that occupancy of chromatin-interacting proteins is dynamic during life cycle stage transitions and provide the groundwork for future studies on the effects of changes in bromodomain protein occupancy. Additionally, the adaptation of CUT&RUN for Trypanosoma brucei provides other researchers with an alternative to chromatin immunoprecipitation (ChIP). IMPORTANCE The parasite Trypanosoma brucei is the causative agent of human and animal African trypanosomiasis (sleeping sickness). Trypanosomiasis, which affects humans and cattle, is fatal if untreated. Existing drugs have significant side effects. Thus, these parasites impose a significant human and economic burden in sub-Saharan Africa, where trypanosomiasis is endemic. T. brucei cycles between the mammalian host and a tsetse fly vector, and parasites undergo huge changes in morphology and metabolism to adapt to different hosts. Here, we show that DNA-interacting bromodomain protein 3 (Bdf3) shows changes in occupancy at its binding sites as parasites transition from the bloodstream to the insect stage. Additionally, a new binding site appears near the locus responsible for remodeling of parasite surface proteins during transition to the insect stage. Understanding the mechanisms behind host adaptation is important for understanding the life cycle of the parasite. 
    more » « less
  3. Mank, Judith (Ed.)
    Abstract The X chromosome of therian mammals shows strong conservation among distantly related species, limiting insights into the distinct selective processes that have shaped sex chromosome evolution. We constructed a chromosome-scale de novo genome assembly for the Siberian dwarf hamster (Phodopus sungorus), a species reported to show extensive recombination suppression across an entire arm of the X chromosome. Combining a physical genome assembly based on shotgun and long-range proximity ligation sequencing with a dense genetic map, we detected widespread suppression of female recombination across ∼65% of the Phodopus X chromosome. This region of suppressed recombination likely corresponds to the Xp arm, which has previously been shown to be highly heterochromatic. Using additional sequencing data from two closely related species (P. campbelli and P. roborovskii), we show that recombination suppression on Xp appears to be independent of major structural rearrangements. The suppressed Xp arm was enriched for several transposable element families and de-enriched for genes primarily expressed in placenta, but otherwise showed similar gene densities, expression patterns, and rates of molecular evolution when compared to the recombinant Xq arm. Phodopus Xp gene content and order was also broadly conserved relative to the more distantly related rat X chromosome. These data suggest that widespread suppression of recombination has likely evolved through the transient induction of facultative heterochromatin on the Phodopus Xp arm without major changes in chromosome structure or genetic content. Thus, substantial changes in the recombination landscape have so far had relatively subtle influences on patterns of X-linked molecular evolution in these species. 
    more » « less
  4. Survival of pediatric AML remains poor despite maximized myelosuppressive therapy. The pneumocystis jiroveci pneumonia (PJP)-treating medication atovaquone (AQ) suppresses oxidative phosphorylation (OXPHOS) and reduces AML burden in patient-derived xenograft (PDX) mouse models, making it an ideal concomitant AML therapy. Poor palatability and limited product formulations have historically limited routine use of AQ in pediatric AML patients. Patients with de novo AML were enrolled at two hospitals. Daily AQ at established PJP dosing was combined with standard AML therapy, based on the Medical Research Council backbone. AQ compliance, adverse events (AEs), ease of administration score (scale: 1 (very difficult)-5 (very easy)) and blood/marrow pharmacokinetics (PK) were collected during Induction 1. Correlative studies assessed AQ-induced apoptosis and effects on OXPHOS. PDX models were treated with AQ. A total of 26 patients enrolled (ages 7.2 months–19.7 years, median 12 years); 24 were evaluable. A total of 14 (58%) and 19 (79%) evaluable patients achieved plasma concentrations above the known anti-leukemia concentration (>10 µM) by day 11 and at the end of Induction, respectively. Seven (29%) patients achieved adequate concentrations for PJP prophylaxis (>40 µM). Mean ease of administration score was 3.8. Correlative studies with AQ in patient samples demonstrated robust apoptosis, OXPHOS suppression, and prolonged survival in PDX models. Combining AQ with chemotherapy for AML appears feasible and safe in pediatric patients during Induction 1 and shows single-agent anti-leukemic effects in PDX models. AQ appears to be an ideal concomitant AML therapeutic but may require intra-patient dose adjustment to achieve concentrations sufficient for PJP prophylaxis.

     
    more » « less
  5. Slotte, Tanja (Ed.)
    Abstract Intracellular transfers of mitochondrial DNA continue to shape nuclear genomes. Chromosome 2 of the model plant Arabidopsis thaliana contains one of the largest known nuclear insertions of mitochondrial DNA (numts). Estimated at over 600 kb in size, this numt is larger than the entire Arabidopsis mitochondrial genome. The primary Arabidopsis nuclear reference genome contains less than half of the numt because of its structural complexity and repetitiveness. Recent data sets generated with improved long-read sequencing technologies (PacBio HiFi) provide an opportunity to finally determine the accurate sequence and structure of this numt. We performed a de novo assembly using sequencing data from recent initiatives to span the Arabidopsis centromeres, producing a gap-free sequence of the Chromosome 2 numt, which is 641 kb in length and has 99.933% nucleotide sequence identity with the actual mitochondrial genome. The numt assembly is consistent with the repetitive structure previously predicted from fiber-based fluorescent in situ hybridization. Nanopore sequencing data indicate that the numt has high levels of cytosine methylation, helping to explain its biased spectrum of nucleotide sequence divergence and supporting previous inferences that it is transcriptionally inactive. The original numt insertion appears to have involved multiple mitochondrial DNA copies with alternative structures that subsequently underwent an additional duplication event within the nuclear genome. This work provides insights into numt evolution, addresses one of the last unresolved regions of the Arabidopsis reference genome, and represents a resource for distinguishing between highly similar numt and mitochondrial sequences in studies of transcription, epigenetic modifications, and de novo mutations. 
    more » « less