skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Long-read genome assemblies for the study of chromosome expansion: Drosophila kikkawai , Drosophila takahashii , Drosophila bipectinata , and Drosophila ananassae
Abstract Flow cytometry estimates of genome sizes among species of Drosophila show a 3-fold variation, ranging from ∼127 Mb in Drosophila mercatorum to ∼400 Mb in Drosophila cyrtoloma. However, the assembled portion of the Muller F element (orthologous to the fourth chromosome in Drosophila melanogaster) shows a nearly 14-fold variation in size, ranging from ∼1.3 Mb to >18 Mb. Here, we present chromosome-level long-read genome assemblies for 4 Drosophila species with expanded F elements ranging in size from 2.3 to 20.5 Mb. Each Muller element is present as a single scaffold in each assembly. These assemblies will enable new insights into the evolutionary causes and consequences of chromosome size expansion.  more » « less
Award ID(s):
2114661 1915544
PAR ID:
10506197
Author(s) / Creator(s):
; ; ; ; ; ;
Editor(s):
Vogel, K
Publisher / Repository:
Oxford University Press on behalf of The Genetics Society of America
Date Published:
Journal Name:
G3: Genes, Genomes, Genetics
Volume:
13
Issue:
10
ISSN:
2160-1836
Subject(s) / Keyword(s):
Keywords: Drosophila kikkawai strain 14028-0561.14 Drosophila takahashii strain IR98-3 E-12201 Drosophila bipectinata strain 14024-0381.07 Drosophila ananassae strain 14024-0371 13
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The F element of the Drosophila karyotype (the fourth chromosome in Drosophila melanogaster) is often referred to as the “dot chromosome” because of its appearance in a metaphase chromosome spread. This chromosome is distinct from other Drosophila autosomes in possessing both a high level of repetitious sequences (in particular, remnants of transposable elements) and a gene density similar to that found in the other chromosome arms, ∼80 genes distributed throughout its 1.3-Mb “long arm.” The dot chromosome is notorious for its lack of recombination and is often neglected as a consequence. This and other features suggest that the F element is packaged as heterochromatin throughout. F element genes have distinct characteristics (e.g., low codon bias, and larger size due both to larger introns and an increased number of exons), but exhibit expression levels comparable to genes found in euchromatin. Mapping experiments show the presence of appropriate chromatin modifications for the formation of DNaseI hypersensitive sites and transcript initiation at the 5′ ends of active genes, but, in most cases, high levels of heterochromatin proteins are observed over the body of these genes. These various features raise many interesting questions about the relationships of chromatin structures with gene and chromosome function. The apparent evolution of the F element as an autosome from an ancestral sex chromosome also raises intriguing questions. The findings argue that the F element is a unique chromosome that occupies its own space in the nucleus. Further study of the F element should provide new insights into chromosome structure and function. 
    more » « less
  2. Mugal, Carina (Ed.)
    Abstract Comparative genomic analyses among closely related species provide an opportunity to assess their evolutionary history. The relatedness between species can depend on a variety of factors, including reproductive isolation, introgression, and incomplete lineage sorting, and this can impact divergence across the genome. Here, we use a combination of long- and short-read sequencing and HI-C scaffolding to assemble genomes for each of the four species in the testacea species group of Drosophila, including D. testacea, D. orientacea, D. neotestacea, and D. putrida, and its outgroup, D. bizonata. First, among species, we find many structural rearrangements across the genome as well as a large size difference in the dot chromosome that we infer is due to the expansion of repetitive elements. Second, we assess phylogenetic discordance and uncover a difference in the phylogeny inferred from genes on Muller E and the mitogenome relative to the rest of the genome, which may be due to recent hybridization. Lastly, we assess the rate of molecular evolution of genes shared across all species and identify genes evolving at different rates across the phylogeny. Our results present genomic resources for this species group and begin to probe into some of the evolutionary characteristics that contribute to variation in genome structure, while highlighting the need for high-quality genome resources to fully capture and understand the evolutionary history among closely related species. 
    more » « less
  3. Abstract The common bed bug, Cimex lectularius, is a globally distributed pest insect of medical, veterinary, and economic importance. Previous reference genome assemblies for this species were generated from short read sequencing data, resulting in a ~650 Mb composed of thousands of contigs. Here, we present a haplotype-resolved, chromosome-level reference genome, generated from an adult Harlen strain female specimen. Using PacBio long read and Omni-C proximity sequencing, we generated a 540 Mb genome with 15 chromosomes (13 autosomes and 2 sex chromosomes - X1X2) with an N50 > 30 Mb and BUSCO > 90%. Previous karyotyping efforts indicate an XY sex chromosome system, with 2n=26 and X1X1X2X2 females and X1X2Y males; however significant fragmentation of the X chromosome has also been reported. We further use whole genome resequencing data from males and females to identify the X1 and X2 chromosomes based on sex biases in coverage. This highly contiguous reference genome assembly provides a much-improved resource for identifying chromosomal genome architecture, and for interpreting patterns of urban outbreaks and signatures of selection linked to insecticide resistance. 
    more » « less
  4. Annotating the genomes of multiple species allows us to analyze the evolution of their genes. While many eukaryotic genome assemblies already include computational gene predictions, these predictions can benefit from review and refinement through manual gene annotation. The Genomics Education Partnership (GEP; https://thegep.org/ ) developed a structural annotation protocol for protein-coding genes that enables undergraduate student and faculty researchers to create high-quality gene annotations that can be utilized in subsequent scientific investigations. For example, this protocol has been utilized by the GEP faculty to engage undergraduate students in the comparative annotation of genes involved in the insulin signaling pathway in 27 Drosophila species, using D. melanogaster as the reference genome. Students construct gene models using multiple lines of computational and empirical evidence including expression data (e.g., RNA-Seq), sequence similarity (e.g., BLAST and multiple sequence alignment), and computational gene predictions. Quality control measures require each gene be annotated by at least two students working independently, followed by reconciliation of the submitted gene models by a more experienced student. This article provides an overview of the annotation protocol and describes how discrepancies in student submitted gene models are resolved to produce a final, high-quality gene set suitable for subsequent analyses. The protocol can be adapted to other scientific questions (e.g., expansion of the Drosophila Muller F element) and species (e.g., parasitoid wasps) to provide additional opportunities for undergraduate students to participate in genomics research. These student annotation efforts can substantially improve the quality of gene annotations in publicly available genomic databases. 
    more » « less
  5. Suh, Alexander (Ed.)
    Abstract Despite being quite specious (~10,000 extant species), birds have a fairly uniform genome size and karyotype (including the common occurrence of microchromosomes) relative to other vertebrate lineages. Storks (Family Ciconiidae) are a charismatic and distinct group of large wading birds with nearly worldwide distribution but few genomic resources. Here we present an annotated chromosome-level reference genome and chromosome orthology analysis for the wood stork (Mycteria americana), a species that has been federally protected under the Endangered Species Act since 1984. The annotated chromosome-level reference assembly was produced using the blood of a wild female wood stork chick, has a length of 1.35 Gb, a contig N50 of 37 Mb, a scaffold N50 of 80 Mb, and a BUSCO score of 98.8%. We identified 31 autosomal pairs and two sex chromosomes in the wood stork genome, but failed to identify four additional autosomal microchromosomes previously found via karyotyping. Orthology analyses confirmed reported synapomorphies unique to storks and identified the chromosomes participating in these fusions. This study highlights the difficulty and potential problems associated with delineating microchromosomes in reference genome assemblies. It also provides a foundation for studying karyotype evolution in the core water bird clade that includes penguins, albatrosses, storks, cormorants, herons, and ibises. Finally, our reference genome will allow for numerous genomic studies, such as genome-wide association studies of local adaptation, that will aid in wood stork conservation. 
    more » « less