skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Chromosome-level genome of the wood stork ( Mycteria americana ) provides insight into avian chromosome evolution
Abstract Despite being quite specious (~10,000 extant species), birds have a fairly uniform genome size and karyotype (including the common occurrence of microchromosomes) relative to other vertebrate lineages. Storks (Family Ciconiidae) are a charismatic and distinct group of large wading birds with nearly worldwide distribution but few genomic resources. Here we present an annotated chromosome-level reference genome and chromosome orthology analysis for the wood stork (Mycteria americana), a species that has been federally protected under the Endangered Species Act since 1984. The annotated chromosome-level reference assembly was produced using the blood of a wild female wood stork chick, has a length of 1.35 Gb, a contig N50 of 37 Mb, a scaffold N50 of 80 Mb, and a BUSCO score of 98.8%. We identified 31 autosomal pairs and two sex chromosomes in the wood stork genome, but failed to identify four additional autosomal microchromosomes previously found via karyotyping. Orthology analyses confirmed reported synapomorphies unique to storks and identified the chromosomes participating in these fusions. This study highlights the difficulty and potential problems associated with delineating microchromosomes in reference genome assemblies. It also provides a foundation for studying karyotype evolution in the core water bird clade that includes penguins, albatrosses, storks, cormorants, herons, and ibises. Finally, our reference genome will allow for numerous genomic studies, such as genome-wide association studies of local adaptation, that will aid in wood stork conservation.  more » « less
Award ID(s):
2129600
PAR ID:
10481940
Author(s) / Creator(s):
; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Journal of Heredity
ISSN:
0022-1503
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Sethuraman, A (Ed.)
    Abstract Spiny lizards in the genus Sceloporus are a model system among squamate reptiles for studies of chromosomal evolution. While most pleurodont iguanians retain an ancestral karyotype formula of 2n = 36 chromosomes, Sceloporus exhibits substantial karyotype variation ranging from 2n =  22 to 46 chromosomes. We present two annotated chromosome-scale genome assemblies for the Plateau Fence Lizard (Sceloporus tristichus) to facilitate research on the role of pericentric inversion polymorphisms on adaptation and speciation. Based on previous karyotype work using conventional staining, the S. tristichus genome is characterized as 2n =  22 with six pairs of macrochromosomes and five pairs of microchromosomes and a pericentric inversion polymorphism on chromosome 7 that is geographically variable. We provide annotated, chromosome-scale genomes for two lizards located at opposite ends of a dynamic hybrid zone that are each fixed for different inversion polymorphisms. The assembled genomes are 1.84–1.87 Gb (1.72 Gb for scaffolds mapping to chromosomes) with a scaffold N50 of 267.5 Mb. Functional annotation of the genomes resulted in ∼15K predicted gene models. Our assemblies confirmed the presence of a 4.62-Mb pericentric inversion on chromosome 7, which contains 62 annotated coding genes with known functions. In addition, we collected population genomics data using double digest RAD-sequencing for 44 S. tristichus to estimate population structure and phylogeny across the Colorado Plateau. These new genomic resources provide opportunities to perform genomic scans and investigate the formation and spread of pericentric inversions in a naturally occurring hybrid zone. 
    more » « less
  2. Mank, Judith (Ed.)
    Abstract Urosaurus nigricaudus is a phrynosomatid lizard endemic to the Baja California Peninsula in Mexico. This work presents a chromosome-level genome assembly and annotation from a male individual. We used PacBio long reads and HiRise scaffolding to generate a high-quality genomic assembly of 1.87 Gb distributed in 327 scaffolds, with an N50 of 279 Mb and an L50 of 3. Approximately 98.4% of the genome is contained in 14 scaffolds, with 6 large scaffolds (334–127 Mb) representing macrochromosomes and 8 small scaffolds (63–22 Mb) representing microchromosomes. Using standard gene modeling and transcriptomic data, we predicted 17,902 protein-coding genes on the genome. The repeat content is characterized by a large proportion of long interspersed nuclear elements that are relatively old. Synteny analysis revealed some microchromosomes with high repeat content are more prone to rearrangements but that both macro- and microchromosomes are well conserved across reptiles. We identified scaffold 14 as the X chromosome. This microchromosome presents perfect dosage compensation where the single X of males has the same expression levels as two X chromosomes in females. Finally, we estimated the effective population size for U. nigricaudus was extremely low, which may reflect a reduction in polymorphism related to it becoming a peninsular endemic. 
    more » « less
  3. Abstract Raphidioptera (snakeflies) are a holometabolan order with the least species diversity but play a pivotal role in understanding the origin of complete metamorphosis. Here, we provide an annotated, chromosome-level reference genome assembly for an Asian endemic snakeflyMongoloraphidia duomilia(Yang, 1998) of the family Raphidiidae, assembled using PacBio HiFi and Hi-C data from female specimens. The resulting assembly is 653.56 Mb, of which 97.90% is anchored into 13 chromosomes. The scaffold N50 is 53.50 Mb, and BUSCO completeness is 97.80%. Repetitive elements comprise 64.31% of the genome (366.04 Mb). We identified 599 noncoding RNAs and predicted 11,141 protein-coding genes in the genome (97.70% BUSCO completeness). The new snakefly genome will facilitate comparison of genome architecture across Neuropterida and Holometabola and shed light on the ecological and evolutionary transitions between Neuropterida and Coleopterida. 
    more » « less
  4. Abstract Genome-wide information has so far been unavailable for ribbon worms of the clade Hoplonemertea, the most species-rich class within the phylum Nemertea. While species within Pilidiophora, the sister clade of Hoplonemertea, possess a pilidium larval stage and lack stylets on their proboscis, Hoplonemertea species have a planuliform larva and are armed with stylets employed for the injection of toxins into their prey. To further compare these developmental, physiological, and behavioral differences from a genomic perspective, the availability of a reference genome for a Hoplonemertea species is crucial. Such data will be highly useful for future investigations toward a better understanding of molecular ecology, venom evolution, and regeneration not only in Nemertea but also in other marine invertebrate phyla. To this end, we herein present the annotated chromosome-level genome assembly for Emplectonema gracile (Nemertea; Hoplonemertea; Monostilifera; Emplectonematidae), an easily collected nemertean well suited for laboratory experimentation. The genome has an assembly size of 157.9 Mb. Hi-C scaffolding yielded chromosome-level scaffolds, with a scaffold N50 of 10.0 Mb and a score of 95.1% for complete BUSCO genes found as a single copy. Annotation predicted 20,684 protein-coding genes. The high-quality reference genome reaches an Earth BioGenome standard level of 7.C.Q50. 
    more » « less
  5. Abstract The common bed bug, Cimex lectularius, is a globally distributed pest insect of medical, veterinary, and economic importance. Previous reference genome assemblies for this species were generated from short read sequencing data, resulting in a ~650 Mb composed of thousands of contigs. Here, we present a haplotype-resolved, chromosome-level reference genome, generated from an adult Harlen strain female specimen. Using PacBio long read and Omni-C proximity sequencing, we generated a 540 Mb genome with 15 chromosomes (13 autosomes and 2 sex chromosomes - X1X2) with an N50 > 30 Mb and BUSCO > 90%. Previous karyotyping efforts indicate an XY sex chromosome system, with 2n=26 and X1X1X2X2 females and X1X2Y males; however significant fragmentation of the X chromosome has also been reported. We further use whole genome resequencing data from males and females to identify the X1 and X2 chromosomes based on sex biases in coverage. This highly contiguous reference genome assembly provides a much-improved resource for identifying chromosomal genome architecture, and for interpreting patterns of urban outbreaks and signatures of selection linked to insecticide resistance. 
    more » « less