Title: An annotated draft genome for the Andean bear, Tremarctos ornatus
Abstract The Andean bear is the only extant member of the Tremarctine subfamily and the only extant ursid species to inhabit South America. Here, we present an annotated de novo assembly of a nuclear genome from a captive-born female Andean bear, Mischief, generated using a combination of short and long DNA and RNA reads. Our final assembly has a length of 2.23 Gb, a scaffold N50 of 21.12 Mb, a contig N50 of 23.5 kb, and a BUSCO score of 88%. The Andean bear genome will be a useful resource for exploring the complex phylogenetic history of extinct and extant bear species and for future population genetics studies of Andean bears.
Award ID(s):
1754451
NSF-PAR ID:
10232211
Journal Name:
Journal of Heredity
ISSN:
0022-1503
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    The brown bear (Ursus arctos) is the second largest and most widespread extant terrestrial carnivore on Earth and has recently emerged as a medical model for human metabolic diseases. Here, we report a fully phased chromosome-level assembly of a male North American brown bear built by combining Pacific Biosciences (PacBio) HiFi data and publicly available Hi-C data. The final genome size is 2.47 Gigabases (Gb) with a scaffold and contig N50 length of 70.08 and 43.94 Megabases (Mb), respectively. Benchmarking Universal Single-Copy Ortholog (BUSCO) analysis revealed that 94.5% of single copy orthologs from Mammalia were present in the genome (the highest of any ursid genome to date). Repetitive elements accounted for 44.48% of the genome and a total of 20,480 protein coding genes were identified. Based on whole genome alignment to the polar bear, the brown bear is highly syntenic with the polar bear, and our phylogenetic analysis of 7,246 single-copy orthologs supports the currently proposed species tree for Ursidae. This highly contiguous genome assembly will support future research on both the evolutionary history of the bear family and the physiological mechanisms behind hibernation, the latter of which has broad medical implications.

     
  2. Abstract

    Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. The 23 chromosome-scale reference genomes across the order represent only 12 of the ~60 squamate families. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse, representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest-quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which were further scaffolded using Hi-C data into 75 total sequences representing all 19 chromosomes. We found that 9 of the 19 chromosomal scaffolds were assembled as a near-single contig, whereas the other 10 chromosomes were each scaffolded together from multiple contigs. We qualitatively identified that the percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction of previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000.

     
  3. Abstract

    Despite being quite speciose (~10,000 extant species), birds have a fairly uniform genome size and karyotype (including the common occurrence of microchromosomes) relative to other vertebrate lineages. Storks (Family Ciconiidae) are a charismatic and distinct group of large wading birds with nearly worldwide distribution but few genomic resources. Here we present an annotated chromosome-level reference genome and chromosome orthology analysis for the wood stork (Mycteria americana), a species that has been federally protected under the Endangered Species Act since 1984. The annotated chromosome-level reference assembly was produced using the blood of a wild female wood stork chick, has a length of 1.35 Gb, a contig N50 of 37 Mb, a scaffold N50 of 80 Mb, and a BUSCO score of 98.8%. We identified 31 autosomal pairs and two sex chromosomes in the wood stork genome, but failed to identify four additional autosomal microchromosomes previously found via karyotyping. Orthology analyses confirmed reported synapomorphies unique to storks and identified the chromosomes participating in these fusions. This study highlights the difficulty and potential problems associated with delineating microchromosomes in reference genome assemblies. It also provides a foundation for studying karyotype evolution in the core water bird clade that includes penguins, albatrosses, storks, cormorants, herons, and ibises. Finally, our reference genome will allow for numerous genomic studies, such as genome-wide association studies of local adaptation, that will aid in wood stork conservation.

     
  4. Abstract

    The evolutionary direction of gonochorism and hermaphroditism is an intriguing mystery to be solved. The special transient hermaphroditic stage makes the little yellow croaker (Larimichthys polyactis) an appealing model for studying hermaphrodite formation. However, the origin of and evolutionary relationship between L. polyactis and Larimichthys crocea, the most famous commercial fish species in East Asia, remain unclear. Here, we report the sequence of the L. polyactis genome, which we found is ~706 Mb long (contig N50 = 1.21 Mb and scaffold N50 = 4.52 Mb) and contains 25,233 protein‐coding genes. Phylogenomic analysis suggested that L. polyactis diverged from the common ancestor, L. crocea, approximately 25.4 million years ago. Our high‐quality genome assembly enabled comparative genomic analysis, which revealed several within‐chromosome rearrangements and translocations, without major chromosome fission or fusion events between the two species. The dmrt1 gene was identified as the male‐specific gene in L. polyactis. Transcriptome analysis showed that the expression of dmrt1 and its upstream regulatory gene (rnf183) were both sexually dimorphic. Rnf183, unlike its two paralogues rnf223 and rnf225, is only present in Larimichthys and Lates but not in other teleost species, suggesting that it originated from lineage‐specific duplication or was lost in other teleosts. Phylogenetic analysis shows that the hermaphrodite stage in male L. polyactis may be explained by the sequence evolution of dmrt1. Decoding the L. polyactis genome not only provides insight into the genetic underpinnings of hermaphrodite evolution, but also provides valuable information for enhancing fish aquaculture.

     
  5. Abstract

    The cabbage looper, Trichoplusia ni, is a globally distributed highly polyphagous herbivore and an important agricultural pest. T. ni has evolved resistance to various chemical insecticides, and is one of the only two insect species that have evolved resistance to the biopesticide Bacillus thuringiensis (Bt) in agricultural systems and has been selected for resistance to baculovirus infections. We report a 333‐Mb high‐quality T. ni genome assembly, which has N50 lengths of scaffolds and contigs of 4.6 Mb and 140 kb, respectively, and contains 14,384 protein‐coding genes. High‐density genetic maps were constructed to anchor 305 Mb (91.7%) of the assembly to 31 chromosomes. Comparative genomic analysis of T. ni with Bombyx mori showed enrichment of tandemly duplicated genes in T. ni in families involved in detoxification and digestion, consistent with the broad host range of T. ni. High levels of genome synteny were found between T. ni and other sequenced lepidopterans. However, genome synteny analysis of T. ni and the T. ni-derived cell line High Five (Hi5) indicated extensive genome rearrangements in the cell line. These results provided the first genomic evidence revealing the high instability of chromosomes in lepidopteran cell lines known from karyotypic observations. The high‐quality T. ni genome sequence provides a valuable resource for research in a broad range of areas including fundamental insect biology, insect‐plant interactions and co‐evolution, mechanisms and evolution of insect resistance to chemical and biological pesticides, and technology development for insect pest management.

     