The brown bear (Ursus arctos) is the second largest and most widespread extant terrestrial carnivore on Earth and has recently emerged as a medical model for human metabolic diseases. Here, we report a fully phased chromosome-level assembly of a male North American brown bear built by combining Pacific Biosciences (PacBio) HiFi data and publicly available Hi-C data. The final genome size is 2.47 Gigabases (Gb) with a scaffold and contig N50 length of 70.08 and 43.94 Megabases (Mb), respectively. Benchmarking Universal Single-Copy Ortholog (BUSCO) analysis revealed that 94.5% of single copy orthologs from Mammalia were present in the genome (the highest of any ursid genome to date). Repetitive elements accounted for 44.48% of the genome and a total of 20,480 protein coding genes were identified. Based on whole genome alignment to the polar bear, the brown bear is highly syntenic with the polar bear, and our phylogenetic analysis of 7,246 single-copy orthologs supports the currently proposed species tree for Ursidae. This highly contiguous genome assembly will support future research on both the evolutionary history of the bear family and the physiological mechanisms behind hibernation, the latter of which has broad medical implications.
- Award ID(s):
- 1754451
- NSF-PAR ID:
- 10232211
- Date Published:
- Journal Name:
- Journal of Heredity
- ISSN:
- 0022-1503
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
Abstract Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest-quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified 9 of the 19 chromosomal scaffolds were assembled as a near-single contig, whereas the other 10 chromosomes were each scaffolded together from multiple contigs. We qualitatively identified that the percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction of previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000.
-
Abstract Despite being quite specious (~10,000 extant species), birds have a fairly uniform genome size and karyotype (including the common occurrence of microchromosomes) relative to other vertebrate lineages. Storks (Family Ciconiidae) are a charismatic and distinct group of large wading birds with nearly worldwide distribution but few genomic resources. Here we present an annotated chromosome-level reference genome and chromosome orthology analysis for the wood stork (Mycteria americana), a species that has been federally protected under the Endangered Species Act since 1984. The annotated chromosome-level reference assembly was produced using the blood of a wild female wood stork chick, has a length of 1.35 Gb, a contig N50 of 37 Mb, a scaffold N50 of 80 Mb, and a BUSCO score of 98.8%. We identified 31 autosomal pairs and two sex chromosomes in the wood stork genome, but failed to identify four additional autosomal microchromosomes previously found via karyotyping. Orthology analyses confirmed reported synapomorphies unique to storks and identified the chromosomes participating in these fusions. This study highlights the difficulty and potential problems associated with delineating microchromosomes in reference genome assemblies. It also provides a foundation for studying karyotype evolution in the core water bird clade that includes penguins, albatrosses, storks, cormorants, herons, and ibises. Finally, our reference genome will allow for numerous genomic studies, such as genome-wide association studies of local adaptation, that will aid in wood stork conservation.
-
Abstract The evolutionary direction of gonochorism and hermaphroditism is an intriguing mystery to be solved. The special transient hermaphroditic stage makes the little yellow croaker (
Larimichthys polyactis ) an appealing model for studying hermaphrodite formation. However, the origin and evolutionary relationship between ofL. polyactis andLarimichthys crocea , the most famous commercial fish species in East Asia, remain unclear. Here, we report the sequence of theL. polyactis genome, which we found is ~706 Mb long (contig N50 = 1.21 Mb and scaffold N50 = 4.52 Mb) and contains 25,233 protein‐coding genes. Phylogenomic analysis suggested thatL. polyactis diverged from the common ancestor,L. crocea , approximately 25.4 million years ago. Our high‐quality genome assembly enabled comparative genomic analysis, which revealed several within‐chromosome rearrangements and translocations, without major chromosome fission or fusion events between the two species. Thedmrt1 gene was identified as the male‐specific gene inL. polyactis . Transcriptome analysis showed that the expression ofdmrt1 and its upstream regulatory gene (rnf183 ) were both sexually dimorphic.Rnf183 , unlike its two paraloguesrnf223 andrnf225 , is only present inLarimichthys andLates but not in other teleost species, suggesting that it originated from lineage‐specific duplication or was lost in other teleosts. Phylogenetic analysis shows that the hermaphrodite stage in maleL. polyactis may be explained by the sequence evolution ofdmrt1 . Decoding theL. polyactis genome not only provides insight into the genetic underpinnings of hermaphrodite evolution, but also provides valuable information for enhancing fish aquaculture. -
Abstract The cabbage looper,
Trichoplusia ni , is a globally distributed highly polyphagous herbivore and an important agricultural pest.T. ni has evolved resistance to various chemical insecticides, and is one of the only two insect species that have evolved resistance to the biopesticideBacillus thuringiensis (Bt) in agricultural systems and has been selected for resistance to baculovirus infections. We report a 333‐Mb high‐qualityT. ni genome assembly, which has N50 lengths of scaffolds and contigs of 4.6 Mb and 140 Kb, respectively, and contains 14,384 protein‐coding genes. High‐density genetic maps were constructed to anchor 305 Mb (91.7%) of the assembly to 31 chromosomes. Comparative genomic analysis ofT. ni withBombyx mori showed enrichment of tandemly duplicated genes inT. ni in families involved in detoxification and digestion, consistent with the broad host range ofT. ni . High levels of genome synteny were found betweenT. ni and other sequenced lepidopterans. However, genome synteny analysis ofT. ni and theT. ni derived cell line High Five (Hi5) indicated extensive genome rearrangements in the cell line. These results provided the first genomic evidence revealing the high instability of chromosomes in lepidopteran cell lines known from karyotypic observations. The high‐qualityT. ni genome sequence provides a valuable resource for research in a broad range of areas including fundamental insect biology, insect‐plant interactions and co‐evolution, mechanisms and evolution of insect resistance to chemical and biological pesticides, and technology development for insect pest management.