skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A High-Quality Reference Genome Assembly of the Saltwater Crocodile, Crocodylus porosus, Reveals Patterns of Selection in Crocodylidae
Abstract Crocodilians are an economically, culturally, and biologically important group. To improve researchers’ ability to study genome structure, evolution, and gene regulation in the clade, we generated a high-quality de novo genome assembly of the saltwater crocodile, Crocodylus porosus, from Illumina short read data from genomic libraries and in vitro proximity-ligation libraries. The assembled genome is 2,123.5 Mb, with N50 scaffold size of 17.7 Mb and N90 scaffold size of 3.8 Mb. We then annotated this new assembly, increasing the number of annotated genes by 74%. In total, 96% of 23,242 annotated genes were associated with a functional protein domain. Furthermore, multiple noncoding functional regions and mappable genetic markers were identified. Upon analysis and overlapping the results of branch length estimation and site selection tests for detecting potential selection, we found 16 putative genes under positive selection in crocodilians, 10 in C. porosus and 6 in Alligator mississippiensis. The annotated C. porosus genome will serve as an important platform for osmoregulatory, physiological, and sex determination studies, as well as an important reference in investigating the phylogenetic relationships of crocodilians, birds, and other tetrapods.  more » « less
Award ID(s):
1838283
PAR ID:
10321668
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Editor(s):
Cordaux, Richard
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
12
Issue:
1
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Campbell, P (Ed.)
    Abstract The mallard (Anas platyrhynchos) is one of the most common, economically, and socially important birds around the world. Mallards were not only an important food source for early humans but eventually becoming intimately linked with people as they were domesticated over the last 2,000 years. To date, mallard genomes are largely reconstructed from samples of domestic or unknown genetic heritage. Here, we report the first high-quality genome assembly and annotation of a genetically vetted wild mallard from North America (NAwild_v1.0). The genome was assembled using a combination of shotgun libraries, proximity ligation Chicago, and Dovetail Hi-C libraries. The final assembly is ∼1.04 Gb in size, with 98.3% of the sequence located in 30 full or nearly full chromosome-level scaffolds, and with a N50/L50 of 79.1 Mb/4 scaffolds. We used a combination of gene prediction and similarity approaches to annotate a total of 23,584 functional genes, of which 19,242 were associated to GO terms. The genome assembly and the set of annotated genes yielded a 95.4% completeness score when compared with the BUSCO aves_odb10 dataset. Next, we aligned 3 previously published mallard genomes to ours, and demonstrate how runs of homozygosity and nucleotide diversity are substantially higher and lower, respectively, to ours and how these artificially changed genomes resulted in profoundly different and biased demographic histories. Our wild mallard assembly not only provides a valuable resource to shed light onto genome evolution, speciation, and other adaptive processes, but also helping with identifying functional genes that have been significantly altered during the domestication process. 
    more » « less
  2. The western painted turtle, Chrysemys picta bellii, has the greatest tolerance to anoxia of any tetrapod studied to date. These turtles reside in the northern United States and southern Canada, and survive months of anoxia while submerged in ice-locked ponds and bogs. Reference genomes provide an important resource for elucidating the molecular bases for such unique physiological traits. An initial reference genome for this species was published in 2013, but the assembly is highly fragmented which poses several limitations for downstream analyses and biological interpretation. In this study, we created a new and improved assembly by combining PacBio HiFi, 10x Genomics Chromium, Hi-C sequence data and BioNano optical mapping derived from a single individual to generate a new haplotype-resolved chromosome-level assembly for C. picta bellii, called SLU_Cpb5.0. The genome size of the primary assembly is 2.372 Gb with a scaffold N50 of 133.6 Mb, which is a 6.5-fold improvement over the existing assembly. Genome annotation of SLU_Cpb5.0 revealed 12,242 novel genes compared to previous assemblies. Our PacBio Iso-Seq RNA sequencing data for twelve tissues unraveled over 100,000 novel transcript isoforms and 4,325 novel genes that were not annotated by standard NCBI pipeline. We also observed distinct patterns of tissue-specific isoform expression, creating a robust foundation for future characterization of the functions of these genes. The improved genome assembly and annotation will facilitate comparative genomics studies to better understand the genetic basis of C.picta bellii's extreme physiological adaptations and other aspects of its biology. 
    more » « less
  3. Abstract Genome-wide information has so far been unavailable for ribbon worms of the clade Hoplonemertea, the most species-rich class within the phylum Nemertea. While species within Pilidiophora, the sister clade of Hoplonemertea, possess a pilidium larval stage and lack stylets on their proboscis, Hoplonemertea species have a planuliform larva and are armed with stylets employed for the injection of toxins into their prey. To further compare these developmental, physiological, and behavioral differences from a genomic perspective, the availability of a reference genome for a Hoplonemertea species is crucial. Such data will be highly useful for future investigations toward a better understanding of molecular ecology, venom evolution, and regeneration not only in Nemertea but also in other marine invertebrate phyla. To this end, we herein present the annotated chromosome-level genome assembly for Emplectonema gracile (Nemertea; Hoplonemertea; Monostilifera; Emplectonematidae), an easily collected nemertean well suited for laboratory experimentation. The genome has an assembly size of 157.9 Mb. Hi-C scaffolding yielded chromosome-level scaffolds, with a scaffold N50 of 10.0 Mb and a score of 95.1% for complete BUSCO genes found as a single copy. Annotation predicted 20,684 protein-coding genes. The high-quality reference genome reaches an Earth BioGenome standard level of 7.C.Q50. 
    more » « less
  4. Abstract The diatom, Cyclotella cryptica, is a well-established model species for physiological studies and biotechnology applications of diatoms. To further facilitate its use as a model diatom, we report an improved reference genome assembly and annotation for C. cryptica strain CCMP332. We used a combination of long- and short-read sequencing to assemble a high-quality and contaminant-free genome. The genome is 171 Mb in size and consists of 662 scaffolds with a scaffold N50 of 494 kb. This represents a 176-fold decrease in scaffold number and 41-fold increase in scaffold N50 compared to the previous assembly. The genome contains 21,250 predicted genes, 75% of which were assigned putative functions. Repetitive DNA comprises 59% of the genome, and an improved classification of repetitive elements indicated that a historically steady accumulation of transposable elements has contributed to the relatively large size of the C. cryptica genome. The high-quality C. cryptica genome will serve as a valuable reference for ecological, genetic, and biotechnology studies of diatoms. 
    more » « less
  5. Abstract The Japanese rhinoceros beetleTrypoxylus dichotomusis a giant beetle with distinctive exaggerated horns present on the head and prothoracic regions of the male.T. dichotomushas been used as a research model in various fields such as evolutionary developmental biology, ecology, ethology, biomimetics, and drug discovery. In this study, de novo assembly of 615 Mb, representing 80% of the genome estimated by flow cytometry, was obtained using the 10 × Chromium platform. The scaffold N50 length of the genome assembly was 8.02 Mb, with repetitive elements predicted to comprise 49.5% of the assembly. In total, 23,987 protein-coding genes were predicted in the genome. In addition, de novo assembly of the mitochondrial genome yielded a contig of 20,217 bp. We also analyzed the transcriptome by generating 16 RNA-seq libraries from a variety of tissues of both sexes and developmental stages, which allowed us to identify 13 co-expressed gene modules. We focused on the genes related to horn formation and obtained new insights into the evolution of the gene repertoire and sexual dimorphism as exemplified by the sex-specific splicing pattern of thedoublesexgene. This genomic information will be an excellent resource for further functional and evolutionary analyses, including the evolutionary origin and genetic regulation of beetle horns and the molecular mechanisms underlying sexual dimorphism. 
    more » « less