skip to main content


Title: Ultracontinuous Single Haplotype Genome Assemblies for the Domestic Cat ( Felis catus ) and Asian Leopard Cat ( Prionailurus bengalensis )
Abstract In addition to including one of the most popular companion animals, species from the cat family Felidae serve as a powerful system for genetic analysis of inherited and infectious disease, as well as for the study of phenotypic evolution and speciation. Previous diploid-based genome assemblies for the domestic cat have served as the primary reference for genomic studies within the cat family. However, these versions suffered from poor resolution of complex and highly repetitive regions, with substantial amounts of unplaced sequence that is polymorphic or copy number variable. We sequenced the genome of a female F1 Bengal hybrid cat, the offspring of a domestic cat (Felis catus) x Asian leopard cat (Prionailurus bengalensis) cross, with PacBio long sequence reads and used Illumina sequence reads from the parents to phase >99.9% of the reads into the 2 species’ haplotypes. De novo assembly of the phased reads produced highly continuous haploid genome assemblies for the domestic cat and Asian leopard cat, with contig N50 statistics exceeding 83 Mb for both genomes. Whole-genome alignments reveal the Felis and Prionailurus genomes are colinear, and the cytogenetic differences between the homologous F1 and E4 chromosomes represent a case of centromere repositioning in the absence of a chromosomal inversion. Both assemblies offer significant improvements over the previous domestic cat reference genome, with a 100% increase in contiguity and the capture of the vast majority of chromosome arms in 1 or 2 large contigs. We further demonstrated that comparably accurate F1 haplotype phasing can be achieved with members of the same species when one or both parents of the trio are not available. These novel genome resources will empower studies of feline precision medicine, adaptation, and speciation.  more » « less
Award ID(s):
1753760
NSF-PAR ID:
10257035
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ;
Editor(s):
Shapiro, Beth
Date Published:
Journal Name:
Journal of Heredity
Volume:
112
Issue:
2
ISSN:
0022-1503
Page Range / eLocation ID:
165 to 173
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Previous studies of canid population and evolutionary genetics have relied on high-quality domestic dog reference genomes that have been produced primarily for biomedical and trait mapping studies in dog breeds. However, the absence of highly contiguous genomes from other Canis species like the gray wolf and coyote, that represent additional distinct demographic histories, may bias inferences regarding interspecific genetic diversity and phylogenetic relationships. Here, we present single haplotype de novo genome assemblies for the gray wolf and coyote, generated by applying the trio-binning approach to long sequence reads generated from the genome of a female first-generation hybrid produced from a gray wolf and coyote mating. The assemblies were highly contiguous, with contig N50 sizes of 44.6 and 42.0 Mb for the wolf and coyote, respectively. Genome scaffolding and alignments between the two Canis assemblies and published dog reference genomes showed near complete collinearity, with one exception: a coyote-specific chromosome fission of chromosome 13 and fusion of the proximal portion of that chromosome with chromosome 8, retaining the Canis-typical haploid chromosome number of 2n = 78. We evaluated mapping quality for previous RADseq data from 334 canids and found nearly identical mapping quality and patterns among canid species and regional populations regardless of the genome used for alignment (dog, coyote, or gray wolf). These novel wolf and coyote genome reference assemblies will be important resources for proper and accurate inference of Canis demography, taxonomic evaluation, and conservation genetics.

     
    more » « less
  2. Abstract

    Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest-quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified 9 of the 19 chromosomal scaffolds were assembled as a near-single contig, whereas the other 10 chromosomes were each scaffolded together from multiple contigs. We qualitatively identified that the percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction of previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000.

     
    more » « less
  3. Abstract

    High-throughput sequencing-based methods for bulked segregant analysis (BSA) allow for the rapid identification of genetic markers associated with traits of interest. BSA studies have successfully identified qualitative (binary) and quantitative trait loci (QTLs) using QTL mapping. However, most require population structures that fit the models available and a reference genome. Instead, high-throughput short-read sequencing can be combined with BSA of k-mers (BSA-k-mer) to map traits that appear refractory to standard approaches. This method can be applied to any organism and is particularly useful for species with genomes diverged from the closest sequenced genome. It is also instrumental when dealing with highly heterozygous and potentially polyploid genomes without phased haplotype assemblies and for which a single haplotype can control a trait. Finally, it is flexible in terms of population structure. Here, we apply the BSA-k-mer method for the rapid identification of candidate regions related to seed spot and seed size in diploid potato. Using a mixture of F1 and F2 individuals from a cross between 2 highly heterozygous parents, candidate sequences were identified for each trait using the BSA-k-mer approach. Using parental reads, we were able to determine the parental origin of the loci. Finally, we mapped the identified k-mers to a closely related potato genome to validate the method and determine the genomic loci underlying these sequences. The location identified for the seed spot matches with previously identified loci associated with pigmentation in potato. The loci associated with seed size are novel. Both loci are relevant in future breeding toward true seeds in potato.

     
    more » « less
  4. Habitat degradation and loss of genetic diversity are common threats faced by almost all of today’s wild cats. Big cats, such as tigers and lions, are of great concern and have received considerable conservation attention through policies and international actions. However, knowledge of and conservation actions for small wild cats are lagging considerably behind. The black-footed cat,Felis nigripes, one of the smallest felid species, is experiencing increasing threats with a rapid reduction in population size. However, there is a lack of genetic information to assist in developing effective conservation actions. A de novo assembly of a high-quality chromosome-level reference genome of the black-footed cat was made, and comparative genomics and population genomics analyses were carried out. These analyses revealed that the most significant genetic changes in the evolution of the black-footed cat are the rapid evolution of sensory and metabolic-related genes, reflecting genetic adaptations to its characteristic nocturnal hunting and a high metabolic rate. Genomes of the black-footed cat exhibit a high level of inbreeding, especially for signals of recent inbreeding events, which suggest that they may have experienced severe genetic isolation caused by habitat fragmentation. More importantly, inbreeding associated with two deleterious mutated genes may exacerbate the risk of amyloidosis, the dominant disease that causes mortality of about 70% of captive individuals. Our research provides comprehensive documentation of the evolutionary history of the black-footed cat and suggests that there is an urgent need to investigate genomic variations of small felids worldwide to support effective conservation actions.

     
    more » « less
  5. Gojobori, Jun (Ed.)
    Abstract The sterility or inviability of hybrid offspring produced from an interspecific mating results from incompatibilities between parental genotypes that are thought to result from divergence of loci involved in epistatic interactions. However, attributes contributing to the rapid evolution of these regions also complicates their assembly, thus discovery of candidate hybrid sterility loci is difficult and has been restricted to a small number of model systems. Here we reported rapid interspecific divergence at the DXZ4 macrosatellite locus in an interspecific cross between two closely related mammalian species: the domestic cat (Felis silvestris catus) and the Jungle cat (Felis chaus). DXZ4 is an interesting candidate due to its structural complexity, copy number variability, and described role in the critical yet complex biological process of X-chromosome inactivation. However, the full structure of DXZ4 was absent or incomplete in nearly every available mammalian genome assembly given its repetitive complexity. We compared highly continuous genomes for three cat species, each containing a complete DXZ4 locus, and discovered that the felid DXZ4 locus differs substantially from the human ortholog, and that it varies in copy number between cat species. Additionally, we reported expression, methylation, and structural conformation profiles of DXZ4 and the X chromosome during stages of spermatogenesis that have been previously associated with hybrid male sterility. Collectively, these findings suggest a new role for DXZ4 in male meiosis and a proposed mechanism of feline interspecific incompatibility through rapid satellite divergence. 
    more » « less