skip to main content


Title: A Chromosome-Length Reference Genome for the Endangered Pacific Pocket Mouse Reveals Recent Inbreeding in a Historically Large Population
Abstract High-quality reference genomes are fundamental tools for understanding population history, and can provide estimates of genetic and demographic parameters relevant to the conservation of biodiversity. The federally endangered Pacific pocket mouse (PPM), which persists in three small, isolated populations in southern California, is a promising model for studying how demographic history shapes genetic diversity, and how diversity in turn may influence extinction risk. To facilitate these studies in PPM, we combined PacBio HiFi long reads with Omni-C and Hi-C data to generate a de novo genome assembly, and annotated the genome using RNAseq. The assembly comprised 28 chromosome-length scaffolds (N50 = 72.6 MB) and the complete mitochondrial genome, and included a long heterochromatic region on chromosome 18 not represented in the previously available short-read assembly. Heterozygosity was highly variable across the genome of the reference individual, with 18% of windows falling in runs of homozygosity (ROH) >1 MB, and nearly 9% in tracts spanning >5 MB. Yet outside of ROH, heterozygosity was relatively high (0.0027), and historical Ne estimates were large. These patterns of genetic variation suggest recent inbreeding in a formerly large population. Currently the most contiguous assembly for a heteromyid rodent, this reference genome provides insight into the past and recent demographic history of the population, and will be a critical tool for management and future studies of outbreeding depression, inbreeding depression, and genetic load.  more » « less
Award ID(s):
2019745
NSF-PAR ID:
10417188
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ;
Editor(s):
Enard, David
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
14
Issue:
8
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Lohmueller, Kirk (Ed.)
    Abstract The levels and distribution of standing genetic variation in a genome can provide a wealth of insights about the adaptive potential, demographic history, and genome structure of a population or species. As structural variants are increasingly associated with traits important for adaptation and speciation, investigating both sequence and structural variation is essential for wholly tapping this potential. Using a combination of shotgun sequencing, 10x Genomics linked reads and proximity-ligation data (Chicago and Hi-C), we produced and annotated a chromosome-level genome assembly for the Atlantic silverside (Menidia menidia)—an established ecological model for studying the phenotypic effects of natural and artificial selection—and examined patterns of genomic variation across two individuals sampled from different populations with divergent local adaptations. Levels of diversity varied substantially across each chromosome, consistently being highly elevated near the ends (presumably near telomeric regions) and dipping to near zero around putative centromeres. Overall, our estimate of the genome-wide average heterozygosity in the Atlantic silverside is among the highest reported for a fish, or any vertebrate (1.32–1.76% depending on inference method and sample). Furthermore, we also found extreme levels of structural variation, affecting ∼23% of the total genome sequence, including multiple large inversions (> 1 Mb and up to 12.6 Mb) associated with previously identified haploblocks showing strong differentiation between locally adapted populations. These extreme levels of standing genetic variation are likely associated with large effective population sizes and may help explain the remarkable adaptive divergence among populations of the Atlantic silverside. 
    more » « less
  2. Abstract

    Understanding the genomic consequences of population decline is important for predicting species' vulnerability to intensifying global change. Empirical information about genomic changes in populations in the early stages of decline, especially for those still experiencing immigration, remains scarce. We used 7834 autosomal SNPs and demographic data for 288 Florida scrub jays (Aphelocoma coerulescens; FSJ) sampled in 2000 and 2008 to compare levels of genetic diversity, inbreeding, relatedness, and lengths of runs of homozygosity (ROH) between two subpopulations within dispersal distance of one another but have experienced contrasting demographic trajectories. At Archbold Biological Station (ABS), the FSJ population has been stable because of consistent habitat protection and management, while at nearby Placid Lakes Estates (PLE), the population declined precipitously due to suburban development. By the onset of our sampling in 2000, birds in PLE were already less heterozygous, more inbred, and on average more related than birds in ABS. No significant changes occurred in heterozygosity or inbreeding across the 8‐year sampling interval, but average relatedness among individuals decreased in PLE, thus by 2008 average relatedness did not differ between sites. PLE harbored a similar proportion of short ROH but a greater proportion of long ROH than ABS, suggesting one continuous population of shared demographic history in the past, which is now experiencing more recent inbreeding. These results broadly uphold the predictions of simple population genetic models based on inferred effective population sizes and rates of immigration. Our study highlights how, in just a few generations, formerly continuous populations can diverge in heterozygosity and levels of inbreeding with severe local population decline despite ongoing gene flow.

     
    more » « less
  3. INTRODUCTION The Anthropocene is marked by an accelerated loss of biodiversity, widespread population declines, and a global conservation crisis. Given limited resources for conservation intervention, an approach is needed to identify threatened species from among the thousands lacking adequate information for status assessments. Such prioritization for intervention could come from genome sequence data, as genomes contain information about demography, diversity, fitness, and adaptive potential. However, the relevance of genomic data for identifying at-risk species is uncertain, in part because genetic variation may reflect past events and life histories better than contemporary conservation status. RATIONALE The Zoonomia multispecies alignment presents an opportunity to systematically compare neutral and functional genomic diversity and their relationships to contemporary extinction risk across a large sample of diverse mammalian taxa. We surveyed 240 species spanning from the “Least Concern” to “Critically Endangered” categories, as published in the International Union for Conservation of Nature’s Red List of Threatened Species. Using a single genome for each species, we estimated historical effective population sizes ( N e ) and distributions of genome-wide heterozygosity. To estimate genetic load, we identified substitutions relative to reconstructed ancestral sequences, assuming that mutations at evolutionarily conserved sites and in protein-coding sequences, especially in genes essential for viability in mice, are predominantly deleterious. We examined relationships between the conservation status of species and metrics of heterozygosity, demography, and genetic load and used these data to train and test models to distinguish threatened from nonthreatened species. RESULTS Species with smaller historical N e are more likely to be categorized as at risk of extinction, suggesting that demography, even from periods more than 10,000 years in the past, may be informative of contemporary resilience. Species with smaller historical N e also carry proportionally higher burdens of weakly and moderately deleterious alleles, consistent with theoretical expectations of the long-term accumulation and fixation of genetic load under strong genetic drift. We found weak support for a causative link between fixed drift load and extinction risk; however, other types of genetic load not captured in our data, such as rare, highly deleterious alleles, may also play a role. Although ecological (e.g., physiological, life-history, and behavioral) variables were the best predictors of extinction risk, genomic variables nonrandomly distinguished threatened from nonthreatened species in regression and machine learning models. These results suggest that information encoded within even a single genome can provide a risk assessment in the absence of adequate ecological or population census data. CONCLUSION Our analysis highlights the potential for genomic data to rapidly and inexpensively gauge extinction risk by leveraging relationships between contemporary conservation status and genetic variation shaped by the long-term demographic history of species. As more resequencing data and additional reference genomes become available, estimates of genetic load, estimates of recent demographic history, and accuracy of predictive models will improve. We therefore echo calls for including genomic information in assessments of the conservation status of species. Genomic information can help predict extinction risk in diverse mammalian species. Across 240 mammals, species with smaller historical N e had lower genetic diversity, higher genetic load, and were more likely to be threatened with extinction. Genomic data were used to train models that predict whether a species is threatened, which can be valuable for assessing extinction risk in species lacking ecological or census data. [Animal silhouettes are from PhyloPic] 
    more » « less
  4. Abstract Hares (genus Lepus) provide clear examples of repeated and often massive introgressive hybridization and striking local adaptations. Genomic studies on this group have so far relied on comparisons to the European rabbit (Oryctolagus cuniculus) reference genome. Here, we report the first de novo draft reference genome for a hare species, the mountain hare (Lepus timidus), and evaluate the efficacy of whole-genome re-sequencing analyses using the new reference versus using the rabbit reference genome. The genome was assembled using the ALLPATHS-LG protocol with a combination of overlapping pair and mate-pair Illumina sequencing (77x coverage). The assembly contained 32,294 scaffolds with a total length of 2.7 Gb and a scaffold N50 of 3.4 Mb. Re-scaffolding based on the rabbit reference reduced the total number of scaffolds to 4,205 with a scaffold N50 of 194 Mb. A correspondence was found between 22 of these hare scaffolds and the rabbit chromosomes, based on gene content and direct alignment. We annotated 24,578 protein coding genes by combining ab-initio predictions, homology search, and transcriptome data, of which 683 were solely derived from hare-specific transcriptome data. The hare reference genome is therefore a new resource to discover and investigate hare-specific variation. Similar estimates of heterozygosity and inferred demographic history profiles were obtained when mapping hare whole-genome re-sequencing data to the new hare draft genome or to alternative references based on the rabbit genome. Our results validate previous reference-based strategies and suggest that the chromosome-scale hare draft genome should enable chromosome-wide analyses and genome scans on hares. 
    more » « less
  5. Habitat degradation and loss of genetic diversity are common threats faced by almost all of today’s wild cats. Big cats, such as tigers and lions, are of great concern and have received considerable conservation attention through policies and international actions. However, knowledge of and conservation actions for small wild cats are lagging considerably behind. The black-footed cat,Felis nigripes, one of the smallest felid species, is experiencing increasing threats with a rapid reduction in population size. However, there is a lack of genetic information to assist in developing effective conservation actions. A de novo assembly of a high-quality chromosome-level reference genome of the black-footed cat was made, and comparative genomics and population genomics analyses were carried out. These analyses revealed that the most significant genetic changes in the evolution of the black-footed cat are the rapid evolution of sensory and metabolic-related genes, reflecting genetic adaptations to its characteristic nocturnal hunting and a high metabolic rate. Genomes of the black-footed cat exhibit a high level of inbreeding, especially for signals of recent inbreeding events, which suggest that they may have experienced severe genetic isolation caused by habitat fragmentation. More importantly, inbreeding associated with two deleterious mutated genes may exacerbate the risk of amyloidosis, the dominant disease that causes mortality of about 70% of captive individuals. Our research provides comprehensive documentation of the evolutionary history of the black-footed cat and suggests that there is an urgent need to investigate genomic variations of small felids worldwide to support effective conservation actions.

     
    more » « less