skip to main content

Title: Mammals with Small Populations Do Not Exhibit Larger Genomes
Abstract Genome size in cellular organisms varies by six orders of magnitude, yet the cause of this large variation remains unexplained. The influential Drift-Barrier Hypothesis proposes that large genomes tend to evolve in small populations due to inefficient selection. However, to our knowledge no explicit tests of the Drift-Barrier Hypothesis have been reported. We performed the first explicit test, by comparing estimated census population size and genome size in mammals while incorporating potential covariates and the effect of shared evolutionary history. We found a lack of correlation between census population size and genome size among 199 species of mammals. These results suggest that population size is not the predominant factor influencing genome size and that the Drift-Barrier Hypothesis should be considered provisional.  more » « less
Award ID(s):
1818288 1838327
Author(s) / Creator(s):
; ;
Larracuente, Amanda
Date Published:
Journal Name:
Molecular Biology and Evolution
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. INTRODUCTION The Anthropocene is marked by an accelerated loss of biodiversity, widespread population declines, and a global conservation crisis. Given limited resources for conservation intervention, an approach is needed to identify threatened species from among the thousands lacking adequate information for status assessments. Such prioritization for intervention could come from genome sequence data, as genomes contain information about demography, diversity, fitness, and adaptive potential. However, the relevance of genomic data for identifying at-risk species is uncertain, in part because genetic variation may reflect past events and life histories better than contemporary conservation status. RATIONALE The Zoonomia multispecies alignment presents an opportunity to systematically compare neutral and functional genomic diversity and their relationships to contemporary extinction risk across a large sample of diverse mammalian taxa. We surveyed 240 species spanning from the “Least Concern” to “Critically Endangered” categories, as published in the International Union for Conservation of Nature’s Red List of Threatened Species. Using a single genome for each species, we estimated historical effective population sizes ( N e ) and distributions of genome-wide heterozygosity. To estimate genetic load, we identified substitutions relative to reconstructed ancestral sequences, assuming that mutations at evolutionarily conserved sites and in protein-coding sequences, especially in genes essential for viability in mice, are predominantly deleterious. We examined relationships between the conservation status of species and metrics of heterozygosity, demography, and genetic load and used these data to train and test models to distinguish threatened from nonthreatened species. RESULTS Species with smaller historical N e are more likely to be categorized as at risk of extinction, suggesting that demography, even from periods more than 10,000 years in the past, may be informative of contemporary resilience. Species with smaller historical N e also carry proportionally higher burdens of weakly and moderately deleterious alleles, consistent with theoretical expectations of the long-term accumulation and fixation of genetic load under strong genetic drift. We found weak support for a causative link between fixed drift load and extinction risk; however, other types of genetic load not captured in our data, such as rare, highly deleterious alleles, may also play a role. Although ecological (e.g., physiological, life-history, and behavioral) variables were the best predictors of extinction risk, genomic variables nonrandomly distinguished threatened from nonthreatened species in regression and machine learning models. These results suggest that information encoded within even a single genome can provide a risk assessment in the absence of adequate ecological or population census data. CONCLUSION Our analysis highlights the potential for genomic data to rapidly and inexpensively gauge extinction risk by leveraging relationships between contemporary conservation status and genetic variation shaped by the long-term demographic history of species. As more resequencing data and additional reference genomes become available, estimates of genetic load, estimates of recent demographic history, and accuracy of predictive models will improve. We therefore echo calls for including genomic information in assessments of the conservation status of species. Genomic information can help predict extinction risk in diverse mammalian species. Across 240 mammals, species with smaller historical N e had lower genetic diversity, higher genetic load, and were more likely to be threatened with extinction. Genomic data were used to train models that predict whether a species is threatened, which can be valuable for assessing extinction risk in species lacking ecological or census data. [Animal silhouettes are from PhyloPic] 
    more » « less
  2. Abstract

    Owing to advances in genome sequencing, genome stability has become one of the most scrutinized cellular traits across the Tree of Life. Despite its centrality to all things biological, the mutation rate (per nucleotide site per generation) ranges over three orders of magnitude among species and several‐fold within individual phylogenetic lineages. Within all major organismal groups, mutation rates scale negatively with the effective population size of a species and with the amount of functional DNA in the genome. This relationship is most parsimoniously explained by the drift‐barrier hypothesis, which postulates that natural selection typically operates to reduce mutation rates until further improvement is thwarted by the power of random genetic drift. Despite this constraint, the molecular mechanisms underlying DNA replication fidelity and repair are free to wander, provided the performance of the entire system is maintained at the prevailing level. The evolutionary flexibility of the mutation rate bears on the resolution of several prior conundrums in phylogenetic and population‐genetic analysis and raises challenges for future applications in these areas.

    more » « less
  3. Abstract Data from nearly 1000 species reveal the upper bound to rates of biomass production achievable by natural selection across the Tree of Life. For heterotrophs, maximum growth rates scale positively with organism size in bacteria but negatively in eukaryotes, whereas for phototrophs, the scaling is negligible for cyanobacteria and weakly negative for eukaryotes. These results have significant implications for understanding the bioenergetic consequences of the transition from prokaryotes to eukaryotes, and of the expansion of some groups of the latter into multicellularity. The magnitudes of the scaling coefficients for eukaryotes are significantly lower than expected under any proposed physical-constraint model. Supported by genomic, bioenergetic, and population-genetic data and theory, an alternative hypothesis for the observed negative scaling in eukaryotes postulates that growth-diminishing mutations with small effects passively accumulate with increasing organism size as a consequence of associated increases in the power of random genetic drift. In contrast, conditional on the structural and functional features of ribosomes, natural selection has been able to promote bacteria with the fastest possible growth rates, implying minimal conflicts with both bioenergetic constraints and random genetic drift. If this extension of the drift-barrier hypothesis is correct, the interpretations of comparative studies of biological traits that have traditionally ignored differences in population-genetic environments will require revisiting. 
    more » « less
  4. Abstract

    It is often expected that temperate plants have expanded their geographical ranges northward from primarily southern refugia. Evidence for this hypothesis is mixed in eastern North American species, and there is increasing support for colonization from middle latitudes. We studied genome‐wide patterns of variation in RADseq loci to test hypotheses concerning range expansion in a North American forest herb (Campanula americana). First, spatial patterns of genetic differentiation were determined. Then phylogenetic relationships and divergence times were estimated. Spatial signatures of genetic drift were also studied to identify the directionality of recent range expansion and its geographical origins. Finally, spatially explicit scenarios for the spread of plants across the landscape were compared, using variation in the population mutation parameter and Tajima'sD. We found strong longitudinal subdivision, with populations clustering into groups west and east of the Mississippi River. While the southeastern region was probably part of a diverse Pleistocene refugium, there is little evidence that range expansion involved founders from these southern locales. Instead, declines in genetic diversity and the loss of rare alleles support a westward colonization wave from a middle latitude refugium near the southern Appalachian Mountains, with subsequent expansion from a Pleistocene staging ground in the Mississippi River Valley (0.51–1.27 million years ago). These analyses implicate stepping stone colonization from middle latitudes as an important mechanism of species range expansion in eastern North America. This study further demonstrates the utility of population genetics as a tool to infer the routes travelled by organisms during geographical range expansion.

    more » « less
  5. Abstract

    Monitoring temporal changes in population genetic diversity and effective population size can provide vital information on future viability. The dusky gopher frog,Lithobates sevosus, is a critically endangered species found only in coastal Mississippi, with low genetic variability as a consequence of isolation and population size reduction. Conservation management practices have been implemented, but their efficacy has not been addressed. We genotyped individuals collected 1997–2014 to determine temporal trends in population genetic variation, structure, and effective size. Observed and expected heterozygosity and allelic richness revealed temporally stable, but low, levels of genetic variation. Positive levels of inbreeding were found in each year. There was weak genetic structure among years, which can be attributed to increased effects of genetic drift and inbreeding in small populations.L. sevosusexhibited an increase in effective population size, and currently has an estimated effective size of 33.0–58.6 individuals, which is approximately half the census size. This large ratio could possibly be explained by genetic compensation. We found that management practices have been effective at maintaining and improving effective size and genetic diversity, but that additional strategies need to be implemented to enhance viability of the species.

    more » « less