Title: The Evolution of Comparative Phylogeography: Putting the Geography (and More) into Comparative Population Genomics
Abstract Comparative population genomics is an ascendant field using genomic comparisons between species to draw inferences about forces regulating genetic variation. Comparative phylogeography, by contrast, focuses on the shared lineage histories of species codistributed geographically and is decidedly organismal in perspective. Comparative phylogeography is approximately 35 years old, and, by some metrics, is showing signs of reduced growth. Here, we contrast the goals and methods of comparative population genomics and comparative phylogeography and argue that comparative phylogeography offers an important perspective on evolutionary history that succeeds in integrating genomics with landscape evolution in ways that complement the suprageographic perspective of comparative population genomics. Focusing primarily on terrestrial vertebrates, we review the history of comparative phylogeography, its milestones and ongoing conceptual innovations, its increasingly global focus, and its status as a bridge between landscape genomics and the process of speciation. We also argue that, as a science with a strong “sense of place,” comparative phylogeography offers abundant “place-based” educational opportunities with its focus on geography and natural history, as well as opportunities for collaboration with local communities and indigenous peoples. Although comparative phylogeography does not yet require whole-genome sequencing for many of its goals, we conclude that it nonetheless plays an important role in grounding our interpretation of genetic variation in the fundamentals of geography and Earth history.
Corbett-Detig, Russell
Award ID(s):
Publication Date:
Journal Name: Genome Biology and Evolution
Sponsoring Org: National Science Foundation
  1. The North American tiger salamander species complex, including its best-known species, the Mexican axolotl, has long been a source of biological fascination. The complex exhibits a wide range of variation in developmental life history strategies, including populations and individuals that undergo metamorphosis; those able to forego metamorphosis and retain a larval, aquatic lifestyle (i.e., paedomorphosis); and those that do both. The evolution of a paedomorphic life history state is thought to lead to increased population genetic differentiation and ultimately reproductive isolation and speciation, but the degree to which it has shaped population- and species-level divergence is poorly understood. Using a large multilocus dataset from hundreds of samples across North America, we identified genetic clusters across the geographic range of the tiger salamander complex. These clusters often contain a mixture of paedomorphic and metamorphic taxa, indicating that geographic isolation has played a larger role in lineage divergence than paedomorphosis in this system. This conclusion is bolstered by geography-informed analyses indicating no effect of life history strategy on population genetic differentiation and by model-based population genetic analyses demonstrating gene flow between adjacent metamorphic and paedomorphic populations. This fine-scale genetic perspective on life history variation establishes a framework for understanding how plasticity, local adaptation, and gene flow contribute to lineage divergence. Many members of the tiger salamander complex are endangered, and the Mexican axolotl is an important model system in regenerative and biomedical research. Our results chart a course for more informed use of these taxa in experimental, ecological, and conservation research.
  2. Abstract Background

    Distributional responses by alpine taxa to repeated glacial-interglacial cycles throughout the last two million years have significantly influenced the spatial genetic structure of populations. These effects have been exacerbated for the American pika (Ochotona princeps), a small alpine lagomorph constrained by thermal sensitivity and a limited dispersal capacity. For this species of conservation concern, a long-term lack of gene flow has important consequences for landscape genetic structure and levels of diversity within populations. Here, we use reduced representation sequencing (ddRADseq) to provide a genome-wide perspective on patterns of genetic variation across pika populations representing distinct subspecies. To investigate how landscape and environmental features shape genetic variation, we collected genetic samples from distinct geographic regions as well as across finer spatial scales in two geographically proximate mountain ranges of eastern Nevada.


    Our genome-wide analyses corroborate range-wide, mitochondrial subspecific designations and reveal pronounced fine-scale population structure between the Ruby Mountains and East Humboldt Range of eastern Nevada. Populations in Nevada were characterized by low genetic diversity (π = 0.0006–0.0009; θW = 0.0005–0.0007) relative to populations in California (π = 0.0014–0.0019; θW = 0.0011–0.0017) and the Rocky Mountains (π = 0.0025–0.0027; θW = 0.0021–0.0024), indicating substantial genetic drift in these isolated populations. Tajima’s D was positive for all sites (D = 0.240–0.811), consistent with recent contraction in population sizes range-wide.


    Substantial influences of geography, elevation and climate variables on genetic differentiation were also detected and may interact with the regional effects of anthropogenic climate change to force the loss of unique genetic lineages through continued population extirpations in the Great Basin and Sierra Nevada.
  3. Advances in genomics have led to an appreciation that introgression is common, but its evolutionary consequences are poorly understood. In recent species radiations the sharing of genetic variation across porous species boundaries can facilitate adaptation to new environments and generate novel phenotypes, which may contribute to further diversification. Most Anopheles mosquito species that are of major importance as human malaria vectors have evolved within recent and rapid radiations of largely nonvector species. Here, we focus on one of the most medically important yet understudied anopheline radiations, the Afrotropical Anopheles funestus complex (AFC), to investigate the role of introgression in its diversification and the possible link between introgression and vector potential. The AFC comprises at least seven morphologically similar species, yet only An. funestus sensu stricto is a highly efficient malaria vector with a pan-African distribution. Based on de novo genome assemblies and additional whole-genome resequencing, we use phylogenomic and population genomic analyses to establish species relationships. We show that extensive interspecific gene flow involving multiple species pairs has shaped the evolutionary history of the AFC since its diversification. The most recent introgression event involved a massive and asymmetrical movement of genes from a distantly related AFC lineage into An. funestus, an event that predated and plausibly facilitated its subsequent dramatic geographic range expansion across most of tropical Africa. We propose that introgression may be a common mechanism facilitating adaptation to new environments and enhancing vectorial capacity in Anopheles mosquitoes.
  4. Lozier, J (Ed.)
    Comparative phylogeographic studies can distinguish between idiosyncratic and community-wide responses to past environmental change. However, to date, the impacts of species interactions have been largely overlooked. Here we used non-genetic data to characterize two competing scenarios about expected levels of congruence among five deadwood-associated (saproxylic) invertebrate species (i.e., a wood-feeding cockroach, termite, and beetle; a predatory centipede; and a detritivorous millipede) from the southern Appalachian Mountains, a globally recognized center of endemism. Under one scenario, abiotic factors primarily drove species’ responses, with predicted congruence based on the spatial overlap of climatically stable habitat areas estimated for each species via ecological niche modeling. The second scenario considered biotic factors to be most influential, with proxies for species interactions used to predict congruence. Analyses of mitochondrial and nuclear DNA sequences focused on four axes of comparison: the number and geographic distribution of distinct spatial-genetic clusters, phylogeographic structure, changes in effective population size, and historical gene flow dynamics. Overall, we found stronger support for the ecological co-associations scenario, suggesting an important influence of biotic factors in constraining or facilitating species’ responses to Pleistocene climatic cycles. However, there was an imperfect fit between predictions and outcomes of genetic data analyses. Thus, while thought-provoking, conclusions remain tentative until additional data on species interactions become available. Ultimately, the approaches presented here advance comparative phylogeography by expanding the scope of inferences beyond solely considering abiotic drivers, which we believe is too simplistic. This work also provides conservation-relevant insights into the evolutionary history of a functionally important ecological community.
  5. INTRODUCTION Transposable elements (TEs), repeat expansions, and repeat-mediated structural rearrangements play key roles in chromosome structure and species evolution, contribute to human genetic variation, and substantially influence human health through copy number variants, structural variants, insertions, deletions, and alterations to gene transcription and splicing. Despite their formative role in genome stability, repetitive regions have been relegated to gaps and collapsed regions in human genome reference GRCh38 owing to the technological limitations during its development. The lack of linear sequence in these regions, particularly in centromeres, resulted in the inability to fully explore the repeat content of the human genome in the context of both local and regional chromosomal environments. RATIONALE Long-read sequencing supported the complete, telomere-to-telomere (T2T) assembly of the pseudo-haploid human cell line CHM13. This resource affords a genome-scale assessment of all human repetitive sequences, including TEs and previously unknown repeats and satellites, both within and outside of gaps and collapsed regions. Additionally, a complete genome enables the opportunity to explore the epigenetic and transcriptional profiles of these elements that are fundamental to our understanding of chromosome structure, function, and evolution. Comparative analyses reveal modes of repeat divergence, evolution, and expansion or contraction with locus-level resolution. RESULTS We implemented a comprehensive repeat annotation workflow using previously known human repeats and de novo repeat modeling followed by manual curation, including assessing overlaps with gene annotations, segmental duplications, tandem repeats, and annotated repeats. Using this method, we developed an updated catalog of human repetitive sequences and refined previous repeat annotations.
We discovered 43 previously unknown repeats and repeat variants and characterized 19 complex, composite repetitive structures, which often carry genes, across T2T-CHM13. Using precision nuclear run-on sequencing (PRO-seq) and CpG methylated sites generated from Oxford Nanopore Technologies long-read sequencing data, we assessed RNA polymerase engagement across retroelements genome-wide, revealing correlations between nascent transcription, sequence divergence, CpG density, and methylation. These analyses were extended to evaluate RNA polymerase occupancy for all repeats, including high-density satellite repeats that reside in previously inaccessible centromeric regions of all human chromosomes. Moreover, using both mapping-dependent and mapping-independent approaches across early developmental stages and a complete cell cycle time series, we found that engaged RNA polymerase across satellites is low; in contrast, TE transcription is abundant and serves as a boundary for changes in CpG methylation and centromere substructure. Together, these data reveal the dynamic relationship between transcriptionally active retroelement subclasses and DNA methylation, as well as potential mechanisms for the derivation and evolution of new repeat families and composite elements. Focusing on the emerging T2T-level assembly of the HG002 X chromosome, we reveal that a high level of repeat variation likely exists across the human population, including composite element copy numbers that affect gene copy number. Additionally, we highlight the impact of repeats on the structural diversity of the genome, revealing repeat expansions with extreme copy number differences between humans and primates while also providing high-confidence annotations of retroelement transduction events. 
CONCLUSION The comprehensive repeat annotations and updated repeat models described herein serve as a resource for expanding the compendium of human genome sequences and reveal the impact of specific repeats on the human genome. In developing this resource, we provide a methodological framework for assessing repeat variation within and between human genomes. The exhaustive assessment of the transcriptional landscape of repeats, at both the genome scale and locally, such as within centromeres, sets the stage for functional studies to disentangle the role transcription plays in the mechanisms essential for genome stability and chromosome segregation. Finally, our work demonstrates the need to increase efforts toward achieving T2T-level assemblies for nonhuman primates and other species to fully understand the complexity and impact of repeat-derived genomic innovations that define primate lineages, including humans. Telomere-to-telomere assembly of CHM13 supports repeat annotations and discoveries. The human reference T2T-CHM13 filled gaps and corrected collapsed regions (triangles) in GRCh38. Combining long read–based methylation calls, PRO-seq, and multilevel computational methods, we provide a compendium of human repeats, define retroelement expression and methylation profiles, and delineate locus-specific sites of nascent transcription genome-wide, including previously inaccessible centromeres. SINE, short interspersed element; SVA, SINE–variable number tandem repeat–Alu; LINE, long interspersed element; LTR, long terminal repeat; TSS, transcription start site; pA, xxxxxxxxxxxxxxxx.