skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 10:00 PM ET on Thursday, February 12 until 1:00 AM ET on Friday, February 13 due to maintenance. We apologize for the inconvenience.


Title: Sequencing Disparity in the Genomic Era
Abstract Advances in sequencing technology have resulted in the expectation that genomic studies will become more representative of organismal diversity. To test this expectation, we explored species representation of nonhuman eukaryotes in the Sequence Read Archive. Though species richness has been increasing steadily, species evenness is decreasing over time. Moreover, the top 1% most studied organisms increasingly represent a larger proportion of total experiments, demonstrating growing bias in favor of a small minority of species. To better understand molecular processes and patterns, genomic studies should reverse current trends by adopting more comparative approaches.  more » « less
Award ID(s):
1831094
PAR ID:
10109389
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
36
Issue:
8
ISSN:
0737-4038
Page Range / eLocation ID:
1624 to 1627
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Mank, Judith (Ed.)
    Abstract Many animal species are haplodiploid: their fertilized eggs develop into diploid females and their unfertilized eggs develop into haploid males. The unique genetic features of haplodiploidy raise the prospect that these systems can be used to disentangle the population genetic consequences of haploid and diploid selection. To this end, sex-specific reproductive genes are of particular interest because, while they are shared within the same genome, they consistently experience selection in different ploidal environments. However, other features of these genes, including sex-specific expression and putative involvement in postcopulatory sexual selection, are potentially confounding factors because they may also impact the efficacy of selection asymmetrically between the sexes. Thus, to properly interpret evolutionary genomic patterns, it is necessary to generate a null expectation for the relative amount of polymorphism and divergence we expect to observe among sex-specific genes in haplodiploid species, given differences in ploidal environment, sex-limited expression, and their potential role in sexual selection. Here, we derive the theoretical expectation for the rate of evolution of sex-specific genes in haplodiploid species, under the assumption that they experience the same selective environment as genes expressed in both sexes. We find that the null expectation is that reproductive genes evolve more rapidly than constitutively expressed genes in haplodiploid genomes. However, despite the aforementioned differences, the null expectation does not differ between male- and female-specific reproductive genes, when assuming additivity. Our theoretical results provide an important baseline expectation that should be used in molecular evolution studies comparing rates of evolution among classes of genes in haplodiploid species. 
    more » « less
  2. Fraser, Bonnie (Ed.)
    Abstract Kangaroo rats in the genus Dipodomys are found in a variety of habitat types in western North America, including deserts, arid and semiarid grasslands, and scrublands. Many Dipodomys species are experiencing strong population declines due to increasing habitat fragmentation, with two species listed as federally endangered in the United States. The precarious state of many Dipodomys populations, including those occupying extreme environments, make species of this genus valuable subjects for studying the impacts of habitat degradation and fragmentation on population genomic patterns and for characterizing the genomic bases of adaptation to harsh conditions. To facilitate exploration of such questions, we assembled and annotated a reference genome for the banner-tailed kangaroo rat (Dipodomys spectabilis) using PacBio HiFi sequencing reads, providing a more contiguous genomic resource than two previously assembled Dipodomys genomes. Using the HiFi data for D. spectabilis and publicly available sequencing data for two other Dipodomys species (Dipodomys ordii and Dipodomys stephensi), we demonstrate the utility of this new assembly for studies of congeners by conducting inference of historic effective population sizes (Ne) and linking these patterns to the species’ current extinction risk statuses. The genome assembly presented here will serve as a valuable resource for population and conservation genomic studies of Dipodomys species, comparative genomic research within mammals and rodents, and investigations into genomic adaptation to extreme environments and changing landscapes. 
    more » « less
  3. Understanding the evolutionary consequences of anthropogenic change is imperative for estimating long-term species resilience. While contemporary genomic data can provide us with important insights into recent demographic histories, investigating past change using present genomic data alone has limitations. In comparison, temporal genomics studies, defined herein as those that incorporate time series genomic data, utilize museum collections and repeated field sampling to directly examine evolutionary change. As temporal genomics is applied to more systems, species and questions, best practices can be helpful guides to make the most efficient use of limited resources. Here, we conduct a systematic literature review to synthesize the effects of temporal genomics methodology on our ability to detect evolutionary changes. We focus on studies investigating recent change within the past 200 years, highlighting evolutionary processes that have occurred during the past two centuries of accelerated anthropogenic pressure. We first identify the most frequently studied taxa, systems, questions and drivers, before highlighting overlooked areas where further temporal genomic studies may be particularly enlightening. Then, we provide guidelines for future study and sample designs while identifying key considerations that may influence statistical and analytical power. Our aim is to provide recommendations to a broad array of researchers interested in using temporal genomics in their work. 
    more » « less
  4. Genomic species delimitation is transforming how we understand and define species by enabling a process-oriented and efficient approach to identifying species boundaries. This review outlines the two key steps in genomic species delimitation: (a) discovering species-level units and (b) assessing their validity. Validity can be evaluated by a diversity of approaches, including applying the multispecies coalescent to delineate the population–species boundary and using estimated gene flow as a proxy for reproductive isolation. We illustrate the utility of these methods across the tree of life through a comprehensive review of published articles and case studies on birds, siphonophores, and bacteria. Despite the many benefits of genomic species delimitation, challenges remain. In particular, genomic divergence does not always accurately reflect ecological divergence and reproductive barriers, and genome heterogeneity can complicate the overall understanding of genetic divergence. We discuss these challenges and potential solutions. 
    more » « less
  5. The F-box proteins function as substrate receptors to determine the specificity of Skp1-Cul1-F-box ubiquitin ligases. Genomic studies revealed large and diverse sizes of the F-box gene superfamily across plant species. Our previous studies suggested that the plant F-box gene superfamily is under genomic drift evolution promoted by epigenomic programming. However, how the size of the superfamily drifts across plant genomes is currently unknown. Through a large-scale genomic and phylogenetic comparison of the F-box gene superfamily covering 110 green plants and one red algal species, I discovered four distinct groups of plant F-box genes with diverse evolutionary processes. While the members in Clusters 1 and 2 are species/lineage-specific, those in Clusters 3 and 4 are present in over 46 plant genomes. Statistical modeling suggests that F-box genes from the former two groups are skewed toward fewer species and more paralogs compared to those of the latter two groups whose presence frequency and sizes in plant genomes follow a random statistical model. The enrichment of known Arabidopsis F-box genes in Clusters 3 and 4, along with comprehensive biochemical evidence showing that Arabidopsis members in Cluster 4 interact with the Arabidopsis Skp1-like 1 (ASK1), demonstrates over-representation of active F-box genes in these two groups. Collectively, I propose purifying and dosage balancing selection models to explain the lineage/species-specific duplications and expansions of F-box genes in plant genomes. The purifying selection model suggests that most, if not all, lineage/species-specific F-box genes are detrimental and are thus kept at low frequencies in plant genomes. 
    more » « less