Title: On the potential of Angiosperms353 for population genomic studies
PREMISE: The successful application of universal targeted sequencing markers, such as those developed for the Angiosperms353 probe set, within populations could reduce or eliminate the need for specific marker development, while retaining the benefits of full-gene sequences in population-level analyses. However, whether the Angiosperms353 markers provide sufficient variation within species to calculate demographic parameters is untested.
METHODS: Using herbarium specimens from a 50-year-old floristic survey in Texas, we sequenced 95 samples from 24 species using the Angiosperms353 probe set. Our data workflow calls variants within species and prepares data for population genetic analysis using standard metrics. In our case study, gene recovery was affected by genomic library concentration only at low concentrations and displayed limited phylogenetic bias.
RESULTS: We identified over 1000 segregating variants with zero missing data for 92% of species and demonstrate that Angiosperms353 markers contain sufficient variation to estimate pairwise nucleotide diversity (π), typically between 0.002 and 0.010, with most variation found in flanking non-coding regions. In a subset of variants that were filtered to reduce linkage, we uncovered high heterozygosity in many species, suggesting that denser sampling within species should permit estimation of gene flow and population dynamics.
DISCUSSION: Angiosperms353 should benefit conservation genetic studies by providing universal repeatable markers, low missing data, and haplotype information, while permitting inclusion of decades-old herbarium specimens.
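For readers unfamiliar with the statistic, pairwise nucleotide diversity (π) is the average number of differences per site between all pairs of sequences in a sample. The sketch below is only an illustration of that definition, not the authors' workflow: the function name `nucleotide_diversity` is hypothetical, the four toy haplotypes are invented for the example, and the code assumes fully aligned sequences with no gaps or missing data.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site among aligned sequences.

    Hypothetical helper for illustration; assumes equal-length sequences
    with no gaps or missing calls.
    """
    if len(sequences) < 2:
        raise ValueError("need at least two sequences")
    length = len(sequences[0])
    if length == 0 or any(len(s) != length for s in sequences):
        raise ValueError("sequences must share the same non-zero length")

    total_diffs = 0
    n_pairs = 0
    # Compare every unordered pair of sequences site by site.
    for a, b in combinations(sequences, 2):
        total_diffs += sum(1 for x, y in zip(a, b) if x != y)
        n_pairs += 1

    # Mean number of differences per pair, scaled by alignment length.
    return total_diffs / (n_pairs * length)


# Toy data, invented for this example (not from the study).
haplotypes = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC", "ACGTACGTAC"]
pi = nucleotide_diversity(haplotypes)
```

Real data sets would also need to handle missing genotypes, unequal sample sizes, and invariant sites that were never called, all of which this sketch ignores.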
Award ID(s):
1753800 1902078
PAR ID:
10233346
Journal Name:
Applications in Plant Sciences
ISSN:
2168-0450
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract: The genus Solidago represents a taxonomically challenging group due to its sheer number of species, putative hybridization, polyploidy, and shallow genetic divergence among species. Here we use a dataset obtained exclusively from herbarium specimens to evaluate the status of Solidago ulmifolia var. palmeri, a morphologically subtle taxon potentially confined to Alabama, Arkansas, Mississippi, and Missouri. A multivariate analysis of both discrete and continuous morphological data revealed no clear distinction between S. ulmifolia var. palmeri and Solidago ulmifolia var. ulmifolia. Solidago ulmifolia var. palmeri’s status was also assessed with a phylogenomic and SNP clustering analysis of data generated with the “Angiosperms353” probe kit. Neither analysis supported Solidago ulmifolia var. palmeri as a distinct taxon, and we suggest that this name should be discarded. The status of Solidago delicatula (formerly known as Solidago ulmifolia var. microphylla) was also assessed. Both morphological and phylogenetic analyses supported the species status of S. delicatula and we suggest maintaining this species at its current rank. These results highlight the utility of the Angiosperms353 probe kit, both with herbarium tissue and at lower taxonomic levels. Indeed, this is the first study to utilize this kit to identify genetic groups within a species.
  2. Premise: Divergence depends on the strength of selection and frequency of gene flow between taxa, while reproductive isolation relies on mating barriers and geographic distance. Less is known about how these processes interact at early stages of speciation. Here, we compared population-level differentiation in floral phenotype and genetic sequence variation among recently diverged Castilleja to explore patterns of diversification under different scenarios of reproductive isolation. Methods: Using target enrichment enabled by the Angiosperms353 probe set, we assessed genetic distance among 50 populations of four Castilleja species. We investigated whether patterns of genetic divergence are explained by floral trait variation or geographic distance in two focal groups: the widespread C. sessiliflora and the more restricted C. purpurea species complex. Results: We document that C. sessiliflora and the C. purpurea complex are characterized by high diversity in floral color across varying geographic scales. Despite phenotypic divergence, groups were not well supported in phylogenetic analyses, and little genetic differentiation was found across targeted Angiosperms353 loci. Nonetheless, a principal coordinate analysis of single nucleotide polymorphisms revealed differentiation within C. sessiliflora across floral morphs and geography and less differentiation among species of the C. purpurea complex. Conclusions: Patterns of genetic distance in C. sessiliflora suggest species cohesion maintained over long distances despite variation in floral traits. In the C. purpurea complex, divergence in floral color across narrow geographic clines may be driven by recent selection on floral color. These contrasting patterns of floral and genetic differentiation reveal that divergence can arise via multiple eco-evolutionary paths.
  3. Premise: Phylogenetic studies in the Compositae are challenging due to the sheer size of the family and the challenges they pose for molecular tools, ranging from the genomic impact of polyploid events to their very conserved plastid genomes. The search for better molecular tools for phylogenetic studies led to the development of the family-specific Compositae1061 probe set, as well as the universal Angiosperms353 probe set designed for all flowering plants. In this study, we evaluate the extent to which data generated using the family-specific kit and those obtained with the universal kit can be merged for downstream analyses. Methods: We used comparative methods to verify the presence of shared loci between probe sets. Using two sets of eight samples sequenced with Compositae1061 and Angiosperms353, we ran phylogenetic analyses with and without loci flagged as paralogs, a gene tree discordance analysis, and a complementary phylogenetic analysis mixing samples from both sample sets. Results: Our results show that the Compositae1061 kit provides an average of 721 loci, with 9–46% of them presenting paralogs, while the Angiosperms353 set yields an average of 287 loci, which are less affected by paralogy. Analyses mixing samples from both sets showed that the presence of 30 shared loci in the probe sets allows the combination of data generated in different ways. Discussion: Combining data generated using different probe sets opens up the possibility of collaborative efforts and shared data within the synantherological community.
  4. Charleston, Michael (Ed.)
    Abstract: We present a 517-gene phylogenetic framework for the breadfruit genus Artocarpus (ca. 70 spp., Moraceae), making use of silica-dried leaves from recent fieldwork and herbarium specimens (some up to 106 years old) to achieve 96% taxon sampling. We explore issues relating to assembly, paralogous loci, partitions, and analysis method to reconstruct a phylogeny that is robust to variation in data and available tools. Although codon partitioning did not result in any substantial topological differences, the inclusion of flanking noncoding sequence in analyses significantly increased the resolution of gene trees. We also found that increasing the size of data sets increased convergence between analysis methods but did not reduce gene-tree conflict. We optimized the HybPiper targeted-enrichment sequence assembly pipeline for short sequences derived from degraded DNA extracted from museum specimens. Although the subgenera of Artocarpus were monophyletic, revision is required at finer scales, particularly with respect to widespread species. We expect our results to provide a basis for further studies in Artocarpus and provide guidelines for future analyses of data sets based on target enrichment data, particularly those using sequences from both fresh and museum material, counseling careful attention to the potential of off-target sequences to improve resolution. [Artocarpus; Moraceae; noncoding sequences; phylogenomics; target enrichment.]
  5. Multicopy ampliconic gene families on the Y chromosome play an important role in spermatogenesis. Thus, studying their genetic variation in endangered great ape species is critical. We estimated the sizes (copy number) of nine Y ampliconic gene families in population samples of chimpanzee, bonobo, and orangutan with droplet digital polymerase chain reaction, combined these estimates with published data for human and gorilla, and produced genome-wide testis gene expression data for great apes. Analyzing this comprehensive data set within an evolutionary framework, we first found high inter- and intraspecific variation in gene family size, with larger families exhibiting higher variation than smaller families, a pattern consistent with random genetic drift. Second, for four gene families, we observed significant interspecific size differences, sometimes even between the sister species chimpanzee and bonobo. Third, despite substantial variation in copy number, Y ampliconic gene families’ expression levels did not differ significantly among species, suggesting dosage regulation. Fourth, for three gene families, size was positively correlated with gene expression levels across species, suggesting that, given sufficient evolutionary time, copy number influences gene expression. Our results indicate high variability in size but conservation in gene expression levels in Y ampliconic gene families, significantly advancing our understanding of Y-chromosome evolution in great apes.