Title: TagSeq for gene expression in non‐model plants: A pilot study at the Santa Rita Experimental Range NEON core site
Premise: TagSeq is a cost‐effective approach for gene expression studies requiring a large number of samples. To date, TagSeq studies in plants have been limited to those with a high‐quality reference genome. We tested the suitability of reference transcriptomes for TagSeq in non‐model plants, as part of a study of natural gene expression variation at the Santa Rita Experimental Range National Ecological Observatory Network (NEON) core site. Methods: Tissue for TagSeq was sampled from multiple individuals of four species (Bouteloua aristidoides and Eragrostis lehmanniana [Poaceae], Tidestromia lanuginosa [Amaranthaceae], and Parkinsonia florida [Fabaceae]) at two locations on three dates (56 samples total). One sample per species was used to create a reference transcriptome via standard RNA‐seq. TagSeq performance was assessed by recovery of reference loci, specificity of tag alignments, and variation among samples. Results: A high fraction of tags aligned to each reference and mapped uniquely. Expression patterns were quantifiable for tens of thousands of loci, which revealed consistent spatial differentiation in expression for all species. Discussion: TagSeq using de novo reference transcriptomes was an effective approach to quantifying gene expression in this study. Tags were highly locus specific and generated biologically informative profiles for four non‐model plant species.
Award ID(s):
1750280 1550838
PAR ID:
10455033
Author(s) / Creator(s):
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Applications in Plant Sciences
Volume:
8
Issue:
11
ISSN:
2168-0450
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like This
  1. Premise: Large‐scale projects such as the National Ecological Observatory Network (NEON) collect ecological data on entire biomes to track climate change. NEON provides an opportunity to launch community transcriptomic projects that ask integrative questions in ecology and evolution. We conducted a pilot study to investigate the challenges of collecting RNA‐seq data from diverse plant communities. Methods: We generated >650 Gbp of RNA‐seq for 24 vascular plant species representing 12 genera and nine families at the Harvard Forest NEON site. Each species was sampled twice in 2016 (July and August). We assessed transcriptome quality and content with TransRate, BUSCO, and Gene Ontology annotations. Results: Only modest differences in assembly quality were observed across multiple k‐mers. On average, transcriptomes contained hits to >70% of loci in the BUSCO database. We found no significant difference in the number of assembled and annotated transcripts between diploid and polyploid transcriptomes. Discussion: We provide new RNA‐seq data sets for 24 species of vascular plants in Harvard Forest. Challenges associated with this type of study included recovery of high‐quality RNA from diverse species and access to NEON sites for genomic sampling. Overcoming these challenges offers opportunities for large‐scale studies at the intersection of ecology and genomics.
  2. Premise: Cornales is an order of flowering plants containing ecologically and horticulturally important families, including Cornaceae (dogwoods) and Hydrangeaceae (hydrangeas), among others. While many relationships in Cornales are strongly supported by previous studies, some uncertainty remains with regards to the placement of Hydrostachyaceae and to relationships among families in Cornales and within Cornaceae. Here we analyzed hundreds of nuclear loci to test published phylogenetic hypotheses and estimated a robust species tree for Cornales. Methods: Using the Angiosperms353 probe set and existing data sets, we generated phylogenomic data for 158 samples, representing all families in the Cornales, with intensive sampling in the Cornaceae. Results: We curated an average of 312 genes per sample, constructed maximum likelihood gene trees, and inferred a species tree using the summary approach implemented in ASTRAL‐III, a method statistically consistent with the multispecies coalescent model. Conclusions: The species tree we constructed generally shows high support values and a high degree of concordance among individual nuclear gene trees. Relationships among families are largely congruent with previous molecular studies, except for the placement of the nyssoids and the Grubbiaceae‐Curtisiaceae clades. Furthermore, we were able to place Hydrostachyaceae within Cornales, and within Cornaceae, the monophyly of known morphogroups was well supported. However, patterns of gene tree discordance suggest potential ancient reticulation, gene flow, and/or ILS in the Hydrostachyaceae lineage and the early diversification of Cornus. Our findings reveal new insights into the diversification process across Cornales and demonstrate the utility of the Angiosperms353 probe set.
  3. Premise: Understanding the relationship between genetic structure and geography provides information about a species’ history and can be used for breeding and conservation goals. The North American prairie is interesting because of its recent origin and subsequent fragmentation. Silphium integrifolium, an iconic perennial American prairie wildflower, is targeted for domestication, having undergone a few generations of improvement. We present the first application of population genetic data in this species to address the following goals: (1) improve breeding by characterizing genetic structure and (2) identify the species’ geographic origin and potential targets and drivers of selection during range expansion. Methods: We developed a reference transcriptome as a genotyping reference for samples from throughout the species range. Population genetic analyses were used to describe patterns of genetic variation, and demographic modeling was used to characterize potential processes that shaped variation. Outlier scans for selection and associations with environmental variables were used to identify loci linked to putative targets and drivers of selection. Results: Genetic variation partitioned samples into three geographic clusters. Patterns of variation and demographic modeling suggest that the species origin is in the American Southeast. Breeding program accessions are from the region with lowest observed genetic variation. Conclusions: This prairie species did not originate within the prairie. Breeding may be improved by including accessions from outside of the germplasm founding region. The geographic structuring and the identified targets and drivers of adaptation can guide collecting efforts toward populations with beneficial agronomic traits.
  4. Premise: Apocynaceae is the 10th largest flowering plant family and a focus for study of plant–insect interactions, especially as mediated by secondary metabolites. However, it has few genomic resources relative to its size. Target capture sequencing is a powerful approach for genome reduction that facilitates studies requiring data from the nuclear genome in non‐model taxa, such as Apocynaceae. Methods: Transcriptomes were used to design probes for targeted sequencing of putatively single‐copy nuclear genes across Apocynaceae. The sequences obtained were used to assess the success of the probe design, the intrageneric and intraspecific variation in the targeted genes, and the utility of the genes for inferring phylogeny. Results: From 853 candidate nuclear genes, 835 were consistently recovered in single copy and were variable enough for phylogenomics. The inferred gene trees were useful for coalescent‐based species tree analysis, which showed all subfamilies of Apocynaceae as monophyletic, while also resolving relationships among species within the genus Apocynum. Intraspecific comparison of Elytropus chilensis individuals revealed numerous single‐nucleotide polymorphisms with potential for use in population‐level studies. Discussion: Community use of this Hyb‐Seq probe set will facilitate and promote progress in the study of Apocynaceae across scales from population genomics to phylogenomics.
  5. Background: The maize inbred line A188 is an attractive model for elucidation of gene function and improvement due to its high embryogenic capacity and many contrasting traits to the first maize reference genome, B73, and other elite lines. The lack of a genome assembly of A188 limits its use as a model for functional studies. Results: Here, we present a chromosome-level genome assembly of A188 using long reads and optical maps. Comparison of A188 with B73 using both whole-genome alignments and read depths from sequencing reads identify approximately 1.1 Gb of syntenic sequences as well as extensive structural variation, including a 1.8-Mb duplication containing the Gametophyte factor1 locus for unilateral cross-incompatibility, and six inversions of 0.7 Mb or greater. Increased copy number of carotenoid cleavage dioxygenase 1 (ccd1) in A188 is associated with elevated expression during seed development. High ccd1 expression in seeds together with low expression of yellow endosperm 1 (y1) reduces carotenoid accumulation, accounting for the white seed phenotype of A188. Furthermore, transcriptome and epigenome analyses reveal enhanced expression of defense pathways and altered DNA methylation patterns of the embryonic callus. Conclusions: The A188 genome assembly provides a high-resolution sequence for a complex genome species and a foundational resource for analyses of genome variation and gene function in maize. The genome, in comparison to B73, contains extensive intra-species structural variations and other genetic differences. Expression and network analyses identify discrete profiles for embryonic callus and other tissues.