skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Cultivar-specific transcriptome and pan-transcriptome reconstruction of tetraploid potato
Although the reference genome of Solanum tuberosum Group Phureja double-monoploid (DM) clone is available, knowledge on the genetic diversity of the highly heterozygous tetraploid Group Tuberosum, representing most cultivated varieties, remains largely unexplored. This lack of knowledge hinders further progress in potato research. In conducted investigation, we first merged and manually curated the two existing partially-overlapping DM genome-based gene models, creating a union of genes in Phureja scaffold. Next, we compiled available and newly generated RNA-Seq datasets (cca. 1.5 billion reads) for three tetraploid potato genotypes (cultivar Désirée, cultivar Rywal, and breeding clone PW363) with diverse breeding pedigrees. Short-read transcriptomes were assembled using several de novo assemblers under different settings to test for optimal outcome. For cultivar Rywal, PacBio Iso-Seq full-length transcriptome sequencing was also performed. EvidentialGene redundancy-reducing pipeline complemented with in-house developed scripts was employed to produce accurate and complete cultivar-specific transcriptomes, as well as to attain the pan-transcriptome. The generated transcriptomes and pan-transcriptome represent a valuable resource for potato gene variability exploration, high-throughput omics analyses, and breeding programmes.  more » « less
Award ID(s):
1759906
PAR ID:
10190673
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Scientific data
Volume:
7
ISSN:
2052-4463
Page Range / eLocation ID:
249
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Harris, T (Ed.)
    Abstract Potato is a key food crop with a complex, polyploid genome. Advancements in sequencing technologies coupled with improvements in genome assembly algorithms have enabled generation of phased, chromosome-scale genome assemblies for cultivated tetraploid potato. The SpudDB database houses potato genome sequence and annotation, with the doubled monoploid DM 1–3 516 R44 (hereafter DM) genome serving as the reference genome and haplotype. Diverse annotation data types for DM genes are provided through a suite of Gene Report Pages including gene expression profiles across 438 potato samples. To further annotate potato genes based on expression, 65 gene co-expression modules were constructed that permit the identification of tightly co-regulated genes within DM across development and responses to wounding, abiotic stress, and biotic stress. Genome browser views of DM and 28 other potato genomes are provided along with a download page for genome sequence and annotation. To link syntenic genes within and between haplotypes, syntelogs were identified across 25 cultivated potato genomes. Through access to potato genome sequences and associated annotations, SpudDB can enable potato biologists, geneticists, and breeders to continue to improve this key food crop. 
    more » « less
  2. Udall, J (Ed.)
    Abstract Availability of readily transformable germplasm, as well as efficient pipelines for gene discovery are notable bottlenecks in the application of genome editing in potato. To study and introduce traits such as resistance against biotic and abiotic factors, tuber quality traits and self-fertility, model germplasm that is amenable to gene editing and regeneration is needed. Cultivated potato is a heterozygous autotetraploid and its genetic redundancy and complexity makes studying gene function challenging. Genome editing is simpler at the diploid level, with fewer allelic variants to consider. A readily transformable diploid potato would be further complemented by genomic resources that could aid in high throughput functional analysis. The heterozygous Solanum tuberosum Group Phureja clone 1S1 has a high regeneration rate, self-fertility, desirable tuber traits and is amenable to Agrobacterium-mediated transformation. We leveraged its amenability to Agrobacterium-mediated transformation to create a Cas9 constitutively expressing line for use in viral vector-based gene editing. To create a contiguous genome assembly, a homozygous doubled monoploid of 1S1 (DM1S1) was sequenced using 44 Gbp of long reads generated from Oxford Nanopore Technologies (ONT), yielding a 736 Mb assembly that encoded 31,145 protein-coding genes. The final assembly for DM1S1 represents a nearly complete genic space, shown by the presence of 99.6% of the genes in the Benchmarking Universal Single Copy Orthologs (BUSCO) set. Variant analysis with Illumina reads from 1S1 was used to deduce its alternate haplotype. These genetic and genomic resources provide a toolkit for applications of genome editing in both basic and applied research of potato. 
    more » « less
  3. null (Ed.)
    Abstract The challenges of breeding autotetraploid potato (Solanum tuberosum) have motivated the development of alternative breeding strategies. A common approach is to obtain uniparental dihaploids from a tetraploid of interest through pollination with S. tuberosum Andigenum Group (formerly S. phureja) cultivars. The mechanism underlying haploid formation of these crosses is unclear, and questions regarding the frequency of paternal DNA transmission remain. Previous reports have described aneuploid and euploid progeny that, in some cases, displayed genetic markers from the haploid inducer (HI). Here, we surveyed a population of 167 presumed dihaploids for large-scale structural variation that would underlie chromosomal addition from the HI, and for small-scale introgression of genetic markers. In 19 progeny, we detected 10 of the 12 possible trisomies and, in all cases, demonstrated the noninducer parent origin of the additional chromosome. Deep sequencing indicated that occasional, short-tract signals appearing to be of HI origin were better explained as technical artifacts. Leveraging recurring copy number variation patterns, we documented subchromosomal dosage variation indicating segregation of polymorphic maternal haplotypes. Collectively, 52% of the assayed chromosomal loci were classified as dosage variable. Our findings help elucidate the genomic consequences of potato haploid induction and suggest that most potato dihaploids will be free of residual pollinator DNA. 
    more » « less
  4. null (Ed.)
    Abstract In cultivated tetraploid potato (Solanum tuberosum), reduction to diploidy (dihaploidy) allows for hybridization to diploids and introgression breeding and may facilitate the production of inbreds. Pollination with haploid inducers yields maternal dihaploids, as well as triploid and tetraploid hybrids. Dihaploids may result from parthenogenesis, entailing the development of embryos from unfertilized eggs, or genome elimination, entailing missegregation and the loss of paternal chromosomes. A sign of genome elimination is the occasional persistence of haploid inducer DNA in some dihaploids. We characterized the genomes of 919 putative dihaploids and 134 hybrids produced by pollinating tetraploid clones with three haploid inducers: IVP35, IVP101, and PL-4. Whole-chromosome or segmental aneuploidy was observed in 76 dihaploids, with karyotypes ranging from 2n=2x-1=23 to 2n=2x+3=27. Of the additional chromosomes in 74 aneuploids, 66 were from the non-inducer parent and 8 from the inducer parent. Overall, we detected full or partial chromosomes from the haploid inducer parent in 0.87% of the dihaploids, irrespective of parental genotypes. Chromosomal breaks commonly affected the paternal genome in the dihaploid and tetraploid progeny, but not in the triploid progeny, correlating instability to sperm ploidy and to haploid induction. The residual haploid inducer DNA discovered in the progeny is consistent with genome elimination as the mechanism of haploid induction. 
    more » « less
  5. Gaut, Brandon (Ed.)
    Abstract As the closest extant sister group to seed plants, ferns are an important reference point to study the origin and evolution of plant genes and traits. One bottleneck to the use of ferns in phylogenetic and genetic studies is the fact that genome-level sequence information of this group is limited, due to the extreme genome sizes of most ferns. Ceratopteris richardii (hereafter Ceratopteris) has been widely used as a model system for ferns. In this study, we generated a transcriptome of Ceratopteris, through the de novo assembly of the RNA-seq data from 17 sequencing libraries that are derived from two sexual types of gametophytes and five different sporophyte tissues. The Ceratopteris transcriptome, together with 38 genomes and transcriptomes from other species across the Viridiplantae, were used to uncover the evolutionary dynamics of orthogroups (predicted gene families using OrthoFinder) within the euphyllophytes and identify proteins associated with the major shifts in plant morphology and physiology that occurred in the last common ancestors of euphyllophytes, ferns, and seed plants. Furthermore, this resource was used to identify and classify the GRAS domain transcriptional regulators of many developmental processes in plants. Through the phylogenetic analysis within each of the 15 GRAS orthogroups, we uncovered which GRAS family members are conserved or have diversified in ferns and seed plants. Taken together, the transcriptome database and analyses reported here provide an important platform for exploring the evolution of gene families in land plants and for studying gene function in seed-free vascular plants. 
    more » « less