skip to main content


Title: Reference Genome for the Highly Transformable Setaria viridis ME034V
Abstract Setaria viridis (green foxtail) is an important model system for improving cereal crops due to its diploid genome, ease of cultivation, and use of C4 photosynthesis. The S. viridis accession ME034V is exceptionally transformable, but the lack of a sequenced genome for this accession has limited its utility. We present a 397 Mb highly contiguous de novo assembly of ME034V using ultra-long nanopore sequencing technology (read N50 = 41kb). We estimate that this genome is largely complete based on our updated k-mer based genome size estimate of 401 Mb for S. viridis. Genome annotation identified 37,908 protein-coding genes and >300k repetitive elements comprising 46% of the genome. We compared the ME034V assembly with two other previously sequenced Setaria genomes as well as to a diversity panel of 235 S. viridis accessions. We found the genome assemblies to be largely syntenic, but numerous unique polymorphic structural variants were discovered. Several ME034V deletions may be associated with recent retrotransposition of copia and gypsy LTR repeat families, as evidenced by their low genotype frequencies in the sampled population. Lastly, we performed a phylogenomic analysis to identify gene families that have expanded in Setaria, including those involved in specialized metabolism and plant defense response. The high continuity of the ME034V genome assembly validates the utility of ultra-long DNA sequencing to improve genetic resources for emerging model organisms. Structural variation present in Setaria illustrates the importance of obtaining the proper genome reference for genetic experiments. Thus, we anticipate that the ME034V genome will be of significant utility for the Setaria research community.  more » « less
Award ID(s):
1831493
NSF-PAR ID:
10273207
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
G3 Genes|Genomes|Genetics
Volume:
10
Issue:
10
ISSN:
2160-1836
Page Range / eLocation ID:
3467 to 3478
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species.

    Findings

    Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements.

    Conclusions

    Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.

     
    more » « less
  2. Pyhäjärvi, T (Ed.)
    Abstract Blackberries (Rubus spp.) are the fourth most economically important berry crop worldwide. Genome assemblies and annotations have been developed for Rubus species in subgenus Idaeobatus, including black raspberry (R. occidentalis), red raspberry (R. idaeus), and R. chingii, but very few genomic resources exist for blackberries and their relatives in subgenus Rubus. Here we present a chromosome-length assembly and annotation of the diploid blackberry germplasm accession “Hillquist” (R. argutus). “Hillquist” is the only known source of primocane-fruiting (annual-fruiting) in tetraploid fresh-market blackberry breeding programs and is represented in the pedigree of many important cultivars worldwide. The “Hillquist” assembly, generated using Pacific Biosciences long reads scaffolded with high-throughput chromosome conformation capture sequencing, consisted of 298 Mb, of which 270 Mb (90%) was placed on 7 chromosome-length scaffolds with an average length of 38.6 Mb. Approximately 52.8% of the genome was composed of repetitive elements. The genome sequence was highly collinear with a novel maternal haplotype-resolved linkage map of the tetraploid blackberry selection A-2551TN and genome assemblies of R. chingii and red raspberry. A total of 38,503 protein-coding genes were predicted, of which 72% were functionally annotated. Eighteen flowering gene homologs within a previously mapped locus aligning to an 11.2 Mb region on chromosome Ra02 were identified as potential candidate genes for primocane-fruiting. The utility of the “Hillquist” genome has been demonstrated here by the development of the first genotyping-by-sequencing-based linkage map of tetraploid blackberry and the identification of possible candidate genes for primocane-fruiting. This chromosome-length assembly will facilitate future studies in Rubus biology, genetics, and genomics and strengthen applied breeding programs. 
    more » « less
  3. Abstract

    Wild and weedy relatives of domesticated crops harbor genetic variants that can advance agricultural biotechnology. Here we provide a genome resource for the wild plant green millet (Setaria viridis), a model species for studies of C4grasses, and use the resource to probe domestication genes in the close crop relative foxtail millet (Setaria italica). We produced a platinum-quality genome assembly ofS. viridisand de novo assemblies for 598 wild accessions and exploited these assemblies to identify loci underlying three traits: response to climate, a ‘loss of shattering’ trait that permits mechanical harvest and leaf angle, a predictor of yield in many grass crops. With CRISPR–Cas9 genome editing, we validatedLess Shattering1(SvLes1) as a gene whose product controls seed shattering. InS. italica, this gene was rendered nonfunctional by a retrotransposon insertion in the domesticated loss-of-shattering alleleSiLes1-TE(transposable element). This resource will enhance the utility ofS. viridisfor dissection of complex traits and biotechnological improvement of panicoid crops.

     
    more » « less
  4. Abstract

    Symbiotic relationships between vestimentiferan tubeworms and chemosynthetic Gammaproteobacteria build the foundations of many hydrothermal vent and hydrocarbon seep ecosystems in the deep sea. The association between the vent tubewormRiftia pachyptilaand its endosymbiontCandidatusEndoriftia persephone has become a model system for symbiosis research in deep‐sea vestimentiferans, while markedly fewer studies have investigated symbiotic relationships in other tubeworm species, especially at cold seeps. Here we sequenced the endosymbiont genome of the tubewormLamellibrachia barhamifrom a cold seep in the Gulf of California, using short‐ and long‐read sequencing technologies in combination with Hi‐C and Dovetail Chicago libraries. Our final assembly had a size of ~4.17 MB, a GC content of 54.54%, 137X coverage, 4153 coding sequences, and aCheckMcompleteness score of 97.19%. A single scaffold contained 99.51% of the genome. Comparative genomic analyses indicated that theL. barhamisymbiont shares a set of core genes and many metabolic pathways with other vestimentiferan symbionts, while containing 433 unique gene clusters that comprised a variety of transposases, defence‐related genes and a lineage‐specific CRISPR/Cas3 system. This assembly represents the most contiguous tubeworm symbiont genome resource to date and will be particularly valuable for future comparative genomic studies investigating structural genome evolution, physiological adaptations and host‐symbiont communication in chemosynthetic animal‐microbe symbioses.

     
    more » « less
  5. Abstract

    Acarospora socialis, the bright cobblestone lichen, is commonly found in southwestern North America. This charismatic yellow lichen is a species of key ecological significance as it is often a pioneer species in new environments. Despite their ecological importance virtually no research has been conducted on the genomics of A. socialis. To address this, we used long-read sequencing to generate the first high-quality draft genome of A. socialis. Lichen thallus tissue was collected from Pinkham Canyon in Joshua Tree National Park, California and deposited in the UC Riverside herbarium under accession #295874. The de novo assembly of the mycobiont partner of the lichen was generated from Pacific Biosciences HiFi long reads and Dovetail Omni-C chromatin capture data. After removing algal and bacterial contigs, the fungal genome was approximately 31.2 Mb consisting of 38 scaffolds with contig and scaffold N50 of 2.4 Mb. The BUSCO completeness score of the assembled genome was 97.5% using the Ascomycota gene set. Information on the genome of A. socialis is important for California conservation purposes given that this lichen is threatened in some places locally by wildfires due to climate change. This reference genome will be used for understanding the genetic diversity, population genomics, and comparative genomics of A. socialis species. Genomic resources for this species will support population and landscape genomics investigations, exploring the use of A. socialis as a bioindicator species for climate change, and in studies of adaptation by comparing populations that occur across aridity gradients in California.

     
    more » « less