Erysiphe necator is an economically important biotrophic fungal pathogen responsible for powdery mildew disease on grapevine. Currently, genome sequences are available for only a few E. necator isolates from the United States. Based on the combination of Nanopore and Illumina sequencing technologies, we present here the complete genome assembly for an isolate of E. necator, NAFU1, identified in China. We acquired a total of 15.93 Gb of raw reads. These reads were processed into a 61.12-Mb genome assembly containing 73 contigs with an N 50 of 2.06 Mb and a maximum length of 6.05 Mb. Combining the results of three gene-prediction modules (i.e., an evidence-based gene modeler [EVidenceModeler], an ab initio gene modeler, and a homology-based gene modeler), we predicted 7,235 protein-coding genes in the assembled genome of E. necator NAFU1. This information will facilitate studies of genome evolution and pathogenicity mechanisms of E. necator and other powdery mildew species through comparative genome sequence analysis and other molecular genetic tools. [Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license .
more »
« less
A reference genome for the long-term kleptoplast-retaining sea slug Elysia crispata morphotype clarki
Abstract Several species of sacoglossan sea slugs possess the incredible ability to sequester chloroplasts from the algae they consume. These “photosynthetic animals” incorporate stolen chloroplasts, called kleptoplasts, into the epithelial cells of tubules that extend from their digestive tracts throughout their bodies. The mechanism by which these slugs maintain functioning kleptoplasts in the absence of an algal nuclear genome is unknown. Here, we report a draft genome of the sacoglossan slug Elysia crispata morphotype clarki, a morphotype native to the Florida Keys that can retain photosynthetically active kleptoplasts for several months without feeding. We used a combination of Oxford Nanopore Technologies long reads and Illumina short reads to produce a 786-Mb assembly (N50 = 0.459 Mb) containing 68,514 predicted protein-coding genes. A phylogenetic analysis found no evidence of horizontal acquisition of genes from algae. We performed gene family and gene expression analyses to identify E. crispata genes unique to kleptoplast-containing slugs that were more highly expressed in fed versus unfed developmental life stages. Consistent with analyses in other kleptoplastic slugs, our investigation suggests that genes encoding lectin carbohydrate-binding proteins and those involved in regulation of reactive oxygen species and immunity may play a role in kleptoplast retention. Lastly, we identified four polyketide synthase genes that could potentially encode proteins producing UV- and oxidation-blocking compounds in slug cell membranes. The genome of E. crispata is a quality resource that provides potential targets for functional analyses and enables further investigation into the evolution and mechanisms of kleptoplasty in animals.
more »
« less
- Award ID(s):
- 1831493
- PAR ID:
- 10535970
- Editor(s):
- Mallarino, R
- Publisher / Repository:
- G3
- Date Published:
- Journal Name:
- G3: Genes, Genomes, Genetics
- Volume:
- 13
- Issue:
- 12
- ISSN:
- 2160-1836
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract BackgroundDe novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species. FindingsUsing Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements. ConclusionsKnowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.more » « less
-
Abstract We present the first long-read de novo assembly and annotation of the luna moth (Actias luna) and provide the full characterization of heavy chain fibroin (h-fibroin), a long and highly repetitive gene (>20 kb) essential in silk fiber production. There are >160,000 described species of moths and butterflies (Lepidoptera), but only within the last 5 years have we begun to recover high-quality annotated whole genomes across the order that capture h-fibroin. Using PacBio HiFi reads, we produce the first high-quality long-read reference genome for this species. The assembled genome has a length of 532 Mb, a contig N50 of 16.8 Mb, an L50 of 14 contigs, and 99.4% completeness (BUSCO). Our annotation using Bombyx mori protein and A. luna RNAseq evidence captured a total of 20,866 genes at 98.9% completeness with 10,267 functionally annotated proteins and a full-length h-fibroin annotation of 2,679 amino acid residues.more » « less
-
Jiang, Yu (Ed.)Abstract Copepods are among the most abundant organisms on the planet and play critical functions in aquatic ecosystems. Among copepods, populations of the Eurytemora affinis species complex are numerically dominant in many coastal habitats and serve as food sources for major fisheries. Intriguingly, certain populations possess the unusual capacity to invade novel salinities on rapid time scales. Despite their ecological importance, high-quality genomic resources have been absent for calanoid copepods, limiting our ability to comprehensively dissect the genome architecture underlying the highly invasive and adaptive capacity of certain populations. Here, we present the first chromosome-level genome of a calanoid copepod, from the Atlantic clade (Eurytemora carolleeae) of the E. affinis species complex. This genome was assembled using high-coverage PacBio long-read and Hi-C sequences of an inbred line, generated through 30 generations of full-sib mating. This genome, consisting of 529.3 Mb (contig N50 = 4.2 Mb, scaffold N50 = 140.6 Mb), was anchored onto four chromosomes. Genome annotation predicted 20,262 protein-coding genes, of which ion transport-related gene families were substantially expanded based on comparative analyses of 12 additional arthropod genomes. Also, we found genome-wide signatures of historical gene body methylation of the ion transport-related genes and the significant clustering of these genes on each chromosome. This genome represents one of the most contiguous copepod genomes to date and is among the highest quality marine invertebrate genomes. As such, this genome provides an invaluable resource to help yield fundamental insights into the ability of this copepod to adapt to rapidly changing environments.more » « less
-
Abstract BackgroundThe increasing number of chromosome-level genome assemblies has advanced our knowledge and understanding of macroevolutionary processes. Here, we introduce the genome of the desert horned lizard, Phrynosoma platyrhinos, an iguanid lizard occupying extreme desert conditions of the American southwest. We conduct analysis of the chromosomal structure and composition of this species and compare these features across genomes of 12 other reptiles (5 species of lizards, 3 snakes, 3 turtles, and 1 bird). FindingsThe desert horned lizard genome was sequenced using Illumina paired-end reads and assembled and scaffolded using Dovetail Genomics Hi-C and Chicago long-range contact data. The resulting genome assembly has a total length of 1,901.85 Mb, scaffold N50 length of 273.213 Mb, and includes 5,294 scaffolds. The chromosome-level assembly is composed of 6 macrochromosomes and 11 microchromosomes. A total of 20,764 genes were annotated in the assembly. GC content and gene density are higher for microchromosomes than macrochromosomes, while repeat element distributions show the opposite trend. Pathway analyses provide preliminary evidence that microchromosome and macrochromosome gene content are functionally distinct. Synteny analysis indicates that large microchromosome blocks are conserved among closely related species, whereas macrochromosomes show evidence of frequent fusion and fission events among reptiles, even between closely related species. ConclusionsOur results demonstrate dynamic karyotypic evolution across Reptilia, with frequent inferred splits, fusions, and rearrangements that have resulted in shuffling of chromosomal blocks between macrochromosomes and microchromosomes. Our analyses also provide new evidence for distinct gene content and chromosomal structure between microchromosomes and macrochromosomes within reptiles.more » « less
An official website of the United States government

