Abstract BackgroundDe novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species. FindingsUsing Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements. ConclusionsKnowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees. 
                        more » 
                        « less   
                    
                            
                            A Chromosome-level Genome Assembly of the Highly Heterozygous Sea Urchin Echinometra sp. EZ Reveals Adaptation in the Regulatory Regions of Stress Response Genes
                        
                    
    
            Abstract Echinometra is the most widespread genus of sea urchin and has been the focus of a wide range of studies in ecology, speciation, and reproduction. However, available genetic data for this genus are generally limited to a few select loci. Here, we present a chromosome-level genome assembly based on 10x Genomics, PacBio, and Hi-C sequencing for Echinometra sp. EZ from the Persian/Arabian Gulf. The genome is assembled into 210 scaffolds totaling 817.8 Mb with an N50 of 39.5 Mb. From this assembly, we determined that the E. sp. EZ genome consists of 2n = 42 chromosomes. BUSCO analysis showed that 95.3% of BUSCO genes were complete. Ab initio and transcript-informed gene modeling and annotation identified 29,405 genes, including a conserved Hox cluster. E. sp. EZ can be found in high-temperature and high-salinity environments, and we therefore compared E. sp. EZ gene families and transcription factors associated with environmental stress response (“defensome”) with other echinoid species with similar high-quality genomic resources. While the number of defensome genes was broadly similar for all species, we identified strong signatures of positive selection in E. sp. EZ noncoding elements near genes involved in environmental response pathways as well as losses of transcription factors important for environmental response. These data provide key insights into the biology of E. sp. EZ as well as the diversification of Echinometra more widely and will serve as a useful tool for the community to explore questions in this taxonomic group and beyond. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 1924498
- PAR ID:
- 10444163
- Editor(s):
- O’Neill, Rachel
- Date Published:
- Journal Name:
- Genome Biology and Evolution
- Volume:
- 14
- Issue:
- 10
- ISSN:
- 1759-6653
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract Wildlife diseases, such as the sea star wasting (SSW) epizootic that outbroke in the mid-2010s, appear to be associated with acute and/or chronic abiotic environmental change; dissociating the effects of different drivers can be difficult. The sunflower sea star, Pycnopodia helianthoides, was the species most severely impacted during the SSW outbreak, which overlapped with periods of anomalous atmospheric and oceanographic conditions, and there is not yet a consensus on the cause(s). Genomic data may reveal underlying molecular signatures that implicate a subset of factors and, thus, clarify past events while also setting the scene for effective restoration efforts. To advance this goal, we used Pacific Biosciences HiFi long sequencing reads and Dovetail Omni-C proximity reads to generate a highly contiguous genome assembly that was then annotated using RNA-seq-informed gene prediction. The genome assembly is 484 Mb long, with contig N50 of 1.9 Mb, scaffold N50 of 21.8 Mb, BUSCO completeness score of 96.1%, and 22 major scaffolds consistent with prior evidence that sea star genomes comprise 22 autosomes. These statistics generally fall between those of other recently assembled chromosome-scale assemblies for two species in the distantly related asteroid genus Pisaster. These novel genomic resources for P. helianthoides will underwrite population genomic, comparative genomic, and phylogenomic analyses—as well as their integration across scales—of SSW and environmental stressors.more » « less
- 
            As many bacteria detected in Antarctic environments are neither true psychrophiles nor endemic species, their proliferation in spite of environmental extremes gives rise to genome adaptations. Janthinobacterium sp. CG23_2 is a bacterial isolate from the Cotton Glacier stream, Antarctica. To understand how Janthinobacterium sp. CG23_2 has adapted to its environment, we investigated its genomic traits in comparison to genomes of 35 published Janthinobacterium species. While we hypothesized that genome shrinkage and specialization to narrow ecological niches would be energetically favorable for dwelling in an ephemeral Antarctic stream, the genome of Janthinobacterium sp. CG23_2 was on average 1.7 ± 0.6 Mb larger and predicted 1411 ± 499 more coding sequences compared to the other Janthinobacterium spp. Putatively identified horizontal gene transfer events contributed 0.92 Mb to the genome size expansion of Janthinobacterium sp. CG23_2. Genes with high copy numbers in the species-specific accessory genome of Janthinobacterium sp. CG23_2 were associated with environmental sensing, locomotion, response and transcriptional regulation, stress response, and mobile elements—functional categories which also showed molecular adaptation to cold. Our data suggest that genome plasticity and the abundant complementary genes for sensing and responding to the extracellular environment supported the adaptation of Janthinobacterium sp. CG23_2 to this extreme environment.more » « less
- 
            Abstract Exserohilum turcicum causes northern corn leaf blight and sorghum leaf blight. While the same species cause disease in both crops, the strains are host-specific. Here, we report the sequence and de novo annotated assemblies of one sorghum- and one maize-specific E. turcicum strain. The strains were sequenced using the PacBio Sequel II system. The total genome length for both assemblies was between 44 and 45 Mb with N50 of ∼2.5 Mb. Ninety-eight percent of the Benchmarking Universal Single-Copy Orthologs (BUSCO) for both assemblies had complete status. The estimated number of genes was 11,762 and 12,029 in the sorghum- and maize-specific isolates, respectively. Funannotate, EffectorP, SignalP, and transcriptome data were used to create functional annotation of each genome. The whole-genome comparison identified ten large-scale inversions and three translocations between the maize- and sorghum-specific strains, along with homologous genes and gene duplications. RNA was sequenced from the maize- and sorghum-specific isolate 10 days post-inoculation in maize and sorghum and from axenic cultures. Gene expression data from planta and axenic growth experiments were compared for each strain. Candidate host-specificity genes were identified by combining results from whole-genome comparison, synteny analysis, gene annotations, and transcriptome data. Overall, this study identified several candidate host-specificity genes that provide insights into E. turcicum interaction with its hosts.more » « less
- 
            Erysiphe necator is an economically important biotrophic fungal pathogen responsible for powdery mildew disease on grapevine. Currently, genome sequences are available for only a few E. necator isolates from the United States. Based on the combination of Nanopore and Illumina sequencing technologies, we present here the complete genome assembly for an isolate of E. necator, NAFU1, identified in China. We acquired a total of 15.93 Gb of raw reads. These reads were processed into a 61.12-Mb genome assembly containing 73 contigs with an N 50 of 2.06 Mb and a maximum length of 6.05 Mb. Combining the results of three gene-prediction modules (i.e., an evidence-based gene modeler [EVidenceModeler], an ab initio gene modeler, and a homology-based gene modeler), we predicted 7,235 protein-coding genes in the assembled genome of E. necator NAFU1. This information will facilitate studies of genome evolution and pathogenicity mechanisms of E. necator and other powdery mildew species through comparative genome sequence analysis and other molecular genetic tools. [Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license .more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    