skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Complete genome sequence of the Gordonia bacteriophage BiggityBass
Here, we characterized the complete genome of the Siphoviridae BiggityBass, a lytic subcluster DR bacteriophage infecting Gordonia terrae CAG3. Its 63.2-kb genome contains 84 protein-coding genes, of which 40 could be assigned a putative function. BiggityBass is related most closely to AnClar and Yago84 with 90.61% and 90.52% nucleotide identity, respectively.  more » « less
Award ID(s):
2045343
PAR ID:
10390246
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more » ; ; « less
Editor(s):
Dennehy, John J.
Publisher / Repository:
ASM Journals
Date Published:
Journal Name:
Microbiology Resource Announcements
Volume:
11
Issue:
9
ISSN:
2576-098X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT Here, we present the genome sequence of a member of the group I Acidobacteria , Acidipila sp. strain EB88, which was isolated from temperate forest soil. Like many other members of its class, its genome contains evidence of the potential to utilize a broad range of sugars. 
    more » « less
  2. Dennehy, John J (Ed.)
    Amabiko is a lytic subcluster BE2 bacteriophage that infects Streptomyces scabiei — a bacterium causing common scab in potatoes. Its 131,414 bp genome has a GC content of 49.5% and contains 245 putative protein-coding genes, 45 tRNAs, and one tmRNA. Amabiko is closely related to Streptomyces bacteriophage MindFlayer (gene content similarity: 86.5%). 
    more » « less
  3. CitationSnead, A.A., Meng, F., Largotta, N. et al. Diploid chromosome-level genome assembly and annotation for Lycorma delicatula. Sci Data 12, 579 (2025). https://doi.org/10.1038/s41597-025-04854-8AbstractThe spotted lanternfly (Lycorma delicatula) is a planthopper species (Hemiptera: Fulgoridae) native to China but invasive in South Korea, Japan, and the United States where it is a significant threat to agriculture. Hence, genomic resources are critical to both management and understand the genomic characteristics of successful invaders. Here, we report a haplotype-phased genome assembly and annotation using PacBio long-read sequencing, Hi-C technology, and RNA-seq data. The 2.2 Gbp genome comprises 13 chromosomes, and our whole genome sequencing of eighty-two adults indicated chromosome four as the sex chromosome and anXO sex-determination system.We identified over 12,000 protein coding genes and performed functional annotation, facilitating identification of several candidate genes which may hold importance for spotted lanternfly control. Both the assemblies and annotations were highly complete with over 96% of BUSCO genes complete regardless of the database employed (i.e., Eukaryota, Arthropoda, Insecta). This reference-quality genome will serve as an important resource for both development and optimization of management practices for the spotted lanternfly and invasive genomics as a whole.Description of the data and file structureThis dataset contains the haplotype-phased chromosome-level genome assembly of the spotted lanternfly (Lycorma delicatula) described and published in Snead & Meng et al. (in review). The genome combined long-read data and HiC data (SRA31402152-SRA31402153) to assembly and scaffold each haplotype. The annotation uses RNAseq data from 12 adults (SRA31411873-SRA31411894) to structurally annotate both haplotypes. Finally, whole-genome sequencing of 82 adult spotted lanternfly (bioproject PRJNA1136004) described in the metadata csv provided was used to identify punitive sex chromosomes. The dataset also include GO results for each chromosome not explicitly described in the results of the manuscript.Files and variablesFile: SLF_Hap1.fastaDescription: A fasta file of the assembled genome for the cleaned 13 chromosome haplotype 1 assembly.File: SLF_Hap2.fastaDescription: A fasta file of the assembled genome for the cleaned 13 chromosome haplotype 2 assembly.File: SLF_Hap1_Repeats.gffDescription: A gff file of the repeats annotated in the cleaned 13 chromosome haplotype 1 assembly.File: SLF_Hap2_Repeats.gffDescription: A gff file of the repeats annotated in the cleaned 13 chromosome haplotype 2 assembly.File: SLF_Hap1.gffDescription: A structural annotation of the 13 chromosome haplotype 1 assembly with functional annotations.File: SLF_Hap2.gffDescription: A structural annotation of the 13 chromosome haplotype 2 assembly with functional annotations.File: GO_plot_chr_1.pngDescription: An image of the top 20 GO term results for chromosome 1.File: GO_plot_chr_2.pngDescription: An image of the top 20 GO term results for chromosome 2.File: GO_plot_chr_3.pngDescription: An image of the top 20 GO term results for chromosome 3.File: GO_plot_chr_8.pngDescription: An image of the top 20 GO term results for chromosome 8.File: GO_plot_chr_5.pngDescription: An image of the top 20 GO term results for chromosome 5.File: GO_plot_chr_4.pngDescription: An image of the top 20 GO term results for chromosome 4.File: GO_plot_chr_6.pngDescription: An image of the top 20 GO term results for chromosome 6.File: GO_plot_chr_7.pngDescription: An image of the top 20 GO term results for chromosome 7.File: GO_plot_chr_11.pngDescription: An image of the top 20 GO term results for chromosome 11.File: GO_plot_chr_9.pngDescription: An image of the top 20 GO term results for chromosome 9.File: GO_plot_chr_10.pngDescription: An image of the top 20 GO term results for chromosome 10.File: GO_plot_chr_12.pngDescription: An image of the top 20 GO term results for chromosome 12.File: GO_plot_chr_13.pngDescription: An image of the top 20 GO term results for chromosome 13.File: SLF_Samples_SRA.csvDescription: A csv with the sequencing information, SRA numbers, and sexes of the adults used in to identify the putative sex chromosome.File: SLF_RNAseq_Metadata.csvDescription: A csv with the sequencing information, SRA numbers, and other metadata for the RNAseq used to annotation the genomes.Variablesaccession: The SRA accession numberstudy: The studyobject_status: If the NCBI submission was new or not.bioproject_accession: The bioproject accession numberbiosample_accession: The Biosample accession numberlibrary_ID: The ID used to identify that genomic library.title: The title of the study (the bioproject)library_strategy: Specific sequencing technique used to prepare the library.library_source: The biological material used to create the sequencing library.library_selection: The library preparation method.library_layout: The arrangement of reads within the sequencing library.platform: The sequencing platform.instrument_model: The model of the sequences.design_description: Description of the study design.filetype: Type of filefilename: First filefilename2: Second filesex: The biological sex of the adult.Code/softwareThe initial haplotype-phased scaffolded genome was assembled by Dovetail Genomics (Cantata Bio) with standard software outlined in the methods with default settings. Scripts for the remaining work including annotation, gene ontology enrichment, and other analyses are located in the Github repository (https://github.com/anthonysnead/SLF-Genome-Assembly(opens in new window)).Access informationOther publicly accessible locations of the data:The raw sequencing data and the annotated haplotype-phased genome assembly of Lycorma delicatula have been deposited at the National Center for Biotechnology Information (NCBI). The Hi-C and HiFi data can be found under SRA31402152 and SRA31402153. The RNA-seq data can be found under SRA31411873-SRA31411894, while the DNA-seq data can be found under bioproject PRJNA1136004. 
    more » « less
  4. Eyre-Walker, Adam (Ed.)
    The coppery titi monkey (Plecturocebus cupreus) is an emerging nonhuman primate model system for behavioral and neurobiological research. At the same time, the almost entire absence of genomic resources for the species has hampered insights into the genetic underpinnings of the phenotypic traits of interest. To facilitate future genotype-to-phenotype studies, we here present a high-quality, fully annotated de novo genome assembly for the species with chromosome-length scaffolds spanning the autosomes and chromosome X (scaffold N50 = 130.8 Mb), constructed using data obtained from several orthologous short- and long-read sequencing and scaffolding techniques. With a base-level accuracy of ∼99.99% in chromosome-length scaffolds as well as benchmarking universal single-copy ortholog and k-mer completeness scores of >99.0% and 95.1% at the genome level, this assembly represents one of the most complete Pitheciidae genomes to date, making it an invaluable resource for comparative evolutionary genomics research to improve our understanding of lineage-specific changes underlying adaptive traits as well as deleterious mutations associated with disease. 
    more » « less
  5. In agriculture, various chemicals are used to control the weeds. Out of which, glyphosate is an important herbicide invariably used in the cultivation of glyphosate-resistant crops to control weeds. Overuse of glyphosate results in the evolution of glyphosate-resistant weeds. Evolution of glyphosate resistance (GR) in Amaranthus palmeri (AP) is a serious concern in the USA. Investigation of the mechanism of GR in AP identified different resistance mechanisms of which 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene amplification is predominant. Molecular analysis of GR AP identified the presence of a 5- to >160-fold increase in copies of the EPSPS gene than in a glyphosate-susceptible (GS) population. This increased copy number of the EPSPS gene increased the genome size ranging from 3.5 to 11.8%, depending on the copy number compared to the genome size of GS AP. FISH analysis using a 399-kb EPSPS cassette derived from bacterial artificial chromosomes (BACs) as probes identified that amplified EPSPS copies in GR AP exist in extrachromosomal circular DNA (eccDNA) in addition to the native copy in the chromosome. The EPSPS gene-containing eccDNA having a size of ∼400 kb is termed EPSPS-eccDNA and showed somatic mosacism in size and copy number. EPSPS-eccDNA has a genetic mechanism to tether randomly to mitotic or meiotic chromosomes during cell division or gamete formation and is inherited to daughter cells or progeny generating copy number variation. These eccDNAs are stable genetic elements that can replicate and exist independently. The genomic characterization of the EPSPS locus, along with the flanking regions, identified the presence of a complex array of repeats and mobile genetic elements. The cytogenomics approach in understanding the biology of EPSPS-eccDNA sheds light on various characteristics of EPSPS-eccDNA that favor GR in AP. 
    more » « less