skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: De Novo Long-Read Genome Assembly and Annotation of the Luna Moth ( Actias luna ) Fully Resolves Repeat-Rich Silk Genes
Abstract We present the first long-read de novo assembly and annotation of the luna moth (Actias luna) and provide the full characterization of heavy chain fibroin (h-fibroin), a long and highly repetitive gene (>20 kb) essential in silk fiber production. There are >160,000 described species of moths and butterflies (Lepidoptera), but only within the last 5 years have we begun to recover high-quality annotated whole genomes across the order that capture h-fibroin. Using PacBio HiFi reads, we produce the first high-quality long-read reference genome for this species. The assembled genome has a length of 532 Mb, a contig N50 of 16.8 Mb, an L50 of 14 contigs, and 99.4% completeness (BUSCO). Our annotation using Bombyx mori protein and A. luna RNAseq evidence captured a total of 20,866 genes at 98.9% completeness with 10,267 functionally annotated proteins and a full-length h-fibroin annotation of 2,679 amino acid residues.  more » « less
Award ID(s):
2217159 2217155
PAR ID:
10525004
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
16
Issue:
7
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Lavrov, Dennis (Ed.)
    Abstract The painted lady butterfly, Vanessa cardui, has the longest migration routes, the widest hostplant diversity, and one of the most complex wing patterns of any insect. Due to minimal culturing requirements, easily characterized wing pattern elements, and technical feasibility of CRISPR/Cas9 genome editing, V. cardui is emerging as a functional genomics model for diverse research programs. Here, we report a high-quality, annotated genome assembly of the V. cardui genome, generated using 84× coverage of PacBio long-read data, which we assembled into 205 contigs with a total length of 425.4 Mb (N50 = 10.3 Mb). The genome was very complete (single-copy complete Benchmarking Universal Single-Copy Orthologs [BUSCO] 97%), with contigs assembled into presumptive chromosomes using synteny analyses. Our annotation used embryonic, larval, and pupal transcriptomes, and 20 transcriptomes across five different wing developmental stages. Gene annotations showed a high level of accuracy and completeness, with 14,437 predicted protein-coding genes. This annotated genome assembly constitutes an important resource for diverse functional genomic studies ranging from the developmental genetic basis of butterfly color pattern, to coevolution with diverse hostplants. 
    more » « less
  2. Eyre-Walker, Adam (Ed.)
    The coppery titi monkey (Plecturocebus cupreus) is an emerging nonhuman primate model system for behavioral and neurobiological research. At the same time, the almost entire absence of genomic resources for the species has hampered insights into the genetic underpinnings of the phenotypic traits of interest. To facilitate future genotype-to-phenotype studies, we here present a high-quality, fully annotated de novo genome assembly for the species with chromosome-length scaffolds spanning the autosomes and chromosome X (scaffold N50 = 130.8 Mb), constructed using data obtained from several orthologous short- and long-read sequencing and scaffolding techniques. With a base-level accuracy of ∼99.99% in chromosome-length scaffolds as well as benchmarking universal single-copy ortholog and k-mer completeness scores of >99.0% and 95.1% at the genome level, this assembly represents one of the most complete Pitheciidae genomes to date, making it an invaluable resource for comparative evolutionary genomics research to improve our understanding of lineage-specific changes underlying adaptive traits as well as deleterious mutations associated with disease. 
    more » « less
  3. The parasitoid wasp Muscidifurax raptorellus (Hymenoptera: Pteromalidae) is a gregarious species that has received extensive attention for its potential in biological pest control against house fly, stable fly, and other filth flies. It has a high reproductive capacity and can be reared easily. However, genome assembly is not available for M. raptorellus or any other species in this genus. Previously, we assembled a complete circular mitochondrial genome with a length of 24,717 bp. Here, we assembled and annotated a high-quality nuclear genome of M. raptorellus , using a combination of long-read (104× genome coverage) and short-read (326× genome coverage) sequencing technologies. The assembled genome size is 314 Mbp in 226 contigs, with a 97.9% BUSCO completeness score and a contig N50 of 4.67 Mb, suggesting excellent continuity of this assembly. Our assembly builds the foundation for comparative and evolutionary genomic analysis in the genus of Muscidifurax and possible future biocontrol applications. 
    more » « less
  4. Vogel, K (Ed.)
    Abstract We present the first chromosome-level genome assembly for Bombus pensylvanicus, a historically widespread native pollinator species that was distributed across eastern North America but has subsequently undergone declines in range area and local relative abundance. This species has been of significant interest as a model for understanding both patterns and possible causes of bumble bee decline in the region, including the role of genetic variation. Here we present a chromosome-level reference genome assembled using Pacific Biosciences singe-molecule HiFi sequences and Hi-C data and annotated using evidence derived from RNA sequencing of multiple tissue types. The B. pensylvanicus genome has a total length of ∼352.6 Mb and was assembled into a total of 224 scaffolds, with 19 primary pseudomolecules representing putative chromosomes and an N50 = 14.872 Mb. Annotation with the Eukaryotic Genome Annotation Pipeline—External (EGAPx) identified 11,411 genes (10,263 protein coding), and BUSCO analysis of 5,991 Hymenoptera-specific BUSCO groups indicated a completeness for the proteins of 99.0% (98.6% single-copy, 0.5% duplicated) and for the genome of 98.5% (98.2% single-copy, 0.3% duplicated). We present synteny analyses with other recently assembled Bombus genomes representing different subgenera and examine the distribution of repetitive regions of the genome relative to the distribution of genes and noncoding RNAs. 
    more » « less
  5. Wheat, Christopher (Ed.)
    Abstract We present a chromosome-length genome assembly and annotation of the Black Petaltail dragonfly (Tanypteryx hageni). This habitat specialist diverged from its sister species over 70 million years ago, and separated from the most closely related Odonata with a reference genome 150 million years ago. Using PacBio HiFi reads and Hi-C data for scaffolding we produce one of the most high-quality Odonata genomes to date. A scaffold N50 of 206.6 Mb and a single copy BUSCO score of 96.2% indicate high contiguity and completeness. 
    more » « less