Title: High-Quality Genome Assembly and Comprehensive Transcriptome of the Painted Lady Butterfly Vanessa cardui
Abstract The painted lady butterfly, Vanessa cardui, has the longest migration routes, the widest hostplant diversity, and one of the most complex wing patterns of any insect. Due to minimal culturing requirements, easily characterized wing pattern elements, and technical feasibility of CRISPR/Cas9 genome editing, V. cardui is emerging as a functional genomics model for diverse research programs. Here, we report a high-quality, annotated assembly of the V. cardui genome, generated using 84× coverage of PacBio long-read data, which we assembled into 205 contigs with a total length of 425.4 Mb (N50 = 10.3 Mb). The assembly was highly complete (single-copy complete Benchmarking Universal Single-Copy Orthologs [BUSCO] 97%), with contigs assembled into presumptive chromosomes using synteny analyses. Our annotation used embryonic, larval, and pupal transcriptomes, and 20 transcriptomes across five different wing developmental stages. Gene annotations showed a high level of accuracy and completeness, with 14,437 predicted protein-coding genes. This annotated genome assembly constitutes an important resource for diverse functional genomic studies ranging from the developmental genetic basis of butterfly color pattern to coevolution with diverse hostplants.
Award ID(s):
1753559
PAR ID:
10320353
Author(s) / Creator(s):
Editor(s):
Lavrov, Dennis
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
13
Issue:
7
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Comparisons of high-quality reference butterfly and moth genomes have been instrumental to advancing our understanding of how hybridization and natural selection drive genomic change during the origin of new species and novel traits. Here, we present a genome assembly of the Southern Dogface butterfly, Zerene cesonia (Pieridae), whose brilliant wing colorations have been implicated in developmental plasticity, hybridization, sexual selection, and speciation. We assembled 266,407,278 bp of the Z. cesonia genome, which accounts for 98.3% of the estimated 271 Mb genome size. The final haploid genome was assembled using a hybrid approach involving Chicago libraries with Hi-Rise assembly and a diploid Meraculous assembly. In the final assembly, nearly all autosomes and the Z chromosome were assembled into single scaffolds. The largest 29 scaffolds accounted for 91.4% of the genome assembly, with the remaining ∼8% distributed among another 247 scaffolds and an overall N50 of 9.2 Mb. Annotations informed by tissue-specific RNA-seq identified 16,442 protein-coding genes, which included 93.2% of the arthropod Benchmarking Universal Single-Copy Orthologs (BUSCO). Approximately 9% of the Z. cesonia genome assembly was identified as repetitive elements, with a transposable element landscape rich in helitrons. Similar to other Lepidoptera genomes, Z. cesonia showed a high conservation of chromosomal synteny. The Z. cesonia assembly provides a high-quality reference for studies of chromosomal arrangements in the Pierid family, as well as for population genomic, phylogenomic, and functional genomic studies of adaptation and speciation.
  2. Neoclytus acuminatus acuminatus, the red-headed ash borer, is a wood-boring longhorn beetle (Cerambycidae: Cerambycinae) native to North America and introduced in Eurasia and South America. Its larvae develop in dying or recently dead hardwood trees, including ecologically and economically significant species of ash, hickory, and oak. We sequenced, assembled, and annotated the genome of a female N. acuminatus and compared it to the publicly available genomes of other cerambycid species. The 508 Mb N. acuminatus genome assembly spanned 20 contigs (19 nuclear + 1 mitochondrial), with an N50 of 52.59 Mb and largest contig of 61.20 Mb. A moderately high fraction of the genome (62.63%) comprised repetitive sequences, with nearly all (99.4%) expected single-copy orthologous genes (BUSCOs) present and fully assembled. We identified 2 contigs as fragments of the N. acuminatus sex chromosome. Genome annotation identified 12,899 genes, including 109 putative horizontally transferred loci. Synteny analysis identified well-conserved blocks of collinearity between the N. acuminatus genome and other Cerambycidae. The genome contains a similar number of genes encoding putative plant cell wall degrading enzymes as other Cerambycidae. The N. acuminatus genome provides new insights into genome evolution in the family Cerambycidae, known for its rich diversity of xylophagous species, and provides a new viewpoint from which to study the evolution and genomic basis of traits such as wood-feeding and olfaction in beetles and other insects. 
  3. Fraser, Bonnie (Ed.)
    Abstract Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics. 
  4. Abstract Automeris moths are a morphologically diverse group of 135 described species whose geographic range spans from the New World temperate zone to the Neotropics. Many Automeris have elaborate hindwing eyespots that are thought to deter or disrupt the attack of potential predators, allowing the moth time to escape. The Io moth (Automeris io), known for its striking eyespots, is a well-studied species within the genus and is an emerging model system to study the evolution of deimatism. Existing research on eyespot pattern development will be augmented by genomic resources that allow experimental manipulation of this emerging model. Here, we present a high-quality PacBio HiFi genome assembly for the Io moth to aid existing research on the molecular development of eyespots and future research on other deimatic traits. This 490 Mb assembly is highly contiguous (N50 = 15.78 Mb) and complete (benchmarking universal single-copy orthologs = 98.4%). Additionally, we were able to recover orthologs of genes previously identified as being involved in wing pattern formation and movement.
  5. Abstract Rosalia funebris (RFUNE; Cerambycidae), the banded alder borer, is a longhorn beetle whose larvae feed on the wood of various economically and ecologically significant trees in western North America. Adults are short-lived and not known to consume plant material substantially. We sequenced, assembled, and annotated the RFUNE genome using HiFi and RNASeq data. We documented genome architecture and gene content, focusing on genes putatively involved in plant feeding (phytophagy). Comparisons were made to the well-studied genome of the Asian longhorned beetle (AGLAB; Anoplophora glabripennis) and other Cerambycidae. The 814 Mb RFUNE genome assembly was distributed across 42 contigs, with an N50 of 30.18 Mb. Repetitive sequences comprised 60.27% of the genome, and 99.0% of expected single-copy orthologous genes were fully assembled. We identified 12,657 genes, fewer than in the four other species studied, and 46.4% fewer than for Aromia moschata (same subfamily as RFUNE). Of the 7,258 orthogroups shared between RFUNE and AGLAB, 1,461 had more copies in AGLAB and 1,023 had more copies in RFUNE. We identified 240 genes in RFUNE that putatively arose via horizontal transfer events. The RFUNE genome encoded substantially fewer putative plant cell wall degrading enzymes than AGLAB, which may relate to the longer-lived plant-feeding adults of the latter species. The RFUNE genome provides new insights into cerambycid genome architecture and gene content and provides a new vantage point from which to study the evolution and genomic basis of phytophagy in beetles. 