skip to main content


Title: Population differentiation and structural variation in the Manduca sexta genome across the United States
Abstract Many species that are extensively studied in the laboratory are less well characterized in their natural habitat, and laboratory strains represent only a small fraction of the variation in a species’ genome. Here we investigate genomic variation in 3 natural North American populations of an agricultural pest and a model insect for many scientific disciplines, the tobacco hornworm (Manduca sexta). We show that hornworms from Arizona, Kansas, and North Carolina are genetically distinct, with Arizona being particularly differentiated from the other 2 populations using Illumina whole-genome resequencing. Peaks of differentiation exist across the genome, but here, we focus in on the most striking regions. In particular, we identify 2 likely segregating inversions found in the Arizona population. One inversion on the Z chromosome may enhance adaptive evolution of the sex chromosome. The larger, 8 Mb inversion on chromosome 12 contains a pseudogene which may be involved in the exploitation of a novel hostplant in Arizona, but functional genetic assays will be required to support this hypothesis. Nevertheless, our results reveal undiscovered natural variation and provide useful genomic data for both pest management and evolutionary genetics of this insect species.  more » « less
Award ID(s):
1920895
NSF-PAR ID:
10330467
Author(s) / Creator(s):
;
Editor(s):
Whitehead, A
Date Published:
Journal Name:
G3 Genes|Genomes|Genetics
Volume:
12
Issue:
5
ISSN:
2160-1836
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Johnson, Karyn N. (Ed.)
    ABSTRACT A pervasive pest of stored leguminous products, the bean beetle Callosobruchus maculatus (Coleoptera: Chrysomelidae) associates with a simple bacterial community during adulthood. Despite its economic importance, little is known about the compositional stability, heritability, localization, and metabolic potential of the bacterial symbionts of C. maculatus . In this study, we applied community profiling using 16S rRNA gene sequencing to reveal a highly conserved bacterial assembly shared between larvae and adults. Dominated by Firmicutes and Proteobacteria , this community is localized extracellularly along the epithelial lining of the bean beetle’s digestive tract. Our analysis revealed that only one species, Staphylococcus gallinarum (phylum Firmicutes ), is shared across all developmental stages. Isolation and whole-genome sequencing of S. gallinarum from the beetle gut yielded a circular chromosome (2.8 Mb) and one plasmid (45 kb). The strain encodes complete biosynthetic pathways for the production of B vitamins and amino acids, including tyrosine, which is increasingly recognized as an important symbiont-supplemented precursor for cuticle biosynthesis in beetles. A carbohydrate-active enzyme search revealed that the genome codes for a number of digestive enzymes, reflecting the nutritional ecology of C. maculatus . The ontogenic conservation of the gut microbiota in the bean beetle, featuring a “core” community composed of S. gallinarum , may be indicative of an adaptive role for the host. In clarifying symbiont localization and metabolic potential, we further our understanding and study of a costly pest of stored products. IMPORTANCE From supplementing essential nutrients to detoxifying plant secondary metabolites and insecticides, bacterial symbionts are a key source of adaptations for herbivorous insect pests. Despite the pervasiveness and geographical range of the bean beetle Callosobruchus maculatus , the role of microbial symbioses in its natural history remains understudied. Here, we demonstrate that the bean beetle harbors a simple gut bacterial community that is stable throughout development. This community localizes along the insect’s digestive tract and is largely dominated by Staphylococcus gallinarum . In elucidating symbiont metabolic potential, we highlight its possible adaptive significance for a widespread agricultural pest. 
    more » « less
  2. Slotte, Tanja (Ed.)
    Abstract Euphorbia peplus (petty spurge) is a small, fast-growing plant that is native to Eurasia and has become a naturalized weed in North America and Australia. E. peplus is not only medicinally valuable, serving as a source for the skin cancer drug ingenol mebutate, but also has great potential as a model for latex production owing to its small size, ease of manipulation in the laboratory, and rapid reproductive cycle. To help establish E. peplus as a new model, we generated a 267.2 Mb Hi-C-anchored PacBio HiFi nuclear genome assembly with an BUSCO score of 98.5%, a genome annotation based on RNA-seq data from six organs, and publicly accessible tools including a genome browser and an interactive organ-specific expression atlas. Chromosome number is highly variable across Euphorbia species. Using a comparative analysis of our newly sequenced E. peplus genome with other Euphorbiaceae genomes, we show that variation in Euphorbia chromosome number between E. peplus and E. lathyris is likely due to fragmentation and rearrangement rather than chromosomal duplication followed by diploidization of the duplicated sequence. Moreover, we found that the E. peplus genome is relatively compact compared to related members of the genus in part due to restricted expansion of the Ty3 transposon family. Finally, we identify a large gene cluster that contains many previously identified enzymes in the putative ingenol mebutate biosynthesis pathway, along with additional gene candidates for this biosynthetic pathway. The genomic resources we have created for E. peplus will help advance research on latex production and ingenol mebutate biosynthesis in the commercially important Euphorbiaceae family. 
    more » « less
  3. Almost all regulation of gene expression in eukaryotic genomes is mediated by the action of distant non-coding transcriptional enhancers upon proximal gene promoters. Enhancer locations cannot be accurately predicted bioinformatically because of the absence of a defined sequence code, and thus functional assays are required for their direct detection. Here we used a massively parallel reporter assay, Self-Transcribing Active Regulatory Region sequencing (STARR-seq), to generate the first comprehensive genome-wide map of enhancers in Anopheles coluzzii , a major African malaria vector in the Gambiae species complex. The screen was carried out by transfecting reporter libraries created from the genomic DNA of 60 wild A. coluzzii from Burkina Faso into A. coluzzii 4a3A cells, in order to functionally query enhancer activity of the natural population within the homologous cellular context. We report a catalog of 3,288 active genomic enhancers that were significant across three biological replicates, 74% of them located in intergenic and intronic regions. The STARR-seq enhancer screen is chromatin-free and thus detects inherent activity of a comprehensive catalog of enhancers that may be restricted in vivo to specific cell types or developmental stages. Testing of a validation panel of enhancer candidates using manual luciferase assays confirmed enhancer function in 26 of 28 (93%) of the candidates over a wide dynamic range of activity from two to at least 16-fold activity above baseline. The enhancers occupy only 0.7% of the genome, and display distinct composition features. The enhancer compartment is significantly enriched for 15 transcription factor binding site signatures, and displays divergence for specific dinucleotide repeats, as compared to matched non-enhancer genomic controls. The genome-wide catalog of A. coluzzii enhancers is publicly available in a simple searchable graphic format. This enhancer catalogue will be valuable in linking genetic and phenotypic variation, in identifying regulatory elements that could be employed in vector manipulation, and in better targeting of chromosome editing to minimize extraneous regulation influences on the introduced sequences. Importance: Understanding the role of the non-coding regulatory genome in complex disease phenotypes is essential, but even in well-characterized model organisms, identification of regulatory regions within the vast non-coding genome remains a challenge. We used a large-scale assay to generate a genome wide map of transcriptional enhancers. Such a catalogue for the important malaria vector, Anopheles coluzzii , will be an important research tool as the role of non-coding regulatory variation in differential susceptibility to malaria infection is explored and as a public resource for research on this important insect vector of disease. 
    more » « less
  4. Abstract Background

    Capturing the genetic diversity of wild relatives is crucial for improving crops because wild species are valuable sources of agronomic traits that are essential to enhance the sustainability and adaptability of domesticated cultivars. Genetic diversity across a genus can be captured in super-pangenomes, which provide a framework for interpreting genomic variations.

    Results

    Here we report the sequencing, assembly, and annotation of nine wild North American grape genomes, which are phased and scaffolded at chromosome scale. We generate a reference-unbiased super-pangenome using pairwise whole-genome alignment methods, revealing the extent of the genomic diversity among wild grape species from sequence to gene level. The pangenome graph captures genomic variation between haplotypes within a species and across the different species, and it accurately assesses the similarity of hybrids to their parents. The species selected to build the pangenome are a great representation of the genus, as illustrated by capturing known allelic variants in the sex-determining region and for Pierce’s disease resistance loci. Using pangenome-wide association analysis, we demonstrate the utility of the super-pangenome by effectively mapping short reads from genus-wide samples and identifying loci associated with salt tolerance in natural populations of grapes.

    Conclusions

    This study highlights how a reference-unbiased super-pangenome can reveal the genetic basis of adaptive traits from wild relatives and accelerate crop breeding research.

     
    more » « less
  5. Abstract

    Genetic diversity becomes structured among populations over time due to genetic drift and divergent selection. Although population structure is often treated as a uniform underlying factor, recent resequencing studies of wild populations have demonstrated that diversity in many regions of the genome may be structured quite dissimilar to the genome‐wide pattern. Here, we explored the adaptive and nonadaptive causes of such genomic heterogeneity using population‐level, whole genome resequencing data obtained from annualMimulus guttatusindividuals collected across a rugged environment landscape. We found substantial variation in how genetic differentiation is structured both within and between chromosomes, although, in contrast to other studies, known inversion polymorphisms appear to serve only minor roles in this heterogeneity. In addition, much of the genome can be clustered into eight among‐population genetic differentiation patterns, but only two of these clusters are particularly consistent with patterns of isolation by distance. By performing genotype‐environment association analysis, we also identified genomic intervals where local adaptation to specific climate factors has accentuated genetic differentiation among populations, and candidate genes in these windows indicate climate adaptation may proceed through changes affecting specialized metabolism, drought resistance, and development. Finally, by integrating our findings with previous studies, we show that multiple aspects of plant reproductive biology may be common targets of balancing selection and that variants historically involved in climate adaptation among populations have probably also fuelled rapid adaptation to microgeographic environmental variation within sites.

     
    more » « less