
Title: Chromosome-Level Genome Assembly for the Angiosperm Silene conica
Abstract The angiosperm genus Silene has been the subject of extensive study in the field of ecology and evolution, but the availability of high-quality reference genome sequences has been limited for this group. Here, we report a chromosome-level assembly for the genome of Silene conica based on Pacific Biosciences HiFi, Hi-C, and Bionano technologies. The assembly produced 10 scaffolds (1 per chromosome) with a total length of 862 Mb and only ∼1% gap content. These results confirm previous observations that S. conica and its relatives have a reduced base chromosome number relative to the genus's ancestral state of 12. Silene conica has an exceptionally large mitochondrial genome (>11 Mb), predominantly consisting of sequence of unknown origin. Analysis of shared sequence content suggests that transfer of nuclear DNA is unlikely to be the primary driver of this mitochondrial genome expansion. More generally, this assembly should provide a valuable resource for future genomic studies in Silene, including comparative analyses with related species that recently evolved sex chromosomes.
Award ID(s):
2048407
PAR ID:
10479097
Editor(s):
Castric, Vincent
Publisher / Repository:
Oxford University Press
Journal Name:
Genome Biology and Evolution
Volume:
15
Issue:
11
ISSN:
1759-6653
Sponsoring Org:
National Science Foundation
More Like this
  1. Mank, Judith (Ed.)
    Abstract Urosaurus nigricaudus is a phrynosomatid lizard endemic to the Baja California Peninsula in Mexico. This work presents a chromosome-level genome assembly and annotation from a male individual. We used PacBio long reads and HiRise scaffolding to generate a high-quality genomic assembly of 1.87 Gb distributed in 327 scaffolds, with an N50 of 279 Mb and an L50 of 3. Approximately 98.4% of the genome is contained in 14 scaffolds, with 6 large scaffolds (334–127 Mb) representing macrochromosomes and 8 small scaffolds (63–22 Mb) representing microchromosomes. Using standard gene modeling and transcriptomic data, we predicted 17,902 protein-coding genes on the genome. The repeat content is characterized by a large proportion of long interspersed nuclear elements that are relatively old. Synteny analysis revealed some microchromosomes with high repeat content are more prone to rearrangements but that both macro- and microchromosomes are well conserved across reptiles. We identified scaffold 14 as the X chromosome. This microchromosome presents perfect dosage compensation where the single X of males has the same expression levels as two X chromosomes in females. Finally, we estimated the effective population size for U. nigricaudus was extremely low, which may reflect a reduction in polymorphism related to it becoming a peninsular endemic. 
  2. Pyhäjärvi, T (Ed.)
    Abstract Blackberries (Rubus spp.) are the fourth most economically important berry crop worldwide. Genome assemblies and annotations have been developed for Rubus species in subgenus Idaeobatus, including black raspberry (R. occidentalis), red raspberry (R. idaeus), and R. chingii, but very few genomic resources exist for blackberries and their relatives in subgenus Rubus. Here we present a chromosome-length assembly and annotation of the diploid blackberry germplasm accession “Hillquist” (R. argutus). “Hillquist” is the only known source of primocane-fruiting (annual-fruiting) in tetraploid fresh-market blackberry breeding programs and is represented in the pedigree of many important cultivars worldwide. The “Hillquist” assembly, generated using Pacific Biosciences long reads scaffolded with high-throughput chromosome conformation capture sequencing, consisted of 298 Mb, of which 270 Mb (90%) was placed on 7 chromosome-length scaffolds with an average length of 38.6 Mb. Approximately 52.8% of the genome was composed of repetitive elements. The genome sequence was highly collinear with a novel maternal haplotype-resolved linkage map of the tetraploid blackberry selection A-2551TN and genome assemblies of R. chingii and red raspberry. A total of 38,503 protein-coding genes were predicted, of which 72% were functionally annotated. Eighteen flowering gene homologs within a previously mapped locus aligning to an 11.2 Mb region on chromosome Ra02 were identified as potential candidate genes for primocane-fruiting. The utility of the “Hillquist” genome has been demonstrated here by the development of the first genotyping-by-sequencing-based linkage map of tetraploid blackberry and the identification of possible candidate genes for primocane-fruiting. This chromosome-length assembly will facilitate future studies in Rubus biology, genetics, and genomics and strengthen applied breeding programs. 
  3. Abstract The common bed bug, Cimex lectularius, is a globally distributed pest insect of medical, veterinary, and economic importance. Previous reference genome assemblies for this species were generated from short-read sequencing data, resulting in a ~650 Mb assembly composed of thousands of contigs. Here, we present a haplotype-resolved, chromosome-level reference genome, generated from an adult Harlen strain female specimen. Using PacBio long-read and Omni-C proximity sequencing, we generated a 540 Mb genome with 15 chromosomes (13 autosomes and 2 sex chromosomes - X1X2) with an N50 > 30 Mb and BUSCO > 90%. Previous karyotyping efforts indicate an XY sex chromosome system, with 2n=26 and X1X1X2X2 females and X1X2Y males; however, significant fragmentation of the X chromosome has also been reported. We further use whole-genome resequencing data from males and females to identify the X1 and X2 chromosomes based on sex biases in coverage. This highly contiguous reference genome assembly provides a much-improved resource for identifying chromosomal genome architecture, and for interpreting patterns of urban outbreaks and signatures of selection linked to insecticide resistance.
  4. The blue crab, Callinectes sapidus (Rathbun, 1896), is an economically, culturally, and ecologically important species found across the temperate and tropical North and South American Atlantic coast. A reference genome will enable research for this high-value species. Initial assembly combined 200× coverage Illumina paired-end reads, a 60× 8 kb mate-paired library, and 50× PacBio data using the MaSuRCA assembler, resulting in a 985 Mb assembly with a scaffold N50 of 153 kb. Dovetail Chicago and HiC sequencing with the 3d DNA assembler and Juicebox assembly tools were then used for chromosome scaffolding. The 50 largest scaffolds span 810 Mb, are 1.5–37 Mb long, and have a repeat content of 36%. The 190 Mb unplaced sequence is in 3921 sequences over 10 kb with a repeat content of 68%. The final assembly N50 is 18.9 Mb for scaffolds and 9317 bases for contigs. Of arthropod BUSCO genes, ∼88% (888/1013) were complete and single-copy. Using 309 million RNAseq read pairs from 12 different tissues and developmental stages, 25,249 protein-coding genes were predicted. Between C. sapidus and Portunus trituberculatus genomes, 41 of 50 large scaffolds had high nucleotide identity and protein-coding synteny, but 9 scaffolds in both assemblies were not clear matches. The protein-coding genes included 9423 one-to-one putative orthologs, of which 7165 were syntenic between the two crab species. Overall, the two crab genome assemblies show strong similarities at the nucleotide, protein, and chromosome level and verify the blue crab genome as an excellent reference for this important seafood species.
  5. Sekelsky, J (Ed.)
    Abstract Although plant mitochondrial genomes typically show low rates of sequence evolution, levels of divergence in certain angiosperm lineages suggest anomalously high mitochondrial mutation rates. However, de novo mutations have never been directly analyzed in such lineages. Recent advances in high-fidelity DNA sequencing technologies have enabled detection of mitochondrial mutations when still present at low heteroplasmic frequencies. To date, these approaches have only been performed on a single plant species (Arabidopsis thaliana). Here, we apply a high-fidelity technique (Duplex Sequencing) to multiple angiosperms from the genus Silene, which exhibits extreme heterogeneity in rates of mitochondrial sequence evolution among close relatives. Consistent with phylogenetic evidence, we found that Silene latifolia maintains low mitochondrial variant frequencies that are comparable with previous measurements in Arabidopsis. Silene noctiflora also exhibited low variant frequencies despite high levels of historical sequence divergence, which supports other lines of evidence that this species has reverted to lower mitochondrial mutation rates after a past episode of acceleration. In contrast, S. conica showed much higher variant frequencies in mitochondrial (but not in plastid) DNA, consistent with an ongoing bout of elevated mitochondrial mutation rates. Moreover, we found an altered mutational spectrum in S. conica heavily biased towards AT→GC transitions. We also observed an unusually low number of mitochondrial genome copies per cell in S. conica, potentially pointing to reduced opportunities for homologous recombination to accurately repair mismatches in this species. Overall, these results suggest that historical fluctuations in mutation rates are driving extreme variation in rates of plant mitochondrial sequence evolution. 