Sequencing high molecular weight (HMW) DNA with long-read and linked-read technologies has promoted a major increase in more complete genome sequences for nonmodel organisms. Sequencing approaches that rely on HMW DNA have been limited to larger organisms or pools of multiple individuals, but recent advances have allowed for sequencing from individuals of small-bodied organisms. Here, we use HMW DNA sequencing with PacBio long reads and TELL-Seq linked reads to assemble and annotate the genome from a single individual feather louse (Brueelia nebulosa) from a European Starling (Sturnus vulgaris). We assembled a genome with a relatively high scaffold N50 (637 kb) and with BUSCO scores (96.1%) comparable to louse genomes assembled from pooled individuals. We annotated a number of genes (10,938) similar to the human louse (Pediculus humanus) genome. Additionally, calling phased variants revealed that the Brueelia genome is more heterozygous (∼1%) then expected for a highly obligate and dispersal-limited parasite. We also assembled and annotated the mitochondrial genome and primary endosymbiont (Sodalis) genome from the individual louse, which showed evidence for heteroplasmy in the mitogenome and a reduced genome size in the endosymbiont compared to its free-living relative. Our study is a valuable demonstration of the capability to obtain high-quality genomes from individual small, nonmodel organisms. Applying this approach to other organisms could greatly increase our understanding of the diversity and evolution of individual genomes.
- Award ID(s):
- 1829176
- NSF-PAR ID:
- 10308766
- Date Published:
- Journal Name:
- BMC Genomics
- Volume:
- 22
- Issue:
- 1
- ISSN:
- 1471-2164
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
The genus Trifolium is the largest of the tribe Trifolieae in the subfamily Papilionoideae (Fabaceae). The paucity of mitochondrial genome (mitogenome) sequences has hindered comparative analyses among the three genomic compartments of the plant cell (nucleus, mitochondrion and plastid). We assembled four mitogenomes from the two subgenera (Chronosemium and Trifolium) of the genus. The four Trifolium mitogenomes were compact (294,911–348,724 bp in length) and contained limited repetitive (6.6–8.6%) DNA. Comparison of organelle repeat content highlighted the distinct evolutionary trajectory of plastid genomes in a subset of Trifolium species. Intracellular gene transfer (IGT) was analyzed among the three genomic compartments revealing functional transfer of mitochondrial rps1 to nuclear genome along with other IGT events. Phylogenetic analysis based on mitochondrial and nuclear rps1 sequences revealed that the functional transfer in Trifolieae was independent from the event that occurred in robinioid clade that includes genus Lotus. A novel, independent fission event of ccmFn in Trifolium was identified, caused by a 59 bp deletion. Fissions of this gene reported previously in land plants were reassessed and compared with Trifolium.more » « less
-
Slotte, Tanja (Ed.)Abstract Intracellular transfers of mitochondrial DNA continue to shape nuclear genomes. Chromosome 2 of the model plant Arabidopsis thaliana contains one of the largest known nuclear insertions of mitochondrial DNA (numts). Estimated at over 600 kb in size, this numt is larger than the entire Arabidopsis mitochondrial genome. The primary Arabidopsis nuclear reference genome contains less than half of the numt because of its structural complexity and repetitiveness. Recent data sets generated with improved long-read sequencing technologies (PacBio HiFi) provide an opportunity to finally determine the accurate sequence and structure of this numt. We performed a de novo assembly using sequencing data from recent initiatives to span the Arabidopsis centromeres, producing a gap-free sequence of the Chromosome 2 numt, which is 641 kb in length and has 99.933% nucleotide sequence identity with the actual mitochondrial genome. The numt assembly is consistent with the repetitive structure previously predicted from fiber-based fluorescent in situ hybridization. Nanopore sequencing data indicate that the numt has high levels of cytosine methylation, helping to explain its biased spectrum of nucleotide sequence divergence and supporting previous inferences that it is transcriptionally inactive. The original numt insertion appears to have involved multiple mitochondrial DNA copies with alternative structures that subsequently underwent an additional duplication event within the nuclear genome. This work provides insights into numt evolution, addresses one of the last unresolved regions of the Arabidopsis reference genome, and represents a resource for distinguishing between highly similar numt and mitochondrial sequences in studies of transcription, epigenetic modifications, and de novo mutations.more » « less
-
Plastid genomes (plastomes) vary enormously in size and gene content among the many lineages of nonphotosynthetic plants, but key lineages remain unexplored. We therefore investigated plastome sequence and expression in the holoparasitic and morphologically bizarre Balanophoraceae. The two
Balanophora plastomes examined are remarkable, exhibiting features rarely if ever seen before in plastomes or in any other genomes. At 15.5 kb in size and with only 19 genes, they are among the most reduced plastomes known. They have no tRNA genes for protein synthesis, a trait found in only three other plastid lineages, and thusBalanophora plastids must import all tRNAs needed for translation.Balanophora plastomes are exceptionally compact, with numerous overlapping genes, highly reduced spacers, loss of allcis -spliced introns, and shrunken protein genes. With A+T contents of 87.8% and 88.4%, theBalanophora genomes are the most AT-rich genomes known save for a single mitochondrial genome that is merely bloated with AT-rich spacer DNA. Most plastid protein genes inBalanophora consist of ≥90% AT, with several between 95% and 98% AT, resulting in the most biased codon usage in any genome described to date. A potential consequence of its radical compositional evolution is the novel genetic code used byBalanophora plastids, in which TAG has been reassigned from stop to tryptophan. Despite its many exceptional properties, theBalanophora plastome must be functional because all examined genes are transcribed, its only intron is correctlytrans -spliced, and its protein genes, although highly divergent, are evolving under various degrees of selective constraint. -
Abstract The angiosperm genus Silene has been the subject of extensive study in the field of ecology and evolution, but the availability of high-quality reference genome sequences has been limited for this group. Here, we report a chromosome-level assembly for the genome of Silene conica based on Pacific Bioscience HiFi, Hi-C, and Bionano technologies. The assembly produced 10 scaffolds (1 per chromosome) with a total length of 862 Mb and only ∼1% gap content. These results confirm previous observations that S. conica and its relatives have a reduced base chromosome number relative to the genus's ancestral state of 12. Silene conica has an exceptionally large mitochondrial genome (>11 Mb), predominantly consisting of sequence of unknown origins. Analysis of shared sequence content suggests that it is unlikely that transfer of nuclear DNA is the primary driver of this mitochondrial genome expansion. More generally, this assembly should provide a valuable resource for future genomic studies in Silene, including comparative analyses with related species that recently evolved sex chromosomes.