Next-generation sequencing (NGS) technologies have revolutionized phylogenomics by decreasing the cost and time required to generate sequence data from multiple markers or whole genomes. Further, the fragmented DNA of biological specimens collected decades ago can be sequenced with NGS, reducing the need for collecting fresh specimens. Sequence capture, also known as anchored hybrid enrichment, is a method to produce reduced representation libraries for NGS sequencing. The technique uses single-stranded oligonucleotide probes that hybridize with pre-selected regions of the genome that are sequenced via NGS, culminating in a dataset of numerous orthologous loci from multiple taxa. Phylogenetic analyses using these sequences have the potential to resolve deep and shallow phylogenetic relationships. Identifying the factors that affect sequence capture success could save time, money, and valuable specimens that might be destructively sampled despite low likelihood of sequencing success. We investigated the impacts of specimen age, preservation method, and DNA concentration on sequence capture (number of captured sequences and sequence quality) while accounting for taxonomy and extracted tissue type in a large-scale butterfly phylogenomics project. This project used two probe sets to extract 391 loci or a subset of 13 loci from over 6,000 butterfly specimens. We found that sequence capture is a resilient method capable of amplifying loci in samples of varying age (0–111 years), preservation method (alcohol, papered, pinned), and DNA concentration (0.020 ng/μl - 316 ng/ul). Regression analyses demonstrate that sequence capture is positively correlated with DNA concentration. However, sequence capture and DNA concentration are negatively correlated with sample age and preservation method. Our findings suggest that sequence capture projects should prioritize the use of alcohol-preserved samples younger than 20 years old when available. In the absence of such specimens, dried samples of any age can yield sequence data, albeit with returns that diminish with increasing age.
more »
« less
Sequencing RNA from old, dried specimens reveals past viromes and properties of long-surviving RNA
ABSTRACT Recovery of virus sequences from old samples provides an opportunity to study virus evolution and reconstruct historic virus-host interactions. Studies of old virus sequences have mainly relied on DNA or on RNA from fixed or frozen samples. The millions of specimens in natural history museums represent a potential treasure trove of old virus sequences, but it is not clear how well RNA survives in old samples. We experimentally assessed the stability of RNA in insects stored dry at room temperature over 72 weeks. Although RNA molecules grew fragmented, RNA yields remained surprisingly constant. RT-qPCR of host and virus RNA showed minimal differences between dried and frozen specimens. To assess RNA survival in much older samples we acquiredDrosophilaspecimens from North American entomological collections. We recovered sequences from known and novel viruses including several coding complete virus genomes from a fly collected in 1908. We found that the virome ofD. melanogasterhas changed little over the past century. Galbut virus, the most prevalent virus infection in contemporaryD. melanogaster, was also the most common in historic samples. Finally, we investigated the genomic and physical features of surviving RNA. RNA that survived was fragmented, chemically damaged, and preferentially double stranded or contained in ribonucleoprotein complexes. This showed that RNA - especially certain types of RNA – can survive in biological specimens over extended periods in the absence of fixation or freezing and confirms the utility of dried specimens to provide a clearer understanding of virus evolution.
more »
« less
- Award ID(s):
- 2048214
- PAR ID:
- 10569008
- Publisher / Repository:
- bioRxiv
- Date Published:
- Format(s):
- Medium: X
- Institution:
- bioRxiv
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Endogenous viral elements (EVEs) offer insight into the evolutionary histories and hosts of contemporary viruses. This study leveraged DNA metagenomics and genomics to detect and infer the host of a non-retroviral dinoflagellate-infecting +ssRNA virus (dinoRNAV) common in coral reefs. As part of the Tara Pacific Expedition, this study surveyed 269 newly sequenced cnidarians and their resident symbiotic dinoflagellates (Symbiodiniaceae), associated metabarcodes, and publicly available metagenomes, revealing 178 dinoRNAV EVEs, predominantly among hydrocoral-dinoflagellate metagenomes. Putative associations between Symbiodiniaceae and dinoRNAV EVEs were corroborated by the characterization of dinoRNAV-like sequences in 17 of 18 scaffold-scale and one chromosome-scale dinoflagellate genome assembly, flanked by characteristically cellular sequences and in proximity to retroelements, suggesting potential mechanisms of integration. EVEs were not detected in dinoflagellate-free (aposymbiotic) cnidarian genome assemblies, including stony corals, hydrocorals, jellyfish, or seawater. The pervasive nature of dinoRNAV EVEs within dinoflagellate genomes (especiallySymbiodinium), as well as their inconsistent within-genome distribution and fragmented nature, suggest ancestral or recurrent integration of this virus with variable conservation. Broadly, these findings illustrate how +ssRNA viruses may obscure their genomes as members of nested symbioses, with implications for host evolution, exaptation, and immunity in the context of reef health and disease.more » « less
-
The Costa Rican pygmy rice rat (Oligoryzomys costaricensis) is the primary reservoir of Choclo orthohantavirus (CHOV), the causal agent of hantavirus disease, pulmonary syndrome, and fever in humans in Panama. Since the emergence of CHOV in early 2000, we have systematically sampled and archived rodents from >150 sites across Panama to establish a baseline understanding of the host and virus, producing a permanent archive of holistic specimens that we are now probing in greater detail. We summarize these collections and explore preliminary habitat/virus associations to guide future wildlife surveillance and public health efforts related to CHOV and other zoonotic pathogens. Host sequences of the mitochondrial cytochrome b gene form a single monophyletic clade in Panama, despite wide distribution across Panama. Seropositive samples were concentrated in the central region of western Panama, consistent with the ecology of this agricultural commensal and the higher incidence of CHOV in humans in that region. Hantavirus seroprevalence in the pygmy rice rat was >15% overall, with the highest prevalence in agricultural areas (21%) and the lowest prevalence in shrublands (11%). Host–pathogen distribution, transmission dynamics, genomic evolution, and habitat affinities can be derived from the preserved samples, which include frozen tissues, and now provide a foundation for expanded investigations of orthohantaviruses in Panama.more » « less
-
ABSTRACT Drosophila melanogaster, the common fruit fly, has been instrumental to our understanding of evolution, genetics and disease. There are benefits to studying these flies in the wild, including assessment of their naturally occurring microbiota. To facilitate efforts to catch wildD. melanogaster, we designed two fly traps and evaluated several candidate attractants. The first trap utilized a stable food substrate that can be used to catch live flies to establish new lab colonies. The second trap was designed to be reusable and easy to ship to enable the collection of flies over time from diverse locations. We evaluated several chemical attractants derived from banana and from marula fruit, which is the proposed ancestral food host ofD. melanogaster. We found that wild flies were preferentially attracted to banana-based odorants over marula-derived ones. Overall, these traps and attractants represent an inexpensive and simple option for the collection of wildD. melanogasterand related species for sampling or colony establishment.more » « less
-
ABSTRACT Microbes can be programmed to record participation in gene transfer by coding biological-recording devices into mobile DNA. Upon DNA uptake, these devices transcribe a catalytic RNA (cat-RNA) that binds to conserved sequences within ribosomal RNA (rRNA) and perform a trans-splicing reaction that adds a barcode to the rRNA. Existing cat-RNA designs were generated to be broad-host range, providing no control over the organisms that were barcoded. To achieve control over the organisms barcoded by cat-RNA, we created a program called Ribodesigner that uses input sets of rRNA sequences to create designs with varying specificities. We show how this algorithm can be used to identify designs that enable kingdom-wide barcoding, or selective barcoding of specific taxonomic groups within a kingdom. We use Ribodesigner to create cat-RNA designs that target Pseudomonadales while avoiding Enterobacterales, and we compare the performance of one design to a cat-RNA that was previously found to be broad host range. When conjugated into a mixture ofEscherichia coliandPseudomonas putida, the new design presents increased selectivity compared to a broad host range cat-RNA. Ribodesigner is expected to aid in developing cat-RNA that store information within user-defined sets of microbes in environmental communities for gene transfer studies. GRAPHICAL ABSTRACTmore » « less
An official website of the United States government

