skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on March 11, 2026

Title: INgen: Intracellular Genomic DNA Amplification for Downstream Applications in Sequencing and Sorting
Abstract Here, we introduce intracellular genomic amplification (INgen), a method that harnesses the cell membrane as a natural reaction chamber for DNA amplification, enabling downstream sequencing and cell sorting. Unlike traditional single-cell techniques, INgen utilizes a strand-displacing, isothermal polymerase to amplify DNAwithinfixed, permeabilized cells while maintaining the cell’s structural integrity. This approach overcomes challenges associated with both typical single-cell DNA sequencing and hindrances encountered when previously attempting to sequence genetic material from fixed microbial cells, allowing amplification of genomic regions up to 100 kb and sequencing of whole genomes from diverse cell types, includingS. cerevisiae, B. subtilis, andE. coli. Additionally, INgen can be adapted for targeted DNA enrichment using biotinylated primers and for fluorescence-based cell sorting.  more » « less
Award ID(s):
2119963
PAR ID:
10594792
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
bioRxiv
Date Published:
Format(s):
Medium: X
Institution:
bioRxiv
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract BackgroundModern plant breeding strategies rely on the intensive use of advanced genomic tools to expedite the development of improved crop varieties. Genomic DNA extraction from crop seeds eliminates the need to grow plants in contrast to fresh leaf tissue; however, it can still be a bottleneck due to the presence of stored compounds and the complexity of the matrix. The interaction of environmentally benign choline-based ionic liquids (ILs) with DNA offers an innovative approach to enhance the quality of extracted DNA from seeds. While prior IL-based plant DNA extraction workflows have primarily supported polymerase chain reaction (PCR) and quantitative PCR-based applications, their suitability for high-throughput sequencing (HTS) remained largely unexplored. This study explores the efficacy of IL-assisted method for genomic DNA extraction from soybean (Glycine max) seeds, addressing the limited application of ILs in HTS. ResultsThe optimized DNA extraction method, utilizing 25% (w/v) choline formate, enabled the recovery of high-purity DNA with abundant fragment sizes > 20 kb, suitable for downstream applications including PCR, whole genome amplification (WGA), simple sequence repeat (SSR) amplification, and high-throughput Illumina sequencing. The IL-method was benchmarked against a silica-binding method using cetyltrimethylammonium bromide (CTAB) and sodium dodecyl sulfate (SDS) as lysis agents using a commercial plant DNA extraction kit in terms of DNA yield, purity, abundant DNA fragment size distribution, and integrity. In addition, DNA isolated from this method demonstrated successful PCR amplification of markers from both the nuclear and plastid genomes and yielded > 99% whole genome coverage with Illumina (PE150) sequencing reads. ConclusionsThis is the first known instance of a whole genome sequence generated from DNA extracted with ILs. These findings mark a significant milestone in establishing ILs as promising alternatives to conventional methods for seed DNA extraction, with potential utility in third generation (long-read) sequencing experiments. 
    more » « less
  2. Abstract Cancers develop and progress as mutations accumulate, and with the advent of single-cell DNA and RNA sequencing, researchers can observe these mutations and their transcriptomic effects and predict proteomic changes with remarkable temporal and spatial precision. However, to connect genomic mutations with their transcriptomic and proteomic consequences, cells with either only DNA data or only RNA data must be mapped to a common domain. For this purpose, we present MaCroDNA, a method that uses maximum weighted bipartite matching of per-gene read counts from single-cell DNA and RNA-seq data. Using ground truth information from colorectal cancer data, we demonstrate the advantage of MaCroDNA over existing methods in accuracy and speed. Exemplifying the utility of single-cell data integration in cancer research, we suggest, based on results derived using MaCroDNA, that genomic mutations of large effect size increasingly contribute to differential expression between cells as Barrett’s esophagus progresses to esophageal cancer, reaffirming the findings of the previous studies. 
    more » « less
  3. Recent advances in transcriptomic analysis at single-cell resolution reveal cell-to-cell heterogeneity in a biological sample with unprecedented resolution. Partitioning single cells in individual micro-droplets and harvesting each cell's mRNA molecules for next-generation sequencing has proven to be an effective method for profiling transcriptomes from a large number of cells at high throughput. However, the assays to recover the full transcriptomes are time-consuming in sample preparation and require expensive reagents and sequencing cost. Many biomedical applications, such as pathogen detection, prefer highly sensitive, reliable and low-cost detection of selected genes. Here, we present a droplet-based microfluidic platform that permits seamless on-chip droplet sorting and merging, which enables completing multi-step reaction assays within a short time. By sequentially adding lysis buffers and reactant mixtures to micro-droplet reactors, we developed a novel workflow of single-cell reverse transcription loop-mediated-isothermal amplification (scRT-LAMP) to quantify specific mRNA expression levels in different cell types within one hour. Including single cell encapsulation, sorting, lysing, reactant addition, and quantitative mRNA detection, the fully on-chip workflow provides a rapid, robust, and high-throughput experimental approach for a wide variety of biomedical studies. 
    more » « less
  4. Abstract MotivationSingle-nucleotide variants (SNVs) are the most common variations in the human genome. Recently developed methods for SNV detection from single-cell DNA sequencing data, such as SCIΦ and scVILP, leverage the evolutionary history of the cells to overcome the technical errors associated with single-cell sequencing protocols. Despite being accurate, these methods are not scalable to the extensive genomic breadth of single-cell whole-genome (scWGS) and whole-exome sequencing (scWES) data. ResultsHere, we report on a new scalable method, Phylovar, which extends the phylogeny-guided variant calling approach to sequencing datasets containing millions of loci. Through benchmarking on simulated datasets under different settings, we show that, Phylovar outperforms SCIΦ in terms of running time while being more accurate than Monovar (which is not phylogeny-aware) in terms of SNV detection. Furthermore, we applied Phylovar to two real biological datasets: an scWES triple-negative breast cancer data consisting of 32 cells and 3375 loci as well as an scWGS data of neuron cells from a normal human brain containing 16 cells and approximately 2.5 million loci. For the cancer data, Phylovar detected somatic SNVs with high or moderate functional impact that were also supported by bulk sequencing dataset and for the neuron dataset, Phylovar identified 5745 SNVs with non-synonymous effects some of which were associated with neurodegenerative diseases. Availability and implementationPhylovar is implemented in Python and is publicly available at https://github.com/NakhlehLab/Phylovar. 
    more » « less
  5. Fluids circulating through oceanic crust play important roles in global biogeochemical cycling mediated by their microbial inhabitants, but studying these sites is challenged by sampling logistics and low biomass. Borehole observatories installed at the North Pond study site on the western flank of the Mid-Atlantic Ridge have enabled investigation of the microbial biosphere in cold, oxygenated basaltic oceanic crust. Here we test a methodology that applies redox-sensitive fluorescent molecules for flow cytometric sorting of cells for single cell genomic sequencing from small volumes of low biomass (approximately 10 3 cells ml –1 ) crustal fluid. We compare the resulting genomic data to a recently published paired metagenomic and metatranscriptomic analysis from the same site. Even with low coverage genome sequencing, sorting cells from less than one milliliter of crustal fluid results in similar interpretation of dominant taxa and functional profiles as compared to ‘omics analysis that typically filter orders of magnitude more fluid volume. The diverse community dominated by Gammaproteobacteria, Bacteroidetes, Desulfobacterota, Alphaproteobacteria, and Zetaproteobacteria, had evidence of autotrophy and heterotrophy, a variety of nitrogen and sulfur cycling metabolisms, and motility. Together, results indicate fluorescence activated cell sorting methodology is a powerful addition to the toolbox for the study of low biomass systems or at sites where only small sample volumes are available for analysis. 
    more » « less