skip to main content


Title: DP-DNA: A Digital Pattern-Aware DNA Encoding Scheme to Improve Encoding Density of DNA Storage
Award ID(s):
2204656
NSF-PAR ID:
10500537
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
IEEE
Date Published:
Page Range / eLocation ID:
1 to 8
Format(s):
Medium: X
Location:
Stony Brook, NY, USA
Sponsoring Org:
National Science Foundation
More Like this
  1. While single-stranded DNA (ssDNA) was once thought to be a relatively rare genomic architecture for viruses, modern metagenomics sequencing has revealed circular ssDNA viruses in most environments and in association with diverse hosts. In particular, circular ssDNA viruses encoding a homologous replication-associated protein (Rep) have been identified in the majority of eukaryotic supergroups, generating interest in the ecological effects and evolutionary history of circular Rep-encoding ssDNA viruses (CRESS DNA) viruses. This review surveys the explosion of sequence diversity and expansion of eukaryotic CRESS DNA taxonomic groups over the last decade, highlights similarities between the well-studied geminiviruses and circoviruses with newly identified groups known only through their genome sequences, discusses the ecology and evolution of eukaryotic CRESS DNA viruses, and speculates on future research horizons. 
    more » « less
  2. Roux, Simon (Ed.)

    Iterons are short, repeated DNA sequences that are important for the replication of circular single-stranded DNA viruses. No tools that can reliably predict iterons are currently available. The CRUcivirus Iteron SEarch (CRUISE) tool is a computational tool that identifies iteron candidates near stem-loop structures in viral genomes.

     
    more » « less
  3. Abstract Transposable elements represent the largest components of many eukaryotic genomes and different genomes harbor different combinations of elements. Here, we discovered a novel DNA transposon in the genome of the clubmoss Selaginella lepidophylla. Further searching for related sequences to the conserved DDE region uncovered the presence of this superfamily of elements in fish, coral, sea anemone, and other animal species. However, this element appears restricted to Bryophytes and Lycophytes in plants. This transposon, named GingerRoot, is associated with a 6 bp (base pair) target site duplication, and 100–150 bp terminal inverted repeats. Analysis of transposase sequences identified the DDE motif, a catalytic domain, which shows similarity to the integrase of Gypsy-like long terminal repeat retrotransposons, the most abundant component in plant genomes. A total of 77 intact and several hundred truncated copies of GingerRoot elements were identified in S. lepidophylla. Like Gypsy retrotransposons, GingerRoots show a lack of insertion preference near genes, which contrasts to the compact genome size of about 100 Mb. Nevertheless, a considerable portion of GingerRoot elements was found to carry gene fragments, suggesting the capacity of duplicating gene sequences is unlikely attributed to the proximity to genes. Elements carrying gene fragments appear to be less methylated, more diverged, and more distal to genes than those without gene fragments, indicating they are preferentially retained in gene-poor regions. This study has identified a broadly dispersed, novel DNA transposon, and the first plant DNA transposon with an integrase-related transposase, suggesting the possibility of de novo formation of Gypsy-like elements in plants. 
    more » « less
  4. Abstract

    Deoxyribonucleic acid (DNA) is emerging as an alternative archival memory technology. Recent advancements in DNA synthesis and sequencing have both increased the capacity and decreased the cost of storing information in de novo synthesized DNA pools. In this survey, we review methods for translating digital data to and/or from DNA molecules. An emphasis is placed on methods which have been validated by storing and retrieving real-world data via in-vitro experiments.

     
    more » « less