skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Beyond Nature’s base pairs: machine learning-enabled design of DNA-stabilized silver nanoclusters
Sequence-encoded biomolecules such as DNA and peptides are powerful programmable building blocks for nanomaterials. This paradigm is enabled by decades of prior research into how nucleic acid and amino acid sequences dictate biomolecular interactions. The properties of biomolecular materials can be significantly expanded with non-natural interactions, including metal ion coordination of nucleic acids and amino acids. However, these approaches present design challenges because it is often not well-understood how biomolecular sequence dictates such non-natural interactions. This Feature Article presents a case study in overcoming challenges in biomolecular materials with emerging approaches in data mining and machine learning for chemical design. We review progress in this area for a specific class of DNA-templated metal nanomaterials with complex sequence-to-property relationships: DNA-stabilized silver nan- oclusters (AgN-DNAs) with bright, sequence-tuned fluorescence colors and promise for biophotonics applications. A brief overview of machine learning concepts is presented, and high-throughput experimental synthesis and characterization of AgN-DNAs are discussed. Then, recent progress in machine learning-guided design of DNA sequences that select for specific AgN-DNA fluorescence properties is reviewed. We conclude with emerging opportunities in machine learning-guided design and discovery of AgN-DNAs and other sequence-encoded biomolecular nanomaterials.  more » « less
Award ID(s):
2025790
PAR ID:
10439440
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Chemical Communications
ISSN:
1359-7345
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. DNA-stabilized silver nanoclusters (AgN-DNAs) are a class of nanomaterials comprised of 10-30 silver atoms held together by short synthetic DNA template strands. AgN-DNAs are promising biosensors and fluorophores due to their small sizes, natural compatibility with DNA, and bright fluorescence---the property of absorbing light and re-emitting light of a different color. The sequence of the DNA template acts as a "genome" for AgN-DNAs, tuning the size of the encapsulated silver nanocluster, and thus its fluorescence color. However, current understanding of the AgN-DNA genome is still limited. Only a minority of DNA sequences produce highly fluorescent AgN-DNAs, and the bulky DNA strands and complex DNA-silver interactions make it challenging to use first principles chemical calculations to understand and design AgN-DNAs. Thus, a major challenge for researchers studying these nanomaterials is to develop methods to employ observational data about studied AgN-DNAs to design new nanoclusters for targeted applications. In this work, we present an approach to design AgN-DNAs by employing variational autoencoders (VAEs) as generative models. Specifically, we employ an LSTM-based β-VAE architecture and regularize its latent space to correlate with AgN-DNA properties such as color and brightness. The regularization is adaptive to skewed sample distributions of available observational data along our design axes of properties. We employ our model for design of AgN-DNAs in the near-infrared (NIR) band, where relatively few AgN-DNAs have been observed to date. Wet lab experiments validate that when employed for designing new AgN-DNAs, our model significantly shifts the distribution of AgN-DNA colors towards the NIR while simultaneously achieving bright fluorescence. This work shows that VAE-based generative models are well-suited for the design of AgN-DNAs with multiple targeted properties, with significant potential to advance the promising applications of these nanomaterials for bioimaging, biosensing, and other critical technologies. 
    more » « less
  2. null (Ed.)
    DNA serves as a versatile template for few-atom silver clusters and their organized self-assembly. These clusters possess unique structural and photophysical properties that are programmed into the DNA template sequence, resulting in a rich palette of fluorophores which hold promise as chemical and biomolecular sensors, biolabels, and nanophotonic elements. Here, we review recent advances in the fundamental understanding of DNA-templated silver clusters (Ag N -DNAs), including the role played by the silver-mediated DNA complexes which are synthetic precursors to Ag N -DNAs, structure–property relations of Ag N -DNAs, and the excited state dynamics leading to fluorescence in these clusters. We also summarize the current understanding of how DNA sequence selects the properties of Ag N -DNAs and how sequence can be harnessed for informed design and for ordered multi-cluster assembly. To catalyze future research, we end with a discussion of several opportunities and challenges, both fundamental and applied, for the Ag N -DNA research community. A comprehensive fundamental understanding of this class of metal cluster fluorophores can provide the basis for rational design and for advancement of their applications in fluorescence-based sensing, biosciences, nanophotonics, and catalysis. 
    more » « less
  3. Peptide nucleic acids (PNAs) are high-affinity synthetic nucleic acid analogs capable of hybridization with native nucleic acids. PNAs synthesized having amino acid sidechains installed at the γ-position along the backbone provide a template for a single biopolymer to simultaneously encode nucleic acid and amino acid sequences. Previously, we reported the development of “bilingual” PNAs through the synthesis of an amphiphilic sequence featuring separate blocks of hydrophobic and hydrophilic amino acid functional groups. These PNAs combined the sequence-specific binding activity of nucleic acids with the structural organization properties of peptides. Like other amphiphilic compounds, these γ-PNAs were observed to assemble spontaneously into micelle-like nanostructures in aqueous solutions and disassembly was induced through hybridization to a complementary sequence. Here, we explore whether assembly of these bilingual PNAs is possible by harnessing the nucleic acid code. Specifically, we designed an amphiphile-masking duplex system in which spontaneous amphiphile assembly is prevented through hybridization to a nucleic acid masking sequence. We show that the amphiphile is displaced upon introduction of a releasing sequence complementary to the masking sequence through toehold mediated displacement. Upon release, we observe that the amphiphile proceeds to assemble in a fashion consistent with our previously reported structures. Our approach represents a novel method for controlled stimuli-responsive assembly of PNA-based nanostructures. 
    more » « less
  4. null (Ed.)
    Abstract The helical structures of DNA and RNA were originally revealed by experimental data. Likewise, the development of programs for modeling these natural polymers was guided by known structures. These nucleic acid polymers represent only two members of a potentially vast class of polymers with similar structural features, but that differ from DNA and RNA in the backbone or nucleobases. Xeno nucleic acids (XNAs) incorporate alternative backbones that affect the conformational, chemical, and thermodynamic properties of XNAs. Given the vast chemical space of possible XNAs, computational modeling of alternative nucleic acids can accelerate the search for plausible nucleic acid analogs and guide their rational design. Additionally, a tool for the modeling of nucleic acids could help reveal what nucleic acid polymers may have existed before RNA in the early evolution of life. To aid the development of novel XNA polymers and the search for possible pre-RNA candidates, this article presents the proto-Nucleic Acid Builder (https://github.com/GT-NucleicAcids/pnab), an open-source program for modeling nucleic acid analogs with alternative backbones and nucleobases. The torsion-driven conformation search procedure implemented here predicts structures with good accuracy compared to experimental structures, and correctly demonstrates the correlation between the helical structure and the backbone conformation in DNA and RNA. 
    more » « less
  5. Nature encodes the information required for life in two fundamental biopolymers: nucleic acids and proteins. Peptide nucleic acid (PNA), a synthetic analog comprised of nucleobases arrayed along a pseudopeptide backbone, has the ability to combine the power of nucleic acids to encode information with the versatility of amino acids to encode structure and function. Historically, PNA has been perceived as a simple nucleic acid mimic having desirable properties such as high biostability and strong affinity for complementary nucleic acids. In this feature article, we aim to adjust this perception by highlighting the ability of PNA to act as a peptide mimic and showing the largely untapped potential to encode information in the amino acid sequence. First, we provide an introduction to PNA and discuss the use of conjugation to impart tunable properties to the biopolymer. Next, we describe the integration of functional groups directly into the PNA backbone to impart specific physical properties. Lastly, we highlight the use of these integrated amino acid side chains to encode peptide-like sequences in the PNA backbone, imparting novel activity and function and demonstrating the ability of PNA to simultaneously mimic both a peptide and a nucleic acid. 
    more » « less