skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on January 1, 2026

Title: Computing and Bounding Equilibrium Concentrations in Athermic Chemical Systems
Computing equilibrium concentrations of molecular complexes is generally analytically intractable and requires numerical approaches. In this work we focus on the polymer-monomer level, where indivisible molecules (monomers) combine to form complexes (polymers). Rather than employing free-energy parameters for each polymer, we focus on the athermic setting where all interactions preserve enthalpy. This setting aligns with the strongly bonded (domain-based) regime in DNA nanotechnology when strands can bind in different ways, but always with maximum overall bonding - and is consistent with the saturated configurations in the Thermodynamic Binding Networks (TBNs) model. Within this context, we develop an iterative algorithm for assigning polymer concentrations to satisfy detailed-balance, where on-target (desired) polymers are in high concentrations and off-target (undesired) polymers are in low. Even if not directly executed, our algorithm provides effective insights into upper bounds on concentration of off-target polymers, connecting combinatorial arguments about discrete configurations such as those in the TBN model to real-valued concentrations. We conclude with an application of our method to decreasing leak in DNA logic and signal propagation. Our results offer a new framework for design and verification of equilibrium concentrations when configurations are distinguished by entropic forces.  more » « less
Award ID(s):
2227578
PAR ID:
10635567
Author(s) / Creator(s):
; ;
Editor(s):
Schaeffer, Josie; Zhang, Fei
Publisher / Repository:
Schloss Dagstuhl – Leibniz-Zentrum für Informatik
Date Published:
Volume:
347
ISSN:
1868-8969
ISBN:
978-3-95977-399-7
Page Range / eLocation ID:
10:1-10:19
Subject(s) / Keyword(s):
Equilibrium concentrations Thermodynamic Binding Networks Monomer-polymer model Detailed balance Theory of computation → Models of computation Theory of computation → Design and analysis of algorithms
Format(s):
Medium: X Size: 19 pages; 1563181 bytes Other: application/pdf
Size(s):
19 pages 1563181 bytes
Right(s):
Creative Commons Attribution 4.0 International license; info:eu-repo/semantics/openAccess
Sponsoring Org:
National Science Foundation
More Like this
  1. Complex DNA topological structures, including polymer loops, are frequently observed in biological processes when protein molecules simultaneously bind to several distant sites on DNA. However, the molecular mechanisms of formation of these systems remain not well understood. Existing theoretical studies focus only on specific interactions between protein and DNA molecules at target sequences. However, the electrostatic origin of primary protein–DNA interactions suggests that interactions of proteins with all DNA segments should be considered. Here we theoretically investigate the role of non-specific interactions between protein and DNA molecules on the dynamics of loop formation. Our approach is based on analyzing a discrete-state stochastic model via a method of first-passage probabilities supplemented by Monte Carlo computer simulations. It is found that depending on a protein sliding length during the non-specific binding event three different dynamic regimes of the DNA loop formation might be observed. In addition, the loop formation time might be optimized by varying the protein sliding length, the size of the DNA molecule, and the position of the specific target sequences on DNA. Our results demonstrate the importance of non-specific protein–DNA interactions in the dynamics of DNA loop formations. 
    more » « less
  2. Many problems in the physical sciences, machine learning, and statistical inference necessitate sampling from a high-dimensional, multimodal probability distribution. Markov Chain Monte Carlo (MCMC) algorithms, the ubiquitous tool for this task, typically rely on random local updates to propagate configurations of a given system in a way that ensures that generated configurations will be distributed according to a target probability distribution asymptotically. In high-dimensional settings with multiple relevant metastable basins, local approaches require either immense computational effort or intricately designed importance sampling strategies to capture information about, for example, the relative populations of such basins. Here, we analyze an adaptive MCMC, which augments MCMC sampling with nonlocal transition kernels parameterized with generative models known as normalizing flows. We focus on a setting where there are no preexisting data, as is commonly the case for problems in which MCMC is used. Our method uses 1) an MCMC strategy that blends local moves obtained from any standard transition kernel with those from a generative model to accelerate the sampling and 2) the data generated this way to adapt the generative model and improve its efficacy in the MCMC algorithm. We provide a theoretical analysis of the convergence properties of this algorithm and investigate numerically its efficiency, in particular in terms of its propensity to equilibrate fast between metastable modes whose rough location is known a priori but respective probability weight is not. We show that our algorithm can sample effectively across large free energy barriers, providing dramatic accelerations relative to traditional MCMC algorithms. 
    more » « less
  3. We study exploration in stochastic multi-armed bandits when we have access to a divisible resource that can be allocated in varying amounts to arm pulls. We focus in particular on the allocation of distributed computing resources, where we may obtain results faster by allocating more resources per pull, but might have reduced throughput due to nonlinear scaling. For example, in simulation-based scientific studies, an expensive simulation can be sped up by running it on multiple cores. This speed-up however, is partly offset by the communication among cores, which results in lower throughput than if fewer cores were allocated to run more trials in parallel. In this paper, we explore these trade-offs in two settings. First, in a fixed confidence setting, we need to find the best arm with a given target success probability as quickly as possible. We propose an algorithm which trades off between information accumulation and throughput and show that the time taken can be upper bounded by the solution of a dynamic program whose inputs are the gaps between the sub-optimal and optimal arms. We also prove a matching hardness result. Second, we present an algorithm for a fixed deadline setting, where we are given a time deadline and need to maximize the probability of finding the best arm. We corroborate our theoretical insights with simulation experiments that show that the algorithms consistently match or outperform baseline algorithms on a variety of problem instances. 
    more » « less
  4. The Flory–Huggins theory describes the phase separation of solutions containing polymers. Although it finds widespread application from polymer physics to materials science to biology, the concentrations that coexist in separate phases at equilibrium have not been determined analytically, and numerical techniques are required that restrict the theory’s ease of application. In this work, we derive an implicit analytical solution to the Flory–Huggins theory of one polymer in a solvent by applying a procedure that we call the implicit substitution method. While the solutions are implicit and in the form of composite variables, they can be mapped explicitly to a phase diagram in composition space. We apply the same formalism to multicomponent polymeric systems, where we find analytical solutions for polydisperse mixtures of polymers of one type. Finally, while complete analytical solutions are not possible for arbitrary mixtures, we propose computationally efficient strategies to map out coexistence curves for systems with many components of different polymer types. 
    more » « less
  5. Abstract DNA serves as a model system in polymer physics due to its ability to be obtained as a uniform polymer with controllable topology and nonequilibrium behavior. Currently, a major obstacle in the widespread adoption of DNA is obtaining it on a scale and cost basis that accommodates bulk rheology and high‐throughput screening. To address this, recent advancements in bioreactor‐based plasmid DNA production is coupled with anion exchange chromatography producing a unified approach to generating gram‐scale quantities of monodisperse DNA. With this method, 1.1 grams of DNA is obtained per batch to generate solutions with concentrations up to 116 mg mL−1. This solution of uniform supercoiled and relaxed circular plasmid DNA, is roughly 69 times greater than the overlap concentration. The utility of this method is demonstrated by performing bulk rheology measurements at sample volumes up to 1 mL on DNA of different lengths, topologies, and concentrations. The measured elastic moduli are orders of magnitude larger than those previously reported for DNA and allowed for the construction of a time‐concentration superposition curve that spans 12 decades of frequency. Ultimately, these results can provide important insights into the dynamics of ring polymers and the nature of highly condensed DNA dynamics. 
    more » « less