Title: SGOOP-d: Estimating kinetic distances and reaction coordinate dimensionality for rare event systems from biased/unbiased simulations
Understanding kinetics, including reaction pathways and associated transition rates, is an important yet difficult problem in numerous chemical and biological systems, especially in situations with multiple competing pathways. When these high-dimensional systems are projected onto the low-dimensional coordinates often needed for enhanced sampling or for interpreting simulations and experiments, one often loses the kinetic connectivity of the underlying high-dimensional landscape. Thus, in the low-dimensional projection, metastable states might appear closer together or further apart than they actually are. To deal with this issue, in this work we develop a formalism that learns a multi-dimensional yet minimally complex reaction coordinate (RC) for generic high-dimensional systems, such that when projected along this RC, all possible kinetically relevant pathways can be demarcated and the true high-dimensional connectivity is maintained. One of the defining attributes of our method is that it can work on long unbiased simulations as well as on the biased simulations often needed for rare event systems. We demonstrate the utility of the method by studying a range of model systems, including conformational transitions in a small peptide, Ace-Ala3-Nme, where we show how SGOOP-derived two-dimensional and three-dimensional reaction coordinates can capture the kinetics for 23 and all 28 of the 28 dominant state-to-state transitions, respectively.
Award ID(s): 1632976
NSF-PAR ID: 10287257
Journal Name: arXiv.org
ISSN: 2331-8422
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Molecular dynamics (MD) simulations generate valuable all-atom resolution trajectories of complex systems, but analyzing this high-dimensional data as well as reaching practical timescales, even with powerful supercomputers, remain open problems. As such, many specialized sampling and reaction coordinate construction methods exist that alleviate these problems. However, these methods typically don't work directly on all atomic coordinates, and still require previous knowledge of the important distinguishing features of the system, known as order parameters (OPs). Here we present AMINO, an automated method that generates such OPs by screening through a very large dictionary of OPs, such as all heavy atom contacts in a biomolecule. AMINO uses ideas from information theory to learn OPs that can then serve as an input for designing a reaction coordinate which can then be used in many enhanced sampling methods. Here we outline its key theoretical underpinnings, and apply it to systems of increasing complexity. Our applications include a problem of tremendous pharmaceutical and engineering relevance, namely, calculating the binding affinity of a protein–ligand system when all that is known is the structure of the bound system. Our calculations are performed in a human-free fashion, obtaining very accurate results compared to long unbiased MD simulations on the Anton supercomputer, but in orders of magnitude less computer time. We thus expect AMINO to be useful for the calculation of thermodynamics and kinetics in the study of diverse molecular systems. 
  2. Base flipping is a key biophysical event involved in recognition of various ligands by ribonucleic acid (RNA) molecules. However, the mechanism of base flipping in RNA remains poorly understood, in part due to the lack of atomistic details on complex rearrangements in neighboring bases. In this work, we applied transition path sampling (TPS) methods to study base flipping in a double-stranded RNA (dsRNA) molecule that is known to interact with RNA-editing enzymes through this mechanism. We obtained an ensemble of 1000 transition trajectories to describe the base-flipping process. We used the likelihood maximization method to determine the refined reaction coordinate (RC) consisting of two collective variables (CVs), a distance and a dihedral angle between nucleotides that form stacking interactions with the flipping base. The free energy profile projected along the refined RC revealed three minima, two corresponding to the initial and final states and one for a metastable state. We suggest that the metastable state likely represents a wobbled conformation of nucleobases observed in NMR studies that is often characterized as the flipped state. The analyses of reactive trajectories further revealed that the base flipping is coupled to a global conformational change in a stem-loop of dsRNA.
  3. INTRODUCTION A brainwide, synaptic-resolution connectivity map (a connectome) is essential for understanding how the brain generates behavior. However, because of technological constraints, imaging entire brains with electron microscopy (EM) and reconstructing circuits from such datasets have been challenging. To date, complete connectomes have been mapped for only three organisms, each with several hundred brain neurons: the nematode C. elegans, the larva of the sea squirt Ciona intestinalis, and the larva of the marine annelid Platynereis dumerilii. Synapse-resolution circuit diagrams of larger brains, such as those of insects, fish, and mammals, have been approached by considering select subregions in isolation. However, neural computations span spatially dispersed but interconnected brain regions, and understanding any one computation requires the complete brain connectome with all its inputs and outputs. RATIONALE We therefore generated a connectome of an entire brain of a small insect, the larva of the fruit fly, Drosophila melanogaster. This animal displays a rich behavioral repertoire, including learning, value computation, and action selection, and shares homologous brain structures with adult Drosophila and larger insects. Powerful genetic tools are available for selective manipulation or recording of individual neuron types. In this tractable model system, hypotheses about the functional roles of specific neurons and circuit motifs revealed by the connectome can therefore be readily tested. RESULTS The complete synaptic-resolution connectome of the Drosophila larval brain comprises 3016 neurons and 548,000 synapses. We performed a detailed analysis of the brain circuit architecture, including connection and neuron types, network hubs, and circuit motifs. Most of the brain’s in-out hubs (73%) were postsynaptic to the learning center or presynaptic to the dopaminergic neurons that drive learning.
We used graph spectral embedding to hierarchically cluster neurons based on synaptic connectivity into 93 neuron types, which were internally consistent based on other features, such as morphology and function. We developed an algorithm to track brainwide signal propagation across polysynaptic pathways and analyzed feedforward (from sensory to output) and feedback pathways, multisensory integration, and cross-hemisphere interactions. We found extensive multisensory integration throughout the brain and multiple interconnected pathways of varying depths from sensory neurons to output neurons forming a distributed processing network. The brain had a highly recurrent architecture, with 41% of neurons receiving long-range recurrent input. However, recurrence was not evenly distributed and was especially high in areas implicated in learning and action selection. Dopaminergic neurons that drive learning are amongst the most recurrent neurons in the brain. Many contralateral neurons, which projected across brain hemispheres, were in-out hubs and synapsed onto each other, facilitating extensive interhemispheric communication. We also analyzed interactions between the brain and nerve cord. We found that descending neurons targeted a small fraction of premotor elements that could play important roles in switching between locomotor states. A subset of descending neurons targeted low-order post-sensory interneurons likely modulating sensory processing. CONCLUSION The complete brain connectome of the Drosophila larva will be a lasting reference study, providing a basis for a multitude of theoretical and experimental studies of brain function. The approach and computational tools generated in this study will facilitate the analysis of future connectomes. Although the details of brain organization differ across the animal kingdom, many circuit architectures are conserved. 
As more brain connectomes of other organisms are mapped in the future, comparisons between them will reveal both common and therefore potentially optimal circuit architectures, as well as the idiosyncratic ones that underlie behavioral differences between organisms. Some of the architectural features observed in the Drosophila larval brain, including multilayer shortcuts and prominent nested recurrent loops, are found in state-of-the-art artificial neural networks, where they can compensate for a lack of network depth and support arbitrary, task-dependent computations. Such features could therefore increase the brain’s computational capacity, overcoming physiological constraints on the number of neurons. Future analysis of similarities and differences between brains and artificial neural networks may help in understanding brain computational principles and perhaps inspire new machine learning architectures. Figure caption: The connectome of the Drosophila larval brain. The morphologies of all brain neurons, reconstructed from a synapse-resolution EM volume, and the synaptic connectivity matrix of an entire brain. This connectivity information was used to hierarchically cluster all brain neurons into 93 cell types, which were internally consistent based on morphology and known function.
  4. Abstract

    Host–parasite dynamics are impacted by the relationship between host density and parasite transmission, and thus, all epidemiological models contain a central transmission–density function. Recent theoretical work demonstrates that this central parasite transmission function might be best represented by a nonlinear continuum from one linear extreme to another: density‐dependent transmission at low host densities to density‐independent transmission at high host densities. But how often are nonlinear transmission functions used, and when are they better at describing transmission in real host–parasite systems?

    To quantify existing modelling practices, we systematically reviewed seven representative ecology journals, finding 262 studies containing host–parasite models that contained linear and/or nonlinear transmission functions. We also reviewed the literature to find 28 experimental and observational studies that compared multiple transmission functions in real host–parasite systems, and tallied which functions were best supported in those systems. Finally, we created a flexible model simulation tool to explore and quantify the bias in model parameter estimates that is created when using an inaccurate transmission function.

    We found that most experimental and observational studies reported that nonlinear transmission–density functions outperformed simple linear transmission–density functions, supporting recent theoretical work. In contrast, most studies containing host–parasite models assumed that host density was constant and/or used a single, linear transmission function to explain how transmission rates changed with density. Using the wrong linear function and/or using a linear function when the underlying transmission–density relationship is even slightly nonlinear can substantially bias model parameter estimates, as demonstrated by our simulations over a broad parameter space.

    Some modelling studies may be using linear functions in host–parasite systems where nonlinear functions are more appropriate. If true, these models would yield substantially biased parameter estimates. To avoid such biases that compromise ecological understanding and prediction, we recommend that future studies compare multiple transmission functions, including nonlinear options, whenever possible.

     
  5. Living systems are composed of a select number of biopolymers and minerals yet exhibit an immense diversity in materials properties. The wide-ranging characteristics, such as enhanced mechanical properties of skin and bone, or responsive optical properties derived from structural coloration, are a result of the multiscale, hierarchical structure of the materials. The fields of materials and polymer chemistry have leveraged equilibrium concepts in an effort to mimic the structurally complex materials seen in nature. However, realizing the remarkable properties of natural systems requires moving beyond an equilibrium perspective. An alternative method to create materials with multiscale structures is to approach the issue from a kinetic perspective and utilize chemical processes to drive phase transitions. This Account features an active area of research in our group, reaction-induced phase transitions (RIPT), which uses chemical reactions such as polymerizations to induce structural changes in soft material systems. Depending on the type of phase transition (e.g., microphase versus macrophase separation), the resulting change in state will occur at different length scales (e.g., nm – μm), thus dictating the structure of the material. For example, the in situ formation of either a block copolymer or a homopolymer initially in a monomer mixture during a polymerization will drive nanoscale or macroscale transitions, respectively. Specifically, three different examples utilizing reaction-driven phase changes will be discussed: 1) in situ polymer grafting from block copolymers, 2) multiscale polymer nanocomposites, and 3) Lewis adduct-driven phase transitions. All three areas highlight how chemical changes via polymerizations or specific chemical binding result in phase transitions that lead to nanostructural and multiscale changes.
Harnessing kinetic chemical processes to promote and control material structure, as opposed to organizing pre-synthesized molecules, polymers, or nanoparticles within a thermodynamic framework, is a growing area of interest. Trapping nonequilibrium states in polymer materials has primarily been approached from a polymer chain conformation viewpoint, in which synthesized polymers are subjected to different thermal and processing conditions. The impact of reaction kinetics and polymerization rate on final polymer material structure is starting to be recognized as a new way to access morphologies not available through thermodynamic means. Furthermore, kinetic control of polymer material structure is not specific to polymerizations and encompasses any chemical reaction that induces morphology transitions. Kinetically driven processes to dictate material structure directly impact a broad range of areas including separation membranes, biomolecular condensates, cell mobility, and the self-assembly of polymers and colloids. Advancing polymer material syntheses using kinetic principles such as RIPT opens new possibilities for dictating material structure and properties beyond what is currently available with traditional self-assembly techniques.
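The transmission–density continuum described in item 4 of the list above can be illustrated with a minimal sketch. The saturating form used here is one common way to interpolate between the two linear extremes; it is an assumption chosen for illustration and not necessarily a function used in that study. The function names and the parameters `beta`, `beta_max`, and `half_sat` are hypothetical.

```python
import numpy as np


def density_dependent(n, beta):
    """Linear extreme 1: per-capita transmission rate rises in proportion to host density n."""
    return beta * np.asarray(n, dtype=float)


def density_independent(n, beta_max):
    """Linear extreme 2: per-capita transmission rate is constant, whatever the host density."""
    return np.full_like(np.asarray(n, dtype=float), beta_max)


def saturating(n, beta_max, half_sat):
    """Assumed nonlinear form: beta_max * n / (half_sat + n).

    For n much smaller than half_sat it behaves like (beta_max / half_sat) * n
    (density-dependent); for n much larger than half_sat it approaches beta_max
    (density-independent).
    """
    n = np.asarray(n, dtype=float)
    return beta_max * n / (half_sat + n)


if __name__ == "__main__":
    # Compare the three functions across a wide range of host densities.
    densities = np.array([0.01, 1.0, 10.0, 100.0, 1e6])
    beta_max, half_sat = 2.0, 10.0
    print(density_dependent(densities, beta_max / half_sat))
    print(saturating(densities, beta_max, half_sat))
    print(density_independent(densities, beta_max))
```

Fitting only one of the two linear functions to data generated by the saturating form would misestimate transmission at one end of the density range, which is the kind of bias that abstract warns about.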