skip to main content


Title: ROTDIF-web and ALTENS: GenApp-based Science Gateways for Biomolecular Nuclear Magnetic Resonance (NMR) Data Analysis and Structure Modeling
Proteins and nucleic acids participate in essentially every biochemical process in living organisms, and the elucidation of their structure and motions is essential for our understanding how these molecular machines perform their function. Nuclear Magnetic Resonance (NMR) spectroscopy is a powerful versatile technique that provides critical information on the molecular structure and dynamics. Spin-relaxation data are used to determine the overall rotational diffusion and local motions of biological macromolecules, while residual dipolar couplings (RDCs) reveal local and long-range structural architecture of these molecules and their complexes. This information allows researchers to refine structures of proteins and nucleic acids and provides restraints for molecular docking. Several software packages have been developed by NMR researchers in order to tackle the complicated experimental data analysis and structure modeling. However, many of them are offline packages or command-line applications that require users to set up the run time environment and also to possess certain programming skills, which inevitably limits accessibility of this software to a broad scientific community. Here we present new science gateways designed for NMR/structural biology community that address these current limitations in NMR data analysis. Using the GenApp technology for scientific gateways (https://genapp.rocks), we successfully transformed ROTDIF and ALTENS, two offline packages for bio-NMR data analysis, into science gateways that provide advanced computational functionalities, cloud-based data management, and interactive 2D and 3D plotting and visualizations. Furthermore, these gateways are integrated with molecular structure visualization tools (Jmol) and with gateways/engines (SASSIE-web) capable of generating huge computer-simulated structural ensembles of proteins and nucleic acids. This enables researchers to seamlessly incorporate conformational ensembles into the analysis in order to adequately take into account structural heterogeneity and dynamic nature of biological macromolecules. ROTDIF-web offers a versatile set of integrated modules/tools for determining and predicting molecular rotational diffusion tensors and model-free characterization of bond dynamics in biomacromolecules and for docking of molecular complexes driven by the information extracted from NMR relaxation data. ALTENS allows characterization of the molecular alignment under anisotropic conditions, which enables researchers to obtain accurate local and long-range bond-vector restraints for refining 3-D structures of macromolecules and their complexes. We will describe our experience bringing our programs into GenApp and illustrate the use of these gateways for specific examples of protein systems of high biological significance. We expect these gateways to be useful to structural biologists and biophysicists as well as NMR community and to stimulate other researchers to share their scientific software in a similar way.  more » « less
Award ID(s):
1912444 1740097 1739549
NSF-PAR ID:
10116253
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Gateways 2019
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    As a discipline, structural biology has been transformed by the three-dimensional electron microscopy (3DEM) “Resolution Revolution” made possible by convergence of robust cryo-preservation of vitrified biological materials, sample handling systems, and measurement stages operating a liquid nitrogen temperature, improvements in electron optics that preserve phase information at the atomic level, direct electron detectors (DEDs), high-speed computing with graphics processing units, and rapid advances in data acquisition and processing software. 3DEM structure information (atomic coordinates and related metadata) are archived in the open-access Protein Data Bank (PDB), which currently holds more than 11,000 3DEM structures of proteins and nucleic acids, and their complexes with one another and small-molecule ligands (~ 6% of the archive). Underlying experimental data (3DEM density maps and related metadata) are stored in the Electron Microscopy Data Bank (EMDB), which currently holds more than 21,000 3DEM density maps. After describing the history of the PDB and the Worldwide Protein Data Bank (wwPDB) partnership, which jointly manages both the PDB and EMDB archives, this review examines the origins of the resolution revolution and analyzes its impact on structural biology viewed through the lens of PDB holdings. Six areas of focus exemplifying the impact of 3DEM across the biosciences are discussed in detail (icosahedral viruses, ribosomes, integral membrane proteins, SARS-CoV-2 spike proteins, cryogenic electron tomography, and integrative structure determination combining 3DEM with complementary biophysical measurement techniques), followed by a review of 3DEM structure validation by the wwPDB that underscores the importance of community engagement.

     
    more » « less
  2. Advances in time-resolved structural techniques, mainly in macromolecular crystallography and small-angle X-ray scattering (SAXS), allow for a detailed view of the dynamics of biological macromolecules and reactions between binding partners. Of particular promise, are mix-and-inject techniques, which offer a wide range of experimental possibility as microfluidic mixers are used to rapidly combine two species just prior to data collection. Most mix-and-inject approaches rely on diffusive mixers, which have been effectively used within crystallography and SAXS for a variety of systems, but their success is dependent on a specific set of conditions to facilitate fast diffusion for mixing. The use of a new chaotic advection mixer designed for microfluidic applications helps to further broaden the types of systems compatible with time-resolved mixing experiments. The chaotic advection mixer can create ultra-thin, alternating layers of liquid, enabling faster diffusion so that even more slowly diffusing molecules, like proteins or nucleic acids, can achieve fast mixing on timescales relevant to biological reactions. This mixer was first used in UV–vis absorbance and SAXS experiments with systems of a variety of molecular weights, and thus diffusion speeds. Careful effort was also dedicated to making a loop-loading sample-delivery system that consumes as little sample as possible, enabling the study of precious, laboratory-purified samples. The combination of the versatile mixer with low sample consumption opens the door to many new applications for mix-and-inject studies. 
    more » « less
  3. The three-dimensional architecture of biomolecules often creates specialized structural elements, notably short hydrogen bonds that have donor–acceptor separations below 2.7 Å. In this work, we statistically analyze 1663 high-resolution biomolecular structures from the Protein Data Bank and demonstrate that short hydrogen bonds are prevalent in proteins, protein–ligand complexes and nucleic acids. From these biological macromolecules, we characterize the preferred location, connectivity and amino acid composition in short hydrogen bonds and hydrogen bond networks, and assess their possible functional importance. Using electronic structure calculations, we further uncover how the interplay of the structural and chemical features determines the proton potential energy surfaces and proton sharing conditions in biological short hydrogen bonds. 
    more » « less
  4. Bacterial tyrosine kinases (BY-kinases) comprise a family of protein tyrosine kinases that are structurally distinct from their functional counterparts in eukaryotes and are highly conserved across the bacterial kingdom. BY-kinases act in concert with their counteracting phosphatases to regulate a variety of cellular processes, most notably the synthesis and export of polysaccharides involved in biofilm and capsule biogenesis. Biochemical data suggest that BY-kinase function involves the cyclic assembly and disassembly of oligomeric states coupled to the overall phosphorylation levels of a C-terminal tyrosine cluster. This process is driven by the opposing effects of intermolecular autophosphorylation, and dephosphorylation catalyzed by tyrosine phosphatases. In the absence of structural insight into the interactions between a BY-kinase and its phosphatase partner in atomic detail, the precise mechanism of this regulatory process has remained poorly defined. To address this gap in knowledge, we have determined the structure of the transiently assembled complex between the catalytic core of the Escherichia coli (K-12) BY-kinase Wzc and its counteracting low–molecular weight protein tyrosine phosphatase (LMW-PTP) Wzb using solution NMR techniques. Unambiguous distance restraints from paramagnetic relaxation effects were supplemented with ambiguous interaction restraints from static spectral perturbations and transient chemical shift changes inferred from relaxation dispersion measurements and used in a computational docking protocol for structure determination. This structure presents an atomic picture of the mode of interaction between an LMW-PTP and its BY-kinase substrate, and provides mechanistic insight into the phosphorylation-coupled assembly/disassembly process proposed to drive BY-kinase function. 
    more » « less
  5. ABSTRACT Chemical exchange line broadening is an important phenomenon in nuclear magnetic resonance (NMR) spectroscopy, in which a nuclear spin experiences more than one magnetic environment as a result of chemical or conformational changes of a molecule. The dynamic process of chemical exchange strongly affects the sensitivity and resolution of NMR experiments and increasingly provides a powerful probe of the interconversion between chemical and conformational states of proteins, nucleic acids, and other biologic macromolecules. A simple and often used theoretic description of chemical exchange in NMR spectroscopy is based on an idealized 2-state jump model (the random phase or telegraph signal). However, chemical exchange can also be represented as a barrier crossing event that can be modeled by using chemical reaction rate theory. The timescale of crossing is determined by the barrier height, the temperature, and the dissipation modeled as collisional or frictional damping. This tutorial explores the connection between the NMR theory of chemical exchange line broadening and strong collision models for chemical kinetics in statistical mechanics. Theoretic modeling and numeric simulation are used to map the rate of barrier crossing dynamics of a particle on a potential energy surface to the chemical exchange relaxation rate constant. By developing explicit models for the exchange dynamics, the tutorial aims to elucidate the underlying dynamical processes that give rise to the rich phenomenology of chemical exchange observed in NMR spectroscopy. Software for generating and analyzing the numeric simulations is provided in the form of Python and Fortran source codes. 
    more » « less