Structures of many large biomolecular assemblies are now being determined using integrative approaches. In these approaches, information derived from multiple experimental and computational methods is combined to compute three-dimensional structures of multi-protein complexes and other macromolecular machines. A standalone prototype data resource for integrative structures called PDB-Dev was built, based on recommendations of the Integrative and Hybrid Methods (IHM) Task Force of the Worldwide Protein Data Bank (wwPDB). This effort included developing data standards and software tools for collecting, curating, validating, visualizing, archiving, and disseminating integrative structures that span diverse spatiotemporal scales and conformational states. Mechanisms have been created to validate integrative structures based on the experimental data underpinning them. Building upon this foundational framework, PDB-Dev has been further expanded to handle large dynamic macromolecular systems and integrative structures that combine, for example, experimental restraints with atomic coordinates computed by machine learning algorithms. Data standards and supporting tools have also been extended to capture information about biomolecular dynamics, such as conformational transitions and related kinetic data derived from biophysical methods. Recently, PDB-Dev was unified with the PDB archive and rebranded as PDB-IHM (pdb-ihm.org), further promoting FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data stewardship for integrative structural biology.
more »
« less
Archiving and disseminating integrative structure models
Limitations in the applicability, accuracy, and precision of individual structure characterization methods can sometimes be overcome via an integrative modeling approach that relies on information from all available sources, including all available experimental data and prior models. The open-source Integrative Modeling Platform (IMP) is one piece of software that implements all computational aspects of integrative modeling. To maximize the impact of integrative structures, the coordinates should be made publicly available, as is already the case for structures based on X-ray crystallography, NMR spectroscopy, and electron microscopy. Moreover, the associated experimental data and modeling protocols should also be archived, such that the original results can easily be reproduced. Finally, it is essential that the integrative structures are validated as part of their publication and deposition. A number of research groups have already developed software to implement integrative modeling and have generated a number of structures, prompting the formation of an Integrative/Hybrid Methods Task Force. Following the recommendations of this task force, the existing PDBx/mmCIF data representation used for atomic PDB structures has been extended to address the requirements for archiving integrative structural models. This IHM-dictionary adds a flexible model representation, including coarse graining, models in multiple states and/or related by time or other order, and multiple input experimental information sources. A prototype archiving system called PDB-Dev ( https://pdb-dev.wwpdb.org ) has also been created to archive integrative structural models, together with a Python library to facilitate handling of integrative models in PDBx/mmCIF format.
more »
« less
- Award ID(s):
- 1756248
- PAR ID:
- 10108385
- Date Published:
- Journal Name:
- Journal of Biomolecular NMR
- ISSN:
- 0925-2738
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Structures of many complex biological assemblies are increasingly determined using integrative approaches, in which data from multiple experimental methods are combined. A standalone system, called PDB-Dev, has been developed for archiving integrative structures and making them publicly available. Here, the data standards and software tools that support PDB-Dev are described along with the new and updated components of the PDB-Dev data-collection, processing and archiving infrastructure. Following the FAIR (Findable, Accessible, Interoperable and Reusable) principles, PDB-Dev ensures that the results of integrative structure determinations are freely accessible to everyone.more » « less
-
Abstract The Protein Data Bank (PDB) archives 3D structures of macromolecules determined experimentally using various methods. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) consortium. Research Collaboratory for Structural Bioinformatics (RCSB) PDB, the US data center for the PDB, provides streamlined access to >240 000 structures through a variety of research-focused tools on RCSB.org. In addition, RCSB.org makes available over 1 million computed structure models (CSMs) predicted using deep learning methods and archived in the AlphaFold Database and ModelArchive. The PDB-IHM system was developed as a wwPDB project based on community recommendations to archive structures determined using integrative/hybrid methods (IHM). These structures are computed by combining information from multiple experimental and computational techniques to overcome the limitations of traditional single methods (e.g. macromolecular crystallography, 3D electron microscopy, nuclear magnetic resonance spectroscopy). In 2024, PDB-IHM was unified with the PDB to archive integrative structures alongside single-method experimental structures. These integrative structures have been made accessible via the RCSB.org website, facilitating efficient delivery of IHM data to a broad community of PDB users. Herein, we describe the expanded capabilities of RCSB.org that support discovery, analysis, and visualization of integrative structures together with single-method experimental structures and CSMs.more » « less
-
null (Ed.)Abstract FRET experiments can provide state-specific structural information of complex dynamic biomolecular assemblies. However, to overcome the sparsity of FRET experiments, they need to be combined with computer simulations. We introduce a program suite with ( i ) an automated design tool for FRET experiments, which determines how many and which FRET pairs should be used to minimize the uncertainty and maximize the accuracy of an integrative structure, ( ii ) an efficient approach for FRET-assisted coarse-grained structural modeling, and all-atom molecular dynamics simulations-based refinement, and ( iii ) a quantitative quality estimate for judging the accuracy of FRET-derived structures as opposed to precision. We benchmark our tools against simulated and experimental data of proteins with multiple conformational states and demonstrate an accuracy of ~3 Å RMSD Cα against X-ray structures for sets of 15 to 23 FRET pairs. Free and open-source software for the introduced workflow is available at https://github.com/Fluorescence-Tools . A web server for FRET-assisted structural modeling of proteins is available at http://nmsim.de .more » « less
-
null (Ed.)Single-molecule FRET (smFRET) has become a mainstream technique for studying biomolecular structural dynamics. The rapid and wide adoption of smFRET experiments by an ever-increasing number of groups has generated significant progress in sample preparation, measurement procedures, data analysis, algorithms and documentation. Several labs that employ smFRET approaches have joined forces to inform the smFRET community about streamlining how to perform experiments and analyze results for obtaining quantitative information on biomolecular structure and dynamics. The recent efforts include blind tests to assess the accuracy and the precision of smFRET experiments among different labs using various procedures. These multi-lab studies have led to the development of smFRET procedures and documentation, which are important when submitting entries into the archiving system for integrative structure models, PDB-Dev. This position paper describes the current ‘state of the art’ from different perspectives, points to unresolved methodological issues for quantitative structural studies, provides a set of ‘soft recommendations’ about which an emerging consensus exists, and lists openly available resources for newcomers and seasoned practitioners. To make further progress, we strongly encourage ‘open science’ practices.more » « less
An official website of the United States government

