Abstract MotivationDeep learning has revolutionized protein tertiary structure prediction recently. The cutting-edge deep learning methods such as AlphaFold can predict high-accuracy tertiary structures for most individual protein chains. However, the accuracy of predicting quaternary structures of protein complexes consisting of multiple chains is still relatively low due to lack of advanced deep learning methods in the field. Because interchain residue–residue contacts can be used as distance restraints to guide quaternary structure modeling, here we develop a deep dilated convolutional residual network method (DRCon) to predict interchain residue–residue contacts in homodimers from residue–residue co-evolutionary signals derived from multiple sequence alignments of monomers, intrachain residue–residue contacts of monomers extracted from true/predicted tertiary structures or predicted by deep learning, and other sequence and structural features. ResultsTested on three homodimer test datasets (Homo_std dataset, DeepHomo dataset and CASP-CAPRI dataset), the precision of DRCon for top L/5 interchain contact predictions (L: length of monomer in a homodimer) is 43.46%, 47.10% and 33.50% respectively at 6 Å contact threshold, which is substantially better than DeepHomo and DNCON2_inter and similar to Glinter. Moreover, our experiments demonstrate that using predicted tertiary structure or intrachain contacts of monomers in the unbound state as input, DRCon still performs well, even though its accuracy is lower than using true tertiary structures in the bound state are used as input. Finally, our case study shows that good interchain contact predictions can be used to build high-accuracy quaternary structure models of homodimers. Availability and implementationThe source code of DRCon is available at https://github.com/jianlin-cheng/DRCon. The datasets are available at https://zenodo.org/record/5998532#.YgF70vXMKsB. Supplementary informationSupplementary data are available at Bioinformatics online.
more »
« less
High-throughput determination of RNA tertiary contact thermodynamics by quantitative DMS chemical mapping
Abstract Structured RNAs often contain long-range tertiary contacts that are critical to their function. Despite the importance of tertiary contacts, methods to measure their thermodynamics are low throughput or require specialized instruments. Here, we introduce a new quantitative chemical mapping method (qMaPseq) to measure Mg2+-induced formation of tertiary contact thermodynamics in a high-throughput manner using standard biochemistry equipment. With qMaPseq, we measured the ΔG of 98 unique tetraloop/tetraloop receptor (TL/TLR) variants in a one-pot reaction. These results agree well with measurements from specialized instruments (R2= 0.64). Furthermore, the DMS reactivity of the TL directly correlates to the stability of the contact (R2= 0.68), the first direct evidence that a single DMS reactivity measurement reports on thermodynamics. Combined with structure prediction, DMS reactivity allowed the development of experimentally accurate 3D models of TLR mutants. These results demonstrate that qMaPseq is broadly accessible, high-throughput and directly links DMS reactivity to thermodynamics.
more »
« less
- Award ID(s):
- 2143638
- PAR ID:
- 10529293
- Publisher / Repository:
- Oxford University Press
- Date Published:
- Journal Name:
- Nucleic Acids Research
- Volume:
- 52
- Issue:
- 16
- ISSN:
- 0305-1048
- Format(s):
- Medium: X Size: p. 9953-9965
- Size(s):
- p. 9953-9965
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Nearly two decades after Westhof and Michel first proposed that RNA tetraloops may interact with distal helices, tetraloop–receptor interactions have been recognized as ubiquitous elements of RNA tertiary structure. The unique architecture of GNRA tetraloops ( N =any nucleotide, R =purine) enables interaction with a variety of receptors, e.g., helical minor grooves and asymmetric internal loops. The most common example of the latter is the GAAA tetraloop–11 nt tetraloop receptor motif. Biophysical characterization of this motif provided evidence for the modularity of RNA structure, with applications spanning improved crystallization methods to RNA tectonics. In this review, we identify and compare types of GNRA tetraloop–receptor interactions. Then we explore the abundance of structural, kinetic, and thermodynamic information on the frequently occurring and most widely studied GAAA tetraloop–11 nt receptor motif. Studies of this interaction have revealed powerful paradigms for structural assembly of RNA, as well as providing new insights into the roles of cations, transition states and protein chaperones in RNA folding pathways. However, further research will clearly be necessary to characterize other tetraloop–receptor and long-range tertiary binding interactions in detail – an important milestone in the quantitative prediction of free energy landscapes for RNA folding.more » « less
-
Survival relies on the ability to flexibly choose between different actions according to varying environmental circumstances. Many lines of evidence indicate that action selection involves signaling in corticostriatal circuits, including the orbitofrontal cortex (OFC) and dorsomedial striatum (DMS). While choice-specific responses have been found in individual neurons from both areas, it is unclear whether populations of OFC or DMS neurons are better at encoding an animal’s choice. To address this, we trained head-fixed mice to perform an auditory guided two-alternative choice task, which required moving a joystick forward or backward. We then used silicon microprobes to simultaneously measure the spiking activity of OFC and DMS ensembles, allowing us to directly compare population dynamics between these areas within the same animals. Consistent with previous literature, both areas contained neurons that were selective for specific stimulus-action associations. However, analysis of concurrently recorded ensemble activity revealed that the animal’s trial-by-trial behavior could be decoded more accurately from DMS dynamics. These results reveal substantial regional differences in encoding action selection, suggesting that DMS neural dynamics are more specialized than OFC at representing an animal’s choice of action. NEW & NOTEWORTHY While previous literature shows that both orbitofrontal cortex (OFC) and dorsomedial striatum (DMS) represent information relevant to selecting specific actions, few studies have directly compared neural signals between these areas. Here we compared OFC and DMS dynamics in mice performing a two-alternative choice task. We found that the animal’s choice could be decoded more accurately from DMS population activity. This work provides among the first evidence that OFC and DMS differentially represent information about an animal’s selected action.more » « less
-
Abstract Deep learning methods that achieved great success in predicting intrachain residue-residue contacts have been applied to predict interchain contacts between proteins. However, these methods require multiple sequence alignments (MSAs) of a pair of interacting proteins (dimers) as input, which are often difficult to obtain because there are not many known protein complexes available to generate MSAs of sufficient depth for a pair of proteins. In recognizing that multiple sequence alignments of a monomer that forms homomultimers contain the co-evolutionary signals of both intrachain and interchain residue pairs in contact, we applied DNCON2 (a deep learning-based protein intrachain residue-residue contact predictor) to predict both intrachain and interchain contacts for homomultimers using multiple sequence alignment (MSA) and other co-evolutionary features of a single monomer followed by discrimination of interchain and intrachain contacts according to the tertiary structure of the monomer. We name this tool DNCON2_Inter. Allowing true-positive predictions within two residue shifts, the best average precision was obtained for the Top-L/10 predictions of 22.9% for homodimers and 17.0% for higher-order homomultimers. In some instances, especially where interchain contact densities are high, DNCON2_Inter predicted interchain contacts with 100% precision. We also developed Con_Complex, a complex structure reconstruction tool that uses predicted contacts to produce the structure of the complex. Using Con_Complex, we show that the predicted contacts can be used to accurately construct the structure of some complexes. Our experiment demonstrates that monomeric multiple sequence alignments can be used with deep learning to predict interchain contacts of homomeric proteins.more » « less
An official website of the United States government
