Protein side chain packing (PSCP) is a fundamental problem in the field of protein engineering, as high‐confidence and low‐energy conformations of amino acid side chains are crucial for understanding (and designing) protein folding, protein–protein interactions, and protein‐ligand interactions. Traditional PSCP methods (such as the Rosetta Packer) often rely on a library of discrete side chain conformations, or rotamers, and a forcefield to guide the structure to low‐energy conformations. Recently, deep learning (DL) based methods (such as DLPacker, AttnPacker, and DiffPack) have demonstrated state‐of‐the‐art predictions and speed in the PSCP task. Building off the success of geometric graph neural networks for protein modeling, we present the Protein Invariant Point Packer (PIPPack) which effectively processes local structural and sequence information to produce realistic, idealized side chain coordinates using ‐angle distribution predictions and geometry‐aware invariant point message passing (IPMP). On a test set of ∼1400 high‐quality protein chains, PIPPack is highly competitive with other state‐of‐the‐art PSCP methods in rotamer recovery and per‐residue RMSD but is significantly faster.
One of the most intriguing features of biological systems is their ability to regulate the steady-state fluxes of the underlying biochemical reactions; however, the regulatory mechanisms and their physicochemical properties are not fully understood. Fundamentally, flux regulation can be explained with a chemical kinetic formalism describing the transitions between discrete states, with the reaction rates defined by an underlying free energy landscape. Which features of the energy landscape affect the flux distribution? Here we prove that the ratios of the steady-state fluxes of quasi–first-order biochemical processes are invariant to energy perturbations of the discrete states and are only affected by the energy barriers. In other words, the nonequilibrium flux distribution is under kinetic and not thermodynamic control. We illustrate the generality of this result for three biological processes. For the network describing protein folding along competing pathways, the probabilities of proceeding via these pathways are shown to be invariant to the stability of the intermediates or to the presence of additional misfolded states. For the network describing protein synthesis, the error rate and the energy expenditure per peptide bond is proven to be independent of the stability of the intermediate states. For molecular motors such as myosin-V, the ratio of forward to backward steps and the number of adenosine 5′-triphosphate (ATP) molecules hydrolyzed per step is demonstrated to be invariant to energy perturbations of the intermediate states. These findings place important constraints on the ability of mutations and drug perturbations to affect the steady-state flux distribution for a wide class of biological processes.
more » « less- PAR ID:
- 10143496
- Publisher / Repository:
- Proceedings of the National Academy of Sciences
- Date Published:
- Journal Name:
- Proceedings of the National Academy of Sciences
- Volume:
- 117
- Issue:
- 16
- ISSN:
- 0027-8424
- Page Range / eLocation ID:
- p. 8884-8889
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
Abstract We present an analysis of the energy partitioning in the magnetotail during a substorm at 03:58:00 UT on 7 February 2009. The analysis employs a multiscale approach where we use a state from a global magnetohydrodynamics (MHD) model to spawn a kinetic particle‐in‐cell (PIC) simulation of a large portion of the tail. We directly investigate the energy fluxes resulting from magnetic reconnection. The kinetic run provides information on the additional processes absent in the MHD description. The ion bulk energy and enthalpy fluxes carry the greatest energy, but the Poynting flux and electron enthalpy flux also carry a significant portion. The other fluxes (e.g., heat flux) are relatively small but are especially important because they allow us to identify the extra processes present only in the kinetic description. The energy fluxes present in the MHD approximation (Poynting flux, enthalpy flux, and bulk energy flux) are quantitatively accurate, and the kinetic correction does not greatly alter the MHD picture. However, there are two unique effects resulting from the kinetic physics. First, the formation of a rarefaction of the plasma flow into the reconnection site leads to a progressive decline in time of the particle energy fluxes with respect to the Poynting flux. Second, we observe that the instabilities developing in the kinetic reconnection outflows form structures absent from the MHD description. These structures reveal themselves as fluctuations within the energy fluxes. Especially notable are regions of inverted heat flux, where the heat flux is in the opposite direction to the total energy and mass flow.
-
Escherichia coli single-stranded (ss)DNA binding (SSB) protein mediates genome maintenance processes by regulating access to ssDNA. This homotetrameric protein wraps ssDNA in multiple distinct binding modes that may be used selectively in different DNA processes, and whose detailed wrapping topologies remain speculative. Here, we used single-molecule force and fluorescence spectroscopy to investigate E. coli SSB binding to ssDNA. Stretching a single ssDNA-SSB complex reveals discrete states that correlate with known binding modes, the likely ssDNA conformations and diffusion dynamics in each, and the kinetic pathways by which the protein wraps ssDNA and is dissociated. The data allow us to construct an energy landscape for the ssDNA-SSB complex, revealing that unwrapping energy costs increase the more ssDNA is unraveled. Our findings provide insights into the mechanism by which proteins gain access to ssDNA bound by SSB, as demonstrated by experiments in which SSB is displaced by the E. coli recombinase RecA.
-
The frequency distributions can characterize the population-potential landscape related to the stability of ecological states. We illustrate the practical utility of this approach by analyzing a forest–savanna model. Savanna and forest states coexist under certain conditions, consistent with past theoretical work and empirical observations. However, a grassland state, unseen in the corresponding deterministic model, emerges as an alternative quasi-stable state under fluctuations, providing a theoretical basis for the appearance of widespread grasslands in some empirical analyses. The ecological dynamics are determined by both the population-potential landscape gradient and the steady-state probability flux. The flux quantifies the net input/output to the ecological system and therefore the degree of nonequilibriumness. Landscape and flux together determine the transitions between stable states characterized by dominant paths and switching rates. The intrinsic potential landscape admits a Lyapunov function, which provides a quantitative measure of global stability. We find that the average flux, entropy production rate, and free energy have significant changes near bifurcations under both finite and zero fluctuation. These may provide both dynamical and thermodynamic origins of the bifurcations. We identified the variances in observed frequency time traces, fluctuations, and time irreversibility as kinematic measures for bifurcations. This framework opens the way to characterize ecological systems globally, to uncover how they change among states, and to quantify the emergence of quasi-stable states under stochastic fluctuations.
-
Abstract Developing improved catalysts for sustainable chemical processes often involves understanding atomistic origins of catalytic activity, selectivity, and stability. Using density functional theory and steady‐state kinetic analyses, we probe the elementary steps that form decomposition products that limit selectivity in vinyl acetate (VA) synthesis on Pd surfaces covered with acetate species. Acetate formation and coupling with ethylene control the VA formation catalytic cycle and steady‐state coverage, but acetate and ethylene can separately decompose to form CO2. Both decompositions involve initial C−H activations at acetate vacancies, followed by additional C−H activations and eventual C−O formations and C−C cleavages involving reactions with molecular oxygen. Acetate decomposition paths with non‐oxidative kinetically‐relevant steps exhibit similar free energy barriers to oxidative paths. In contrast, the non‐oxidative ethylene path involving an ethylidyne intermediate exhibits a much lower barrier than paths with oxidative kinetically‐relevant steps. Ethylene decomposition is very facile at low coverages but is more coverage‐sensitive, leading to similar decomposition and VA formation barriers at coverages accessible at steady state, which is consistent with moderate VA selectivity in measurements and ethylene vs. acetate decomposition contributions assessed from regressed kinetic parameters. These insights provide a detailed framework for describing VA synthesis rates and selectivity on metallic catalyst surfaces.