<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Synergy among Pausing, Intrinsic Proofreading, and Accessory Proteins Results in Optimal Transcription Speed and Tolerable Accuracy</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>04/13/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10416881</idno>
					<idno type="doi">10.1021/acs.jpclett.3c00345</idno>
					<title level='j'>The Journal of Physical Chemistry Letters</title>
<idno>1948-7185</idno>
<biblScope unit="volume">14</biblScope>
<biblScope unit="issue">14</biblScope>					

					<author>Tripti Midha</author><author>Joel D. Mallory</author><author>Anatoly B. Kolomeisky</author><author>Oleg A. Igoshin</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Cleavage of dinucleotides after the misincorporational pauses serves as a proofreading mechanism that increases transcriptional elongation accuracy. The accuracy is further improved by accessory proteins such as GreA and TFIIS. However, it is not clear why RNAP pauses and why cleavage-factor-assisted proofreading is necessary despite transcriptional errors in vitro being of the same order as those in downstream translation. Here, we developed a chemical-kinetic model that incorporates most relevant features of transcriptional proofreading and uncovers how the balance between speed and accuracy is achieved. We found that long pauses are essential for high accuracy, whereas cleavage-factor-stimulated proofreading optimizes speed. Moreover, in comparison to the cleavage of a single nucleotide or three nucleotides, RNAP backtracking and dinucleotide cleavage improve both speed and accuracy. Our results thereby show how the molecular mechanism and the kinetic parameters of the transcriptional process were evolutionarily optimized to achieve maximal speed and tolerable accuracy.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>T he ability of cells to read and transfer molecular information requires discrimination among chemically similar substrates. <ref type="bibr">1</ref> For instance, each enzyme involved in processes such as DNA replication, transcription, and translation must accurately distinguish correct versus incorrect substrates in each biochemical step. <ref type="bibr">[2]</ref><ref type="bibr">[3]</ref><ref type="bibr">[4]</ref> Many of these processes display remarkable accuracy; e.g., DNA polymerases on average incorporate fewer than one incorrect base in the polynucleotide chain for every 10 7 steps. <ref type="bibr">2</ref> To achieve such accuracy, these enzymes employ kinetic proofreading (KPR), a mechanism that reduces the number of errors below the equilibrium thermodynamic limit with additional non-equilibrium steps that are coupled to energy dissipation (e.g., ATP hydrolysis). <ref type="bibr">5,</ref><ref type="bibr">6</ref> As a result, higher accuracy comes at the expense of increased energy costs and slower kinetics. Recent theoretical models have clarified how the free energy landscapes of enzymes evolved to optimize speed-accuracydissipation trade-offs for DNA replication, translation, tRNA charging, and coronavirus RNA genome replication. <ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref> For instance, evolution tuned the kinetic parameters of DNA polymerase and ribosome toward maximizing speed while still maintaining the the number of errors at tolerable levels. Structural investigations as well as in vitro and in vivo enzymatic studies of RNA polymerase (RNAP) showed that transcription in bacterial and eukaryotic cells also evolved to employ KPR. <ref type="bibr">15,</ref><ref type="bibr">16,</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref> However, the molecular details of the proofreading mechanisms are more complex than a classical Hopfield-Ninio picture. <ref type="bibr">5,</ref><ref type="bibr">6,</ref><ref type="bibr">24</ref> The proofreading mechanism for transcription by RNAP involves several unique features: pauses, backtracking, and endonucleolytic cleavage of multiple bases. <ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[28]</ref> Pauses are the intervals where transcriptional elongation has temporarily ceased after the addition of a noncognate nucleotide. Backtracking, which sometimes follows pausing, involves an upstream translocation of RNAP by one or several bases. Following the backtracking, endonucleolytic cleavage of the most recently added ribonucleotides occurs. <ref type="bibr">22,</ref><ref type="bibr">28</ref> Backtracking by one base pair, which is known to be energetically more favorable, leads to the endonucleolytic dinucleotide cleavage event. <ref type="bibr">[29]</ref><ref type="bibr">[30]</ref><ref type="bibr">[31]</ref> The cleavage of two nucleotides at once provides two checkpoints for the cleavage of each misincorporated nucleotide. The first checkpoint enables the recently misincorporated nucleotide to be removed along with the previously incorporated nucleotide. The second checkpoint allows for the cleavage of the misincorporated nucleotide along with the next incorporated nucleotide. Furthermore, additional accessory proteins, such as GreA in the bacterium Escherichia coli, TFIIS in eukaryotes, and TFS in archaea, can increase transcriptional fidelity by preferentially stimulating the cleavage of misincorporated nucleotides. <ref type="bibr">28,</ref><ref type="bibr">32</ref> In vitro experiments have shown that in eukaryotes, the RNA polymerase II subunit Rpb9 improves transcriptional fidelity by decreasing error propagation rates and increasing the error excision rate, which is facilitated by TFIIS. <ref type="bibr">33,</ref><ref type="bibr">34</ref> Although Rpb9 does not directly contribute to the intrinsic cleavage activity of pol II, it is nevertheless required for the optimal function of the transcript cleavage factor TFIIS. <ref type="bibr">33</ref> The mutation in Rpb9 can decrease the duration of pause, leading to a larger error. <ref type="bibr">[34]</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref> Several studies have also argued that TFIIS can simultaneously decrease the duration of pauses. <ref type="bibr">37,</ref><ref type="bibr">38</ref> A similar role is played by the accessory protein GreB in bacterial cells, <ref type="bibr">28</ref> though not all bacterial species have its homologues. <ref type="bibr">39</ref> A comprehensive understanding of the roles of pauses, backtracking, and endonucleolytic cleavage is critically important for elucidating the molecular mechanisms of KPR for transcription. For this purpose, several kinetic models for RNAP transcription based on the sequence-dependent motion of RNAP and the experimentally determined standard free energies of the underlying steps have been formulated. <ref type="bibr">[40]</ref><ref type="bibr">[41]</ref><ref type="bibr">[42]</ref><ref type="bibr">[43]</ref> One of the models primarily focused on the initial substrate selection steps and, therefore, included only one proofreading checkpoint. <ref type="bibr">42</ref> Another recently developed model considered the second step of proofreading but did not include the pauses at that step, <ref type="bibr">43</ref> contrary to the experimental observations. <ref type="bibr">34</ref> Moreover, none of the existing models take cleavage-factorassisted proofreading into account, which limits their applications for in vivo systems. Thus, a complete understanding of how evolution optimizes the speed and accuracy of RNAP transcription is still lacking.</p><p>In this work, we address several key questions about the microscopic details of proofreading by RNAP. Which kinetic steps affect the speed and accuracy of the transcription, and are there trade-offs between these for all kinetic parameters? How does pausing after misincorporation affect transcriptional accuracy and speed? How much do the cleavage factors contribute to the transcription accuracy and speed? To answer these questions, we constructed a kinetic model that takes into account the most relevant features and applied the forward master equation formalism to analytically compute the error, speed, and the steady-state probabilities of distinct elongation states. We varied the rate constants of the underlying chemical reactions around their experimentally determined values to observe their effect on speed and accuracy. Furthermore, to understand the importance of backtracking and dinucleotide cleavage, we compared the predictions of our model with those of an alternative KPR model with no backtracking and singleor triple-nucleotide cleavage. The results allow us to formulate evolutionary design principles of the transcriptional process, i.e., uncover how it was optimized for the balance of speed and accuracy.</p><p>A more detailed description of RNAP transcription along the DNA template in the RNA synthesis is presented in Figure <ref type="figure">S1A</ref>. The process involves the reversible phosphodiester bond formation step and the proofreading step comprising backtracking and dinucleotide cleavage. In the presence of noncognate nucleotides, each phosphodiester bond formation can lead to two outcomes. The rate of each addition depends on whether the nucleotide that binds the elongation complex is cognate or noncognate to the current residue in the DNA template. <ref type="bibr">17,</ref><ref type="bibr">44</ref> Moreover, if the noncognate nucleotide is incorporated at the last or second-to-last steps of elongation, RNAP pauses; i.e., the subsequent phosphodiester bond formation step is slower. In the same manner, the rate of dinucleotide cleavage also depends on the nature (i.e., cognate or noncognate) of the last two incorporated nucleotides. <ref type="bibr">27,</ref><ref type="bibr">34</ref> Thus, to develop a kinetic model for transcription elongation, we have to explicitly account for the branching at every step. However, as explained in the Supporting Information, with periodic boundary conditions and with the assumption that noncognate incorporations are rare, one can describe the process with a simplified kinetic scheme with only six different elongation states as shown in Figure <ref type="figure">1</ref>. Although the proposed model in Figure <ref type="figure">1</ref> is an approximate model of the true biochemical network of states, it is composed of the most relevant steps on the enzymatic pathways for which the rate constants have been determined experimentally. The rates corresponding to each reaction in the model are listed in Table <ref type="table">S1</ref>. We relate the rate constants k &#177;i,W to the rate constants k &#177;i,R by introducing the discrimination factor</p><p>2, 3, p. Factors f i provide the kinetic discrimination between the R and W pathways. For example, pausing is characterized by discrimination factor f 2 . It distinguishes the rate of extension after a mismatch k 2,W , with the match extension rate k 2,R . <ref type="bibr">34</ref> A smaller value of f 2 implies a smaller mismatch extension rate and thus a longer duration of pauses after the addition of the wrong nucleotide.</p><p>To compute the transcriptional characteristic properties such as speed and error, we use the forward master equation formalism. It allows us to exactly compute the steady-state probabilities of the six elongation states (see the Supporting Information for the mathematical derivations). Notably, the steady-state probabilities of E, ER, and ERR are practically the same. Similarly, states EW and ERW have the same steadystate probability, as both represent the state with the first-time encounter with the wrong nucleotide. These steady-state probabilities can be used to obtain the stationary fluxes for each transition. We define the error rate &#951; = J W /J R , i.e., as the ratio of stationary fluxes to create the wrong and right products (J W and J R , respectively). The RNAP speed (V) is defined as the sum of the stationary fluxes forming the final product states; i.e., V = J R + J W &#8776; J R , as J W &#8810; J R .</p><p>We now explore how different features of the proofreading mechanism affect the speed-accuracy trade-off and the transcriptional dynamics. We analyze the error-speed tradeoff plots (see, e.g., Figure <ref type="figure">2D</ref>), generated by varying a single rate constant k i,R or discrimination factor f i to see how evolution changes the speed and error while keeping all of the other kinetic parameters fixed at their experimentally determined values (see Table <ref type="table">S1</ref>). We say that there is a trade-off between accuracy and speed if an increase in either speed or accuracy corresponds to a decrease in the other.</p><p>Speedy transcription elongation is important to ensure the timely synthesis of RNA and rapid changes in gene expression profiles. In our model, this elongation rate is controlled by the kinetic parameters of different steps (Figure <ref type="figure">1</ref>). To understand which steps are the most relevant in affecting the speed locally (i.e., around the native parameter values), we performed the sensitivity analysis of speed with respect to all independent kinetic parameters. The results (Figure <ref type="figure">2A</ref>) indicate that the reaction speed is most sensitive to changes in the native phosphodiester bond formation rate, k 1,R . Other parameters, such as cleavage rate k 3,R , can also affect the speed but to a much lesser degree. To understand the effect more globally, we compute the speed with k 1,R varying over 2 orders of magnitude (Figure <ref type="figure">2B</ref>). The results imply that the speed of the system can monotonically increase with the phosphodiester bond formation rate. Notably, the experimentally estimated k 1,R value corresponds to an intermediate value of speed. To determine why the transcription elongation does not occur at a higher phosphodiester bond formation rate, we analyzed how the error rate is affected by changes in this rate. The results depicted in Figure <ref type="figure">2C</ref> demonstrate that increasing the polymerization rate also monotonically increases the error. Thus, a change in the phosphodiester bond formation rate can accelerate the transcription at the cost of a higher error or can increase the transcriptional accuracy at a cost of slower elongation. This trade-off is visualized in Figure <ref type="figure">2D</ref>. Therefore, the naturally selected value of this rate seems to be determined by these competing considerations, leading to suboptimal speed and error rates.</p><p>Given the large effect of the phosphodiester bond formation rate on speed, it is important to understand why this rate is occasionally reduced during transcription elongation. This sporadic deceleration can be classified into two categories: sequence-dependent and sequence-independent pauses. Sequence-dependent pauses occur due to RNA hairpin formation <ref type="bibr">45</ref> with a frequency of &#8804;1 pause per 100 nucleotides. <ref type="bibr">46</ref> These pauses are generally short and are not associated with backtracking leading to oligonucleotide cleavage. <ref type="bibr">47</ref> On the contrary, sequence-independent pauses are randomly distributed and occur once per 1000 nucleotides, which is a rate equivalent to the nucleotide misincorporation rate. These pauses are associated with backtracking leading to the cleavage of recently misincorporated nucleotides. <ref type="bibr">28</ref> In our model, we incorporate sequence-independent pauses only after a misincorporation because of their role in proofreading. <ref type="bibr">48</ref> For the pauses caused by misincorporation, the reduction of the rate after mismatch is described by discrimination factor f 2 . Therefore, we analyze the effects of f 2 on the speed-accuracy trade-off curve. The increase in this parameter (f 2 &#8594; 1) would accelerate phosphodiester bond formation after the misincorporation eliminating the pauses. The results depicted in Figure <ref type="figure">3A</ref> show that with these changes both the error and the speed monotonically increase. However, the magnitude of the increase in the speed is quite low compared to changes in the error: while the error corresponding to the native system with pauses (green circle) is approximately 1 order of magnitude  The green color represents the native system with pausing after incorporation. As f 2 decreases, the duration of pauses decreases. The red circle denotes the system with no pausing after misincorporation, i.e., f 2 = 1. (B) Error-speed (&#951;-V) interplay for the dinucleotide cleavage step. The native system with intrinsic proofreading (green circle) lies on the non-trade-off branch (solid line) of the &#951;-V curve. To the left of the optimal (maximum) speed lies the trade-off branch (dotted line). The pink triangle denotes the system in the absence of cleavage (k 3,R &#8594; 0). The purple circle represents the system for E. coli bacteria with GreA increasing the native intrinsic value of k 3,R by 25 times, while the blue circle characterizes the system for eukaryotes in the presence of the accessory protein TFIIS that increases the intrinsic cleavage rate by only 100 times.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>The Journal of Physical Chemistry Letters pubs.acs.org/JPCL</head><p>Letter less than that of the system without pauses (red circle), the speed changes by &lt;20%. Thus, the pauses before the cleavage improve the accuracy significantly with only a minor effect on the speed. Our theoretical analysis also suggests (Figure <ref type="figure">2A</ref>) that the speed is also moderately sensitive to changes in cleavage rate k 3,R . Therefore, we investigated how the changes in k 3,R affect speed and accuracy. Interestingly, the results depicted in Figure <ref type="figure">3B</ref> show that there is no trade-off between accuracy and speed with changes in the cleavage rate. With an increase in k 3,R , the error decreases and the speed increases. In other words, the native system with intrinsic proofreading (green circle) lies on the non-trade-off curve in Figure <ref type="figure">3B</ref>. The complete elimination of dinucleotide cleavage [that is, k 3,R = 0 (pink triangle)] will result in an error that is &#8764;1 order of magnitude higher and a speed that is &#8764;80% slower than those of the native case. This can be understood from the fact that the increase in the rate of cleavage decreases the relative time the enzyme spends in the wrong product states: EW, ERW, and EWR. This is evident from the decreased return of their corresponding probabilities P W = P ERW and P EWR , as shown in Figure <ref type="figure">S4</ref>. Thus, higher rates of cleavage increase the speed by releasing the paused enzyme from states EW, ERW, and EWR and improve the accuracy by reducing the probability of reaching the final wrong product state. However, a further increase in the cleavage rate can eventually lead to a decrease in the speed as the probability of futile cleavage of the correct nucleotide becomes significant (trade-off branch in Figure <ref type="figure">3B</ref>, dashed line to the left of the blue circle). Therefore, there is a value of the cleavage rate that optimizes the speed (blue circle in Figure <ref type="figure">3B</ref>). Interestingly, the native value corresponding to the intrinsic proofreading is significantly lower than this optimal value, resulting in a higher error and lower speed. Therefore, the mechanisms for further accelerating dinucleotide cleavage are expected to be beneficial.</p><p>Both bacterial and eukaryotic cells produce accessory proteins such as GreA in E. coli and TFIIS in eukaryotes, which bind to the polymerase during transcription and stimulate intrinsic cleavage. In E. coli, GreA is known to increase the cleavage efficiency of the RNAP enzyme by 25-35-fold. <ref type="bibr">49</ref> The purple circle in Figure <ref type="figure">3B</ref> represents the error and speed corresponding to the 25-fold increase in k 3,R . This assisted proofreading not only decreases the error by 2 orders of magnitude but also increases the speed by approximately 24%. <ref type="bibr">50,</ref><ref type="bibr">51</ref> This increased speed is very close to the maximal value (blue circle). For eukaryotes, the protein factor TFIIS increases the cleavage efficiency by almost 100-fold (blue circle, Figure <ref type="figure">3B</ref>). If that were the only effect of TFIIS, we would expect a further order of magnitude decrease in error and the optimal speed to be achieved. However, TFIIS not only increases the cleavage efficiency but also decreases the duration of pauses. <ref type="bibr">37,</ref><ref type="bibr">38</ref> To analyze the complete effect of TFIIS on the speed and accuracy of transcription, we need to understand the interplay between the two roles of these protein factors: reduction of pause duration and acceleration of cleavage. However, the magnitude of its effect on the duration of pauses following misincorporation has not been quantified. For the sake of simplicity, we assume that it decreases the duration of the pauses by the same magnitude as it increases the rate of cleavage. To model that, we introduce a stimulating factor &#945;, by which both the cleavage rate and the mismatch extension rate are increased, and investigate the effect of these changes on speed and accuracy. Figure <ref type="figure">4A</ref> shows that increasing stimulating factor &#945; does not significantly change the error. This is in contrast to the results in which these parameters are varied individually. When only the mismatch extension rate is increased, the error monotonically increases (Figure <ref type="figure">3A</ref>). When the cleavage rate is increasing, the error significantly decreases (Figure <ref type="figure">3B</ref>). Thus, two opposite effects of the mismatch extension rate and cleavage rate on error cancel one another when these rates are increased simultaneously. The error-speed curve indicates that while a 100-fold increase in these rates only slightly increases the error (blue circle, Figure <ref type="figure">4A</ref>), it brings the speed to a nearly optimal value (blue circle, Figure <ref type="figure">4B</ref>). Thus, we conclude that the primary role of TFIIS is the improvement of speed rather than accuracy.</p><p>Unlike the KPR mechanism for other biological processes, such as DNA replication, which involves the cleavage of a single misincorporated nucleotide, RNAP backtracks and cleaves two or more nucleotides at a time. However, the reason for this feature of the error correction mechanism by RNAP is still unknown. To investigate this, we analyzed how transcription accuracy and speed are affected if RNAP does not backtrack and cleaves only a single nucleotide. To this end, we developed a KPR scheme, as shown in Figure <ref type="figure">S2</ref>, consisting of the pausing but no backtracking and hence only singlenucleotide cleavage with the rates chosen to match with the corresponding rates for the model in Figure <ref type="figure">1</ref>, given in Table <ref type="table">S2</ref>. Two versions of the model without backtracking are analyzed. The first (Figure <ref type="figure">S2A</ref>) includes pauses after the first and second checkpoints, i.e., at both EW/ERW and EWR states. The second model (Figure <ref type="figure">S2B</ref>) includes pausing only at the first checkpoint, i.e., only after the most recent misincorporation (EW/ERW state). We computed the error and the speed for both models using the same methodology (see the Supporting Information). We first analyzed the model in the absence of accessory proteins. Interestingly, we found that, for the model with pausing at both checkpoints, error rate &#951; (=1.8 &#215; 10 -4 ) is &#8764;3.3 times higher than the error for the system with backtracking and dinucleotide cleavage [4.2 &#215; 10 -5 (green circle in Figure <ref type="figure">1</ref>)]. Moreover, the speed [V = 22 nucleotides (nts)/s] is approximately 10% smaller than the speed computed for the model in Figure <ref type="figure">1</ref> (V &#8776; 25 nts/s). Similarly, for the second model, the error increased to 2 &#215; 10 -4 , which is &#8764;3.8 times higher than that of the dinucleotide model. However, as the pausing at the second cleavage step is missing, it increases the speed to 26 nts/s. This speed is just 4% more than the speed for the dinucleotide cleavage model with pausing at both checkpoints (see Figure <ref type="figure">5</ref>). Moreover, the The Journal of Physical Chemistry Letters pubs.acs.org/JPCL Letter results hold even in the presence of TFIIS, which simultaneously affects the duration of pause and the intrinsic single-nucleotide cleavage by 100-fold. In that case, the error for the single-nucleotide cleavage model is still &#8764;3.5-fold larger than the error for the dinucleotide cleavage model, while the speeds for all models are almost the same.</p><p>The reason for the larger error for the models with no backtracking is the small chance of the removal of a misincorporated nucleotide from the second checkpoint EWR as compared to the dinucleotide cleavage model. This is evident from the steady-state probabilities of state EWR (P EWR &#8776; 0.14) for the single-nucleotide cleavage model (Figure <ref type="figure">S2A</ref>), which is approximately 3-fold more than the P EWR of &#8776;0.036 for the dinucleotide cleavage model (see Figure <ref type="figure">S3</ref>). Also, the probability of state E (the same as ER and ERR) (P E = P ER = P ERR &#8776; 0.23) for the single-nucleotide cleavage model is 11% less than the corresponding probabilities (P E = P ER = P ERR &#8776; 0.26) for the dinucleotide cleavage model (see Figure <ref type="figure">S3</ref>). Thus, the misincorporated nucleotide spends more time in state EWR and further tends to form a wrong product instead of being cleaved back to EW and then to E, hence decreasing the speed. Therefore, we conclude that for better accuracy and faster speed, backtracking and cleavage of dinucleotides are much better than single-nucleotide cleavage.</p><p>To further analyze if the cleavage of two nucleotides at once is optimal during transcription elongation, we constructed a KPR model with trinucleotide cleavage (Figure <ref type="figure">S5</ref>) with the corresponding kinetic parameters shown in Table <ref type="table">S3</ref>. We observed that, in the absence of accessory proteins, and with pausing present at both checkpoints, the error for the system is 1.7 &#215; 10 -4 , which is 3-fold greater than that of the dinucleotide cleavage model. The speed for the trinucleotide cleavage model (V &#8776; 13 nts/s) is &#8764;93% less than that of the dinucleotide cleavage model. However, if we take pausing with the three neighbors into account, the error becomes 4.9 &#215; 10 -5 , which is approximately 16% more than that of the dinucleotide cleavage model. However, the speed is still V &#8776; 13 nts/s, which is 93% more than that of the dinucleotide cleavage model. This analysis suggests that the cleavage of two nucleotides at once during the transcription elongation proofreading mechanism is optimal for improving the transcriptional speed and accuracy.</p><p>The speed and accuracy of transcription are very important. The speed of transcription can directly contribute to the fitness of organisms; 52-54 e.g., in bacterial ribosome synthesis, the transcription of rRNA is the rate-limiting step thereby affects the cell growth rate. <ref type="bibr">55</ref> With respect to accuracy, transcriptional mistakes can be costly if mistranscribed RNA cannot perform its function or encodes a wrong amino acid. For mRNA, additional mistakes can be introduced during translation, so the error rate for these steps (&#8764;3 &#215; 10 -4,56,57 ) can be used as an estimate of what is physiologically acceptable. To assess how different unique features of the RNAP proofreading mechanism affect speed and accuracy, we constructed a kinetic proofreading model that can account for the dinucleotide cleavage and the existence of the two checkpoints for proofreading. At each checkpoint state, the error correction depends on the competition between the extension rate and the cleavage rate. At the first checkpoint state following the attachment of the incorrect (W) nucleotide, dinucleotide "RW" is removed. If that does not happen, there is a second chance to remove the wrong nucleotide after another extension step with the cleavage of dinucleotide "WR". We aimed to keep the model quite generic so that it could be applied to both eukaryotic and prokaryotic transcription but nevertheless used realistic parameter estimates from the literature for the parameter values. The results indicate that pausing, backtracking, and cleavage assisted by accessory proteins allow for maximal transcriptional speed with tolerable accuracy (Figure <ref type="figure">5</ref>).</p><p>In the model, RNAP pauses after a misincorporation event. <ref type="bibr">28</ref> These pauses are sequence-independent and associated with backtracking leading to dinucleotide cleavage. Therefore, these pauses are expected to affect the error of the proofreading mechanism unlike the more frequent but shorter sequence-dependent pauses. <ref type="bibr">46,</ref><ref type="bibr">47</ref> Our findings suggest that pauses following each checkpoint step are important for achieving high transcriptional accuracy. This allows the dinucleotide cleavage rate to effectively compete with the extension rate, but only when the wrong substrate is incorporated in one of the last two steps. Elimination of pauses increases the transcription speed but leads to a severalfold increase in the error. Complete elimination of proofreading (i.e., no pausing and cleavage) would lead to error and speed values very similar to those of the case with no pauses. These results indicate the essential role of pauses in the intrinsic proofreading mechanism. On the contrary, the presence of pausing in the absence of cleavage would only slow transcription without improving the accuracy. Furthermore, with only the first step of proofreading, the error decreases almost 3-fold with an &#8776;20% decrease in speed. The second step of proofreading further reduces the error by 5-fold at the cost of an only &#8776;3% loss of speed. This is in contrast to the result found in ref 43, where the contribution of the second step of proofreading in enhancing accuracy is quite low. The reason for the discrepancy is that the authors of ref 43 did not account for the pauses during the second checkpoint step. These pauses have been experimentally observed <ref type="bibr">28,</ref><ref type="bibr">34</ref> and are essential to ensure that dinucleotide cleavage can even occur before the next elongation step.</p><p>To further understand the evolutionary selection of the kinetic parameters, we varied these parameters around their estimated values to evaluate their effects on the speed- The Journal of Physical Chemistry Letters pubs.acs.org/JPCL Letter accuracy trade-off behavior. The results showed that the tradeoff between speed and accuracy is not universal, and its occurrence depends on the specific values of the kinetic rates.</p><p>Observing that the transcriptional speed is most sensitive to the variation of the phosphodiester bond formation rate (see Figure <ref type="figure">2A</ref>), we varied this rate around its native value and found that a higher speed can be achieved by increasing the phosphodiester bond formation rate at the cost of a higher error rate. Similarly, a higher accuracy can be achieved by decreasing the phosphodiester formation rate only at the price of a lower speed. Motivated by a study that found that the absence of the Rpb9 subunit in RNAP II could disrupt the fidelity of the transcription process by increasing the mismatch extension rate by 2-3-fold, <ref type="bibr">34</ref> we analyzed the speed-accuracy trade-off behavior for the mismatch extension step. Again, a trade-off was found. Increasing the mismatch extension rate (i.e., reducing the duration of pauses) significantly increases the error of the system at the cost of a slightly lower speed. On the contrary, for the variation of the dinucleotide cleavage rate, there is a partitioning of the error-speed curve into trade-off and non-trade-off branches. Interestingly, near the experimentally determined value of the dinucleotide cleavage rate, the speed-accuracy curve exhibits non-trade-off behavior. Indeed, an optimal speed with a simultaneously higher accuracy can be observed for the significantly increased cleavage rate. A similar effect of increasing the cleavage has also been reported in ref 24.</p><p>An increase in the cleavage rate can be achieved with the help of cleavage-stimulating proteins such as GreA in E. coli bacteria and TFIIS in eukaryotes as well as subunit Rpb9 of RNAP II. <ref type="bibr">35</ref> These factors yield a nearly optimal transcription speed (Figure <ref type="figure">3B</ref>). Even when the acceleration of cleavage is accompanied by the removal of pauses, as is the case with TFIIS, the system still obtains the maximum possible speed but the error is almost the same as that with only intrinsic proofreading (Figure <ref type="figure">4</ref>). This is consistent with the experimental finding that TFIIS is necessary in vitro to enhance the transcription elongation rate. <ref type="bibr">15,</ref><ref type="bibr">21,</ref><ref type="bibr">58</ref> Moreover, these accessory proteins do not change the qualitative trade-off behavior seen for the phosphodiester bond formation rate (see Figure <ref type="figure">S6</ref>). Thus, it seems that evolution has optimized the kinetic parameters involved in the transcription by RNA polymerase to achieve the maximum possible speed with the desired tolerable accuracy.</p><p>It is known that the proofreading step requires the consumption of free energy through the hydrolysis of energy-rich nucleotide triphosphate (NTP) molecules and dissipates energy through futile cycles. <ref type="bibr">7,</ref><ref type="bibr">59,</ref><ref type="bibr">60</ref> To avoid an unnecessary waste of energy, the biological systems aim to keep the amount of energy dissipated small. <ref type="bibr">61</ref> For example, for translation during the tRNA charging step and selection step during protein synthesis by an E. coli ribosome, a notable increase in the expenditure of energy from proofreading prevented evolution from completely optimizing the speed and accuracy of these processes. <ref type="bibr">7,</ref><ref type="bibr">8,</ref><ref type="bibr">10</ref> In both of these examples, the proofreading cost was significant with 3-13% of the nucleotide hydrolysis not associated with product formation. <ref type="bibr">7,</ref><ref type="bibr">10</ref> Given the large cost, the system avoided increasing the proofreading cost to further increase the speed. <ref type="bibr">62</ref> This raises the question of why the transcription process achieves its optimal speed via cleavage-factor-assisted proofreading if it could increase the proofreading dissipation energy. To understand this, we computed the proofreading cost for transcription and showed that its value is 2.7 &#215; 10 -4 for intrinsic proofreading and 1.3 &#215; 10 -3 for the cleavage-assisted case. TFIIS-assisted cleavage increases the speed by 26% at the proofreading cost of &#8764;1 order of magnitude. This implies that RNAP II could bear this proofreading cost for a significantly large increase in speed.</p><p>All experimental studies for proofreading by RNAP during transcription elongation have claimed the cleavage of at least two nucleotides. The studies never observed the cleavage of a single nucleotide. To understand why RNAP does not cleave a single nucleotide, unlike in other proofreading processes by different enzymes, <ref type="bibr">63</ref> we also constructed a kinetic proofreading model with single-nucleotide cleavage. It was found that the error for the single-nucleotide cleavage model with pausing having either a one-neighbor effect or a two-neighbor effect is always larger (&#8764;3.8-fold) than in our proposed system with the dinucleotide cleavage. This high error for the single-nucleotide cleavage model remains unaffected in the presence of accessory proteins, affecting both the duration of the pause and cleavage rate. However, the speed for the single-nucleotide cleavage model is less than the speed for our proposed model in the absence of accessory proteins, and when they are present, all models operate at an optimal speed. This implies that the requirement to achieve high accuracy prevents RNAP from cleaving single nucleotides.</p><p>In summary, our study presents a coherent understanding of why RNAP pauses, backtracks, and cleaves dinucleotides based on how these processes affect accuracy and speed. Our theoretical analysis shows how the ultimate balance between speed and accuracy is achieved by adjusting various kinetic rates and through the effects of accessory proteins. It remains to be seen how these parameters vary among different organisms to fine-tune the speed-accuracy trade-off to the specific demands.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#9632; ASSOCIATED CONTENT</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>* s&#305; Supporting Information</head><p>The Supporting Information is available free of charge at <ref type="url">https://pubs.acs.org/doi/10.1021/acs.jpclett.3c00345</ref>.</p><p>Detailed description of the KPR model for transcription elongation by RNAP and quasi-equilibrium approach to eliminate intermediate steps; experimentally determined kinetic rates and discrimination factors; forward master equation formalism for computing steady-state probabilities for the dinucleotide cleavage model with backtracking and the single-nucleotide cleavage model without backtracking; definitions of error rate &#951;, speed V, and proofreading cost C in terms of stationary fluxes; steady-state probabilities and characteristic properties for the KPR models computed with the native experimental kinetic parameters; sensitivity of the speed to various kinetic parameters of the model; and experimental data for the error and speed of mRNA transcription by RNA polymerase (PDF)</p><p>&#9632; AUTHOR <ref type="bibr">INFORMATION</ref> </p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>https://doi.org/10.1021/acs.jpclett.3c00345 J. Phys. Chem. Lett. 2023, 14, 3422-3429 Downloaded via RICE UNIV on April 11, 2023 at 13:13:46 (UTC).See https://pubs.acs.org/sharingguidelines for options on how to legitimately share published articles.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1"><p>https://doi.org/10.1021/acs.jpclett.3c00345 J. Phys. Chem. Lett. 2023, 14, 3422-3429</p></note>
		</body>
		</text>
</TEI>
