<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Unveiling the RNA-mediated allosteric activation discloses functional hotspots in CRISPR-Cas13a</title></titleStmt>
			<publicationStmt>
				<publisher>Oxford Journals</publisher>
				<date>11/30/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10534516</idno>
					<idno type="doi">10.1093/nar/gkad1127</idno>
					<title level='j'>Nucleic Acids Research</title>
<idno>0305-1048</idno>
<biblScope unit="volume">52</biblScope>
<biblScope unit="issue">2</biblScope>					

					<author>Souvik Sinha</author><author>AdrianM MolinaVargas</author><author>PabloR Arantes</author><author>Amun Patel</author><author>MitchellR O’Connell</author><author>Giulia Palermo</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>Abstract</title> <p>Cas13a is a recent addition to the CRISPR-Cas toolkit that exclusively targets RNA, which makes it a promising tool for RNA detection. It utilizes a CRISPR RNA (crRNA) to target RNA sequences and trigger a composite active site formed by two ‘Higher Eukaryotes and Prokaryotes Nucleotide’ (HEPN) domains, cleaving any solvent-exposed RNA. In this system, an intriguing form of allosteric communication controls the RNA cleavage activity, yet its molecular details are unknown. Here, multiple-microsecond molecular dynamics simulations are combined with graph theory to decipher this intricate activation mechanism. We show that the binding of a target RNA acts as an allosteric effector, by amplifying the communication signals over the dynamical noise through interactions of the crRNA at the buried HEPN1-2 interface. By introducing a novel Signal-to-Noise Ratio (SNR) of communication efficiency, we reveal critical allosteric residues—R377,N378, andR973—that rearrange their interactions upon target RNA binding. Alanine mutation of these residues is shown to select target RNA over an extended complementary sequence beyond guide-target duplex for RNA cleavage, establishing the functional significance of these hotspots. Collectively our findings offer a fundamental understanding of the Cas13a mechanism of action and pave new avenues for the development of highly selective RNA-based cleavage and detection tools.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>CRISPR ( Clustered Regularly Interspaced Short Palindromic Repeats ) and their associated ( Cas ) proteins are RNA-guided prokaryotic adaptive immune systems that protect bacteria against invading genetic elements ( 1 ) . The transformative use of CRISPR-Cas9 for genome editing ( 2 ) led to the 8boom9 of the 8CRISPR oeld9 and the discovery of novel CRISPR-Cas systems that remarkably expand applications in genome editing and beyond <ref type="bibr">(3)</ref><ref type="bibr">(4)</ref><ref type="bibr">(5)</ref> . Cas13a ( formerly known as C2c2 ) is a recently discovered CRISPR-associated protein that tar-gets RNA <ref type="bibr">( 6 ,7 )</ref> , a property that is powerful for RNA detection, regulation, and imaging <ref type="bibr">(8)</ref><ref type="bibr">(9)</ref><ref type="bibr">(10)</ref><ref type="bibr">(11)</ref><ref type="bibr">(12)</ref><ref type="bibr">(13)</ref> . The Cas13a effector protein complexes with a CRISPR RNA ( crRNA ) sequence, which is used as a guide to form the RNA-targeting interference complex <ref type="bibr">( 6 , 7 , 14 )</ref> . The latter can cleave singlestranded RNA ( ssRNA ) sequences using two Higher Eukaryotes and Prokaryotes Nucleotide ( HEPN ) catalytic domains <ref type="bibr">( 7 ,15 )</ref> , which are found in RNA targeting enzymes <ref type="bibr">( 16 )</ref> . Upon activation of the HEPN domains, Cas13a degrades its target RNAs through cis cleavage and other solvent-exposed ssR-NAs through a non-specioc trans cleavage activity <ref type="bibr">( 6 ,7 )</ref> . This is a potent 8collateral damage9 that has enabled the development of ultrasensitive RNA detection tools like SHERLOCK <ref type="bibr">( 13 )</ref> , SPRINT <ref type="bibr">( 12 )</ref> , direct S AR S-CoV-2 RNA detection assays <ref type="bibr">( 17 )</ref> and many others.</p><p>The biophysical function of the Cas13a effector is characterized by intricate allosteric signalling, whose molecular details are highly unclear <ref type="bibr">( 18 )</ref> . Structures of Cas13a from Leptotrichia buccalis ( Lbu ) reveal a bilobed architecture, comprising a 8REC9 &#945;-helical lobe that recognizes the crRNA and a 8NUC9 lobe consisting of the catalytic HEPN1-2 domains and a helical Linker ( Figure <ref type="figure">1</ref> ) <ref type="bibr">( 19 )</ref> . Biochemical studies have shown that the binding of a complementary target RNA ( tgRNA ) to the REC lobe allosterically activates the HEPN domains <ref type="bibr">( 18 )</ref> , which are spatially distant, to form a composite active site for RNA cleavage <ref type="bibr">( 7 ,15 )</ref> . However, the mechanism of this activation and information transfer from the tgRNA binding site to the HEPN1-2 catalytic cleft is poorly understood.</p><p>Interestingly, the crRNA spacer-i.e. the 28 nucleotides ( nt. ) sequence that guides binding of the tgRNA-was shown to hold a critical role in the onset of the allosteric response <ref type="bibr">( 18 )</ref> . The spacer contains a 8seed9 region ( nt. <ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref> , where perfect base pairing is required for tgRNA binding, and a 8switch9 region ( nt. 5-8, Figure <ref type="figure">1 D</ref> ) , whose binding of the tgRNA has been suggested to induce the activation of the catalytic cleft <ref type="bibr">( 18 )</ref> . Nevertheless, the 8seed9 and 8switch9 regions are approximately located 34 &#197; and 41 &#197; from the catalytic core respectively, puzzling the allosteric response of these regions toward activation.</p><p>Functional studies, both in-vitro and in-vivo , have also shown that extending the duplex complementarity at the 5 nank of the crRNA spacer, impacts the RNA degrading ability of Cas13a <ref type="bibr">( 20 )</ref> . Such extended pairing beyond the guidetarget duplex is called tag ( segment from the crRNA ) -antitag ( segment from the tgRNA ) pairing ( blue segment in Figure <ref type="figure">1 D</ref> ) . This dependence on the complementarity length was proposed as a strategy to discriminate between self-and nonself-targets, which is key for Cas13-based technologies <ref type="bibr">( 21 )</ref> . This suggests the allosteric response is regulated by the degree of complementarity between the guide crRNA ( also referred to as a guide-RNA ) and tgRNA.</p><p>In light of this knowledge, understanding the allosteric regulation of Cas13a activation and the role of RNA is crucial to improving the RNA targeting speciocity of Cas13a, and engineering its controlled function to discriminate non-selftargets.</p><p>Here, we used extensive molecular dynamics ( MD ) simulations, reaching a conformational sampling of &#8764;170 &#181;s, and conducted thorough graph theory analysis, providing a comprehensive RNA-mediated allosteric mechanism of the Cas13a protein. Most signiocantly, our computational investigation discloses critical residues whose mutation in ala-nine is shown experimentally to select tgRNA over an extended tag-anti-tag complementarity, for RNA cleavages. By introducing a novel graph theory-based analysis-signal-tonoise ratio ( SNR ) of communication efociency -we establish the critical hotspots for Cas13a activation, showing that our computational approach can guide the development of more selective RNA cleavage and detection strategies. These ondings offer a fundamental understanding of the Cas13a mechanism of action and pave new avenues for the development of more selective RNA-targeting CRISPR-Cas13a systems.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Materials and methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Structural models</head><p>Molecular simulations were based on the structure of the Leptotrichia buccalis ( Lbu ) Cas13a bound to a crRNA ( PDB: 5XWY, at 3.2 &#197; resolution ( <ref type="formula">15</ref>) ) , obtained via cryo-EM, and on the structure of the LbuCas13a in complex with a crRNA and tgRNA ( PDB: 5XWP ) , obtained by single-wavelength anomalous diffraction at 3.08 &#197; resolution <ref type="bibr">( 15 )</ref> . The Lbu-Cas13a bound to an extended tag-anti-tag RNA ( atgRNA ) was built by including a longer duplex with eight base-pair extended atgRNA, obtained from the cryo-EM structure of the Leptotrichia shahii Cas13a bound to atgRNA ( PDB: 7DMQ, at 3.06 &#197; resolution ) <ref type="bibr">( 20 )</ref> . In all systems, we reinstated the catalytic H1053 and R1048 in the HEPN domains, which were mutated in alanine in the experimental structures <ref type="bibr">( 15 )</ref> . The protonation states of histidine residues have been computed using the H++ software <ref type="bibr">( 22 )</ref> reporting singly protonated neutral states ( protonated on the &#949; position ) . This allows histidine residues to act as a base within the HEPN1-2 cleft, as recently shown for a type III CRISPR-Cas system holding the HEPN1-2 RNase activity <ref type="bibr">( 23 )</ref> . The systems were solvated, leading to simulation cells of &#8764;138 &#215; 97 &#215; 130 &#197; 3 , and neutralized by the addition of an adequate number of Na + ions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Molecular dynamics simulations</head><p>Molecular dynamics ( MD ) simulations were performed using a simulation protocol tailored for protein / nucleic acid complexes ( detailed in Supplementary Text ) . We employed the Amber ff19SB <ref type="bibr">( 24 )</ref> force oeld, including the &#967;OL3 corrections for RNA <ref type="bibr">( 25 ,26 )</ref> . The TIP3P model was used for explicit water molecules <ref type="bibr">( 27 )</ref> . Production runs were carried out in the NVT ensemble, using an integration time step of 2 fs. For each system, i.e. Cas13a bound to a crRNA ( crRNA-Cas13a ) , in complex with a crRNA and tgRNA ( tgRNA-Cas13a ) and bound to an extended tag-anti-tag RNA ( atgRNA-Cas13a ) , we performed &#8764;5 &#181;s of MD simulations in three replicates. We also carried out a &#8764;5 &#181;s long simulation of a tgRNA-bound complex substituting the A ( -3 ) base of the crRNA with cytosine C ( -3 ) . Subsequently, we considered four variants ( R377A, N378A, R963A, and R973A ) of Cas13a bound to a tgRNA, including an atgRNA, similar to the wild-type ( WT ) Cas13a complexes. These systems were also simulated for &#8764;5 &#181;s in three replicates. Overall, we accumulated &#8764;170 &#181;s of total sampling. All simulations were performed using the GPUempowered version of the AMBER 20 code <ref type="bibr">( 28 )</ref> . Analyses were performed over the aggregated multi&#181;s sampling collected for each complex ( i.e. &#8764;15 &#181;s ) . This was motivated by our previous studies of allostery in the CRISPR-Cas systems <ref type="bibr">(29)</ref><ref type="bibr">(30)</ref><ref type="bibr">(31)</ref><ref type="bibr">(32)</ref><ref type="bibr">(33)</ref> , showing that an aggregated multi&#181;s sampling offers a robust ensemble for the purposes of our analysis ( described below and in Supplementary Text ) .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Analysis of Jensen-Shannon distances</head><p>To characterize the difference in the conformational dynamics of the HEPN domains, we analysed the distributions of all intra-backbone dihedrals ( BB torsions ) and backbone C &#945; distances ( BB distances ) in the investigated systems. To compare the distributions of the abovementioned features between any two of our systems, we computed the Jensen-Shannon Distance ( JSD or D JS ) , a symmetrized version of Kullback-Leibler divergence ( D KL ) . <ref type="bibr">( 34 )</ref> The JSD ranges from 0 to 1, where 0 corresponds to two identical distributions and 1 corresponds to a pair of separated distributions. For two distributions P i and P j , and considering a feature x f from two different ensembles i and j ,</p><p>where M = 1 2 ( P i + P j ) . The Kullback-Leibler divergence, D KL , corresponds to two distributions P i and P j is of the following form:</p><p>JSD values were computed using the Python ENSemble Analysis ( PENSA ) open-source library <ref type="bibr">( 35 )</ref> . Kernel density estimations of the JSD values were plotted to describe the JSD range and compare the systems.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Inter-domain correlations</head><p>To evaluate the inter-dependent coupling between the Cas13a domains and the nucleic acids, we computed inter-domain generalized correlation scores <ref type="bibr">( 36 )</ref> . For each protein residue i , a correlation score ( Cs ) parameter can be computed as:</p><p>which sums the atom-based generalized correlations, GC i j , established by residue i with the residues j, based on Shannon9s entropy estimation of mutual information ( details in Supplementary Text ) <ref type="bibr">( 37 )</ref> . Cs are a measure of the number and the intensity of the GC i j coefocients displayed by each residue pair.</p><p>To detail inter-domain correlations, the Cs were accumulated and normalized. First, the Cs were calculated for each residue i belonging to a specioc protein domain ( e.g. HEPN1 ( I ) ) , with the residues j belonging to another protein domain of interest ( e.g. HEPN2 ) . Then, the Cs were accumulated over all residues j of each specioc domain and normalized by the number of coupling residues. This resulted in a set of inter-domain Cs , ranging from 0 ( not-correlated ) to 1 ( correlated ) , measuring the strength of the overall correlation that each domain establishes with the others.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Dynamic network and signal-to-noise ratio</head><p>To characterize the allosteric pathways of communication, network analysis was applied <ref type="bibr">( 38 )</ref> . In dynamical networks, C &#945; atoms of proteins and backbone P atoms of nucleotides, as well as N1 atoms in purines, and N9 in pyrimidines, are represented as nodes, connected by edges weighted by the generalized correlations GC i j according to:</p><p>Details on network analysis are in Supplementary Text. From the dynamical network, we estimated the efociency of crosstalk between the crRNA spacer regions ( i.e. 8seed9 ( nt. 9-14 ) ; 8switch9 ( nt. 5-8 ) ; as well as nt. 1-4; nt. 15-18 and nt. <ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref> and the catalytic residues ( R472, H477, R1048, H1053 ) by introducing a novel Signal-to-Noise Ratio ( SNR ) measure. SNR measures the preference of communication between predeoned distant sites-i.e. the signal-over the remaining pathways in the network-i.e. the noise, estimating how allosteric pathways stand out ( i.e. are favourable ) over the entire communication network.</p><p>For the SNR calculation, we orst computed the optimal ( i.e. the shortest ) and top ove sub-optimal pathways ( with longer lengths, ranked in comparison to the optimal path length ) between all crRNA bases and the Cas13a residues, using well-established algorithms ( vide infra ) . Then, the cumulative betweennesses of each pathway ( S k ) was calculated as the sum of the betweennesses of all the edges in that specioc pathway:</p><p>where b i is the edge betweenness ( i.e. the number of shortest pathways that cross the edge, measuring the 8trafoc9 passing through them ) between node i and i + 1 , and n is the number of edges in the k th pathway. The distribution of S k between the crRNA bases and all protein residues was deoned as the noise, whereas the distribution of S k between the crRNA nucleotide regions of interest ( e.g. 8seed9 ) and the HEPN1-2 catalytic residues were considered as signals. Since S k depends on the number of edges present in the path, to ensure consistency, we characterized the SNR on the basis of shorter ( edge count: 6-8 ) , medium ( edge count: 9-11 ) , and longer ( edge count: 12-14 ) paths ( details in Supplementary Text ) . Notably, while the optimal path corresponds to the most likely mode of communication, suboptimal paths can also be crucial routes for communication transfer <ref type="bibr">( 31 ,38 )</ref> . Hence, in addition to the optimal path, we also considered the top ove sub-optimal pathways for our SNR analysis. Well-established algorithms were employed for shortest-path analysis. The Floyd-Warshall algorithm was utilized to compute the optimal paths between the network nodes. The ove sub-optimal paths were computed in rank from the shortest to the longest, using Yen9s algorithm, which computes single-source K -shortest loop-less paths ( i.e. without repeated nodes ) for a graph with non-negative edge weights <ref type="bibr">( 39 )</ref> . To identify residues important for allosteric communication, we computed the occurrence of each residue appearing in at least one of the pathways ( i.e. optimal and suboptimal ) . This analysis also reports on the conservation of allosteric pathways, as pathways characterized by a lesser number of residues with higher occurrence are likely to be more conserved than those exhibiting a greater number of residues with lower occurrence.</p><p>Finally, the SNR corresponding to signals from each crRNA region ( i.e. 8seed9 ( nt. 9-14 ) ; 8switch9 ( nt. 5-8 ) ; as well as nt 1-4; nt. 15-18 and nt. <ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref> to the HEPN1-2 catalytic core residues ( R472, H477, R1048, H1053 ) was computed as:</p><p>where E(S ) and Var (S ) correspond to the expectation and variance of the signal distribution respectively; and E( N) / Var ( N) are the expectation / variance of the noise distribution. To provide the signiocance of the signal over the noise, we used a general approach based on p-value calculation. Our goal was to test the hypothesis that the signal is an outlier of the noise distribution. We can construct a best-ot probability distribution based on the noise data by treating our variable ( the sum of betweennesses ) as stochastic. Treating the signal as a sample from this population, we can assess the rarity of that sample9s mean when randomly collecting samples of the same size. This rarity is deoned as the p-value of the sample. The p-value is then computed using the Z -score:</p><p>where &#963; [ N] corresponds to the standard deviation of the noise, and n is the number of signals. Here, we found that the best-ot distributions for the noise follow a log-normal distribution. Hence, we took the logarithm of the signals and noise to transform the data points into a normal distribution. From this transformed distribution, we obtained the mean and standard deviations for the calculation of the Z signal . All networks were built using the Dynetan Python library <ref type="bibr">( 38 )</ref> . Path-based analyses were performed using NetworkX Python library <ref type="bibr">( 40 )</ref> and our in-house Python scripts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Cas13a protein expression and puriocation</head><p>For expression of wild-type LbuCas13a we used Addgene Plasmid #83482 <ref type="bibr">( 7 )</ref> . LbuCas13a mutants were cloned from the WT vector via site-directed mutagenesis using the primers indicated in Supplementary Table <ref type="table">S1</ref> . All constructs were purioed as previously described <ref type="bibr">( 7 ,41 )</ref>, with some modiocations. Brieny, expression vectors were transformed into Rosetta2 DE3 grown in LB media supplemented with 0.5% w / v glucose at 37 &#8226; C. Protein expression was induced at mid-log phase (OD 600 &#8764;0.6) with 0.5 mM IPTG, followed by incubation at 16 &#8226; C overnight. Cell pellets were resuspended in lysis buffer (50 mM HEPES [pH 7.0], 1 M NaCl, 5 mM imidazole, 5% (v / v) glycerol, 1 mM D TT, 0.5 mM PMSF, ED TA-free protease inhibitor [Roche]), lysed by sonication, and clarioed by centrifugation at 15,000g. Soluble His 6 -MBP-TEV-Cas13a was isolated over metal ion afonity chromatography, and in order to cleave off the His 6 -MBP tag, the protein-containing eluate was incubated with TEV protease at 4 &#8226; C overnight while dialyzing into ion exchange buffer (50 mM HEPES [pH 7.0], 250 mM NaCl, 5% (v / v) glycerol, 1 mM DTT). The cleaved protein was loaded onto a HiTrap SP column (GE Healthcare) and eluted over a linear KCl (0.25-1 M) gradient. Fractions containing LbuCas13a were pooled, concentrated, and further purioed via size-exclusion chromatography on an S200 column (GE Healthcare) in gel oltration buffer (20 mM HEPES [pH 7.0], 200 mM KCl, 5% glycerol (v / v), 1 mM DTT), snap-frozen in liquid N 2 and were subsequently stored at -80 &#8226; C. Protein purity was assessed by loading &#8764;2 ug of total protein in a 4-12% Bis-Tris gel and staining with Coomassie Blue dye.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fluorescent ssRNA nuclease assays</head><p>Mature crRNAs and ssRNA targets were commercially synthesized (IDT; Supplementary Table <ref type="table">S2</ref> ). Cas13 trans-cleavage nuclease activity assays were performed in 10 mM HEPES pH 7.0, 50 mM KCl, 5 mM MgCl2, and 5% glycerol. Brieny, 100 nM LbuCas13a:crRNA complexes were assembled in for 30 min at 37 &#8226; C. 100 nM of Rnase Alert reporter (IDT) and various onal concentrations of ssRNA-target were added to initiate the reaction. These reactions were incubated in a nuorescence plate reader (Tecan Spark) for up to 60 min at 37 &#8226; C with nuorescence measurements taken every 5 min ( &#955; ex : 485 nm; &#955; em : 535 nm). Time-course and end-point values at 1 hour were background-subtracted, normalized, and analysed with their associated standard errors using GraphPad Prism9.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conformational changes of the HEPN1-2 domains</head><p>MD simulations were carried out on three RNA-bound complexes of the LbuCas13a: the binary complex bound to a guide crRNA only (crRNA-Cas13a), and the ternary complexes in which Cas13a binds to a 28 nt. tgRNA matching the crRNA (tgRNA-Cas13a), and a 36 nt. tag-anti-tag RNA (atgRNA-Cas13a, Figure <ref type="figure">1</ref> ). In these systems, we reinstated the catalytic H1053 and R1048 in the HEPN domains, which were mutated to alanine in the structures to prevent RNA cleavage <ref type="bibr">( 19 )</ref>. An aggregate sampling of &#8764;15 &#181;s was collected for each system, from three simulation replicas of &#8764;5 &#181;s each.</p><p>To characterize the difference in the HEPN1-2 conformational dynamics among the systems studied, we orst analysed the distribution of the Jensen-Shannon Distances ( JSD ) <ref type="bibr">( 34 )</ref>, measuring the similarity of two distributions, ranging from 0 (similar) to 1 (dissimilar). The JSD distributions were computed to compare all intra-backbone distances (BB distances) and backbone torsions (BB torsions) of the HEPN domains in the investigated systems (see Material and Methods ). We observe that, while overall the JSD distributions are similar for the overall HEPN1-2 domains ( Supplementary Figure <ref type="figure">S1</ref> ), the backbone dynamics of the catalytic cleft (residues 470-480, 1045-1055) clearly display a separation between the crRNA-and tgRNA-Cas13a systems (Figure <ref type="figure">2</ref> A, B, red distribution). A separation in the dynamics of the catalytic cleft is also observed when comparing the crRNA-and atgRNAbound Cas13a (blue distribution), while the catalytic cleft dynamics in the atgRNA-Cas13a are least separated from those of the tgRNA-Cas13a (grey distribution). This observation is complemented by the Solvent Accessible Surface Area (S AS A), showing that the HEPN1-2 catalytic cleft is significantly more accessible to the solvent in the tgRNA-bound system than in the crRNA-Cas13a binary system (Figure <ref type="figure">2 C</ref>), also deviating from the experimental structures <ref type="bibr">( 19 )</ref>. Interestingly, time-evolution plots of the S AS A across our simulation replicas show that in the tgRNA-bound system, the catalytic pocket more frequently transitions from open-to-close conformation with respect to the crRNA-Cas13a, which displays a relatively closed pocket ( Supplementary Figure <ref type="figure">S2</ref> ). This indicates that a closed catalytic core in the crRNA-bound complex opens upon tgRNA binding (Figure <ref type="figure">2 D</ref>). Interestingly, in the presence of an atgRNA, the catalytic cleft samples both open and closed conformations, also accessing states where the pocket is even less accessible than in the crRNA-Cas13a. This is also notable from the time-evolution plots, reporting a higher amplitude and frequency of the open-to-close transition in the atgRNA-Cas13a compared to the tgRNA-bound system ( Supplementary Figure <ref type="figure">S2</ref> ), overall indicating a higher plasticity of the pocket.</p><p>These observations indicate that when Cas13a is bound to the crRNA alone, the HEPN1-2 catalytic cleft assumes a closed conformation. On the other hand, tgRNA binding results in an opening of the cleft. Moreover, the plasticity of the HEPN1-2 cleft appears to be a key distinguishing factor between the tgRNA-and atgRNA-Cas13a systems. In the atgRNA-Cas13a complex, the catalytic cleft is more nexible in comparison to the tgRNA-bound Cas13a, suggesting that also the length of base-pair complementarity impacts the HEPN dynamics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Target RNA shifts the dynamics of Cas13a</head><p>To understand how the binding of the tgRNA, in the presence and absence of an extended tag-anti-tag pair (atgRNA), impacts the dynamics of the spatially distant HEPN1-2 domains, it is imperative to measure the dynamic correlations among these spatially distant sites <ref type="bibr">( 42 )</ref>. We employed a generalized correlation (GC) analysis, relying on Shannon9s entropy-based estimation of mutual information <ref type="bibr">( 37 )</ref>, to describe the linear and non-linear couplings of amino acids and nucleobases. Analysis of the per-domain GC reveals that tgRNA binding increases the overall inter-domain correlations in Cas13a, including the catalytic HEPN1-2 domains, with respect to the crRNA-and atgRNA-bound complexes (Figures <ref type="figure">3 A</ref>, <ref type="figure">S3</ref>). This observation renects the biophysical basis of protein allostery <ref type="bibr">( 43 ,44 )</ref>. Accordingly, the binding of an allosteric effector-in the present case the tgRNA that activates Cas13a-shifts the protein conformational landscape <ref type="bibr">( 32 )</ref>. This results in an overall increase of coupled motions, which mediate the communication between distant sites. Correlations between the crRNA and the Cas13a domains are found to improve in the ternary complexes, compared to the crRNA-Cas13a. Dynamical differences are further captured by Principal Component Analysis (PCA), which separates the binary crRNA-Cas13a complex from the ternary complexes along the orst principal component (PC1), while distinguishing the tgRNA-and atgRNA-bound complexes along PC2 ( Supplementary Figure <ref type="figure">S4</ref> ). This observation is in line with the domain-wise PCs, which indicate that the main differentiation between the tgRNA-and atgRNA-bound Cas13a systems lies in the PC2 of the NUC domains, particularly the Linker and the HEPN1-2 domains.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Signal-to-noise ratio of communication efociency</head><p>To understand how the observed dynamical differences impact allosteric signalling, we performed graph theory-derived network analysis <ref type="bibr">( 38 )</ref>. This approach is suited for the characterization of allosteric mechanisms, as shown in a number of studies <ref type="bibr">( 38 ,45-48 )</ref>, including those performed by our research group <ref type="bibr">(29)</ref><ref type="bibr">(30)</ref><ref type="bibr">(31)</ref><ref type="bibr">(32)</ref><ref type="bibr">(33)</ref>.</p><p>We intended to estimate the communication efociency between the crRNA spacer region and the catalytic cleft in the systems. In this respect, traditional shortest-path measurements are useful to ond the most likely communication pathways between sites, but they do not report how the identioed pathways stand out (i.e. are favourable) over the entire communication network <ref type="bibr">( 38 )</ref>. Assessing this favourability is crucial since the dominant allosteric pathways are particularly effective in transmitting communication between distantly coupled subunits. Hence, we introduced a novel signal-to-noise ratio ( SNR ) estimate of the information transfer, measuring the preference of communication between predeoned distant sites -i.e. the signal -over the remaining pathways of comparable length in the network-i.e. the noise. High SNR values indicate the preference of the network to communicate through the signal over other noisy routes (see Materials and methods).</p><p>First, we computed the optimal (i.e. the shortest) and top ove sub-optimal pathways between all crRNA bases and the Cas13a residues, obtaining a distribution of communication efociency in terms of the sum of edge betweennesses (i.e. the number of shortest pathways that cross the edge, measuring the 8trafoc9 passing through them). This provided a comprehensive overview of all communications, constituting the crosstalk noise between crRNA and protein. Then, we computed the optimal and sub-optimal pathways communicating two regions of the crRNA spacer (i.e. 8seed9 and 8switch9), which have been reported to be critical for Cas13a activation) <ref type="bibr">( 18 )</ref>, with the HEPN1-2 catalytic residues (R472, H477, R1048, H1053). This represents the signal of our interest. To ensure consistency in the scale of comparison, we characterized the SNR across short (6-8 edge counts), medium <ref type="bibr">(9)</ref><ref type="bibr">(10)</ref><ref type="bibr">(11)</ref>, and long paths <ref type="bibr">(12)</ref><ref type="bibr">(13)</ref><ref type="bibr">(14)</ref>, based on the number of edges communicating the crRNA regions with the catalytic residues (details in Supplementary Text, Supplementary Figure <ref type="figure">S5</ref> ). The obtained SNR indicates the extent to which the signal deviates from the distribution noise, thus renecting the prevalence of the signal over the noise (Figure <ref type="figure">3 B</ref>). Little-to-no overlap of the signal with the noise (i.e. high SNR ) indicates the prevalence of the allosteric signal.</p><p>In the crRNA-Cas13a, broad noise distributions overlap with the signals irrespective of the path lengths, dampening the SNR compared to the tgRNA-Cas13a. In the tgRNA-Cas13a, ampliocation of signals from the 8switch9 regions for medium-to-long path lengths indicate that tgRNA binding improves the crosstalk efociency between the crRNA spacer and the HEPN1-2 catalytic residues. The 8seed9 region also amplioes the signal across longer paths. Signals sourcing from other regions of the crRNA spacer were also computed in the tgRNA-bound complex ( Supplementary Figure <ref type="figure">S6</ref> ), reporting a lower SNR with respect to the 8switch9 and 8seed9 regions. This observation agrees with previous biochemical data <ref type="bibr">( 18 )</ref>, suggesting that the complementarity at the 8switch9 region could trigger the allosteric activation of HEPN1-2, with mismatches in this region preventing LbuCas13a activation. In the atgRNA-Cas13a, a modest SNR is detected over medium and long path lengths, while the communication increases over shorter paths for signals sourcing from the 8switch9 region.</p><p>Overall, the communication is remarkably efocient in the tgRNA-bound Cas13a, in line with the experimental suggestion that tgRNA binding allosterically triggers activation of the HEPN1-2 domains ( 18 ). As noted above, allosteric phenomena are commonly associated with a shift in dynamics, resulting in the efocient transfer of signals. Our ondings thereby suggest that efocient signalling is associated with a shift in correlations that prioritize the communication signals over communication noise. Therefore, shifted correlations result in noise-free communication pathways for allosteric signals.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Key residues for allosteric coupling</head><p>The SNR revealed that the communication signal from the crRNA spacer to the HEPN1-2 catalytic cleft is remarkably efocient in the tgRNA-Cas13a. To further understand the signal transduction mechanism in this system, we computed the signalling pathways (i.e. the optimal and top ove suboptimal paths) connecting the 8switch9 and 8seed9 regions of the crRNA to the catalytic core residues (Figure <ref type="figure">4 A</ref>).</p><p>The pathways connecting the 8switch9 region to the catalytic residues exclusively follow a route that directly connects the crRNA bases of the repeat region (A(-5)-C(-1)) to the catalytic core through the HEPN1(I)-2 interface (Figures <ref type="figure">4 B</ref>, <ref type="figure">S7A</ref>). On the other hand, the 8seed9 region communicates with the catalytic core through multiple routes, involv-ing the Linker-HEPN2 interface, the HEPN1(II) residues, as well as the HEPN1(I)-2 interfacial residues through the cr-RNA repeat, similar to the communication observed for the 8switch9.</p><p>Hence, the pathways connecting the 8switch9 nucleotides to the catalytic residues display a lower number of residues with increased occurrences, tracing a more efocient communication path, compared to the pathways connecting the 8seed9. This observation further aforms the critical role of the 8switch9 in the allosteric activation of HEPN1-2 <ref type="bibr">( 18 )</ref>. Extending our analysis to 50 suboptimal pathways reported a similar trend ( Supplementary Figure <ref type="figure">S7 B</ref>), where a direct route consistently connects the 8switch9 region to the HEPN1(I)-2 catalytic core through the crRNA repeat bases A(-4) and A(-3). This evidence pinpoints a pivotal role of the crRNA repeat region in signal transmission.</p><p>To better understand our observations, we analysed the interactions between the crRNA repeat region (A(-5)-(C-1)) and the proximal HEPN1(I)-2 interface (residues: 371-383; 963-975). Notably, this interface region is located distally with respect to the catalytic cleft (Figure <ref type="figure">4 A</ref>). We observed that in the tgRNA-bound system, the A(-3) base of the cr-RNA repeat region penetrates the interface (Figure <ref type="figure">4 C</ref>), hampering interactions between HEPN1(I) and HEPN2. This is conormed by the histogram of differential contact stability ( f crRN A -tgRN A , Supplementary Figure <ref type="figure">S8 A</ref>), showing that, in the crRNA-Cas13a, the nipped-out A(-3) base causes increased interactions at the HEPN1(I)-2 interface, with respect to the tgRNA-bound system. In the atgRNA-Cas13a, the cr-RNA repeat region is sequestered due to the extended taganti-tag complementarity, leading to stable interfacial contacts compared to the tgRNA-Cas13a ( Supplementary Figure <ref type="figure">S8 B</ref>).</p><p>To detail the interactions of the A(-3) base in the tgRNAbound Cas13a, we conducted an in-depth contact analysis at the HEPN1(I)-2 interface. A Sankey plot was used to report the frequency, f , of stable contacts between residues of the HEPN1(I) and HEPN2 domains, and the crRNA, forming for g10% of the simulation time ( f g 0 . 1 ) in the tgRNA-Cas13a (Figure <ref type="figure">4</ref> D, details in Supplementary Text). In this plot, residues are connected through edges, whose width is proportional to f . We observe that A(-3) substantially interacts with polar / positively charged residues, mainly R377, N378, and R963. Analysis of the differential contact stability was also carried out to characterize interactions that gain stability in tgRNA-Cas13a, compared to the crRNA-Cas13a (details in Supplementary Text).</p><p>As expected, a loss of stable contacts at the HEPN1(I)-2 interface is observed in the tgRNA-Cas13a, compared to the crRNA-bound crRNA system ( Supplementary Figure <ref type="figure">S9 A</ref>). Upon tgRNA binding, R377 and N378 of HEPN1(I) gain interactions with A(-3); and R963 and R973 of HEPN2 increase their contacts with the neighbouring bases, compared to the crRNA-Cas13a ( Supplementary Figure <ref type="figure">S9 B</ref>). To test whether this observation is sequence dependent, we substituted the A(-3) base with a smaller cytosine in the tgRNA-Cas13a and carried out an additional &#8764;5 &#181;s-long simulation. In this system, the C(-3) base stably locates at the HEPN1(I)-2 interface and interacts with R377, N378, and R963 (Figure <ref type="figure">4 E</ref>). Taken together, these observations suggest that these rearranged interactions involving charged / polar residues between HEPN1(I)-2 could be critical for allosteric signalling from the crRNA spacer to the catalytic core. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Role of HEPN1-2 interfacial interactions</head><p>To experimentally verify our observations, we generated and purioed the wild-type (WT) Cas13a and four variants, mutating charged / polar residues to alanine at the HEPN1(I)-2 interface (i.e. R377A, N378A, R963A, and R973A, Figure <ref type="figure">4 F</ref>). We designed and generated tgRNAs and crRNAs containing the spacer used by Liu et al. <ref type="bibr">( 19 )</ref> (PDB: 5XWP, Figure <ref type="figure">5 A</ref>) and an anti-tag containing target RNA (atgRNA), holding an extended 8-nt. sequence with complementarity to the crRNA direct repeat (Figure <ref type="figure">5 B</ref>). We performed nuorescent RNA transcleavage assays with WT LbuCa13a and the variants bound to our tgRNA and atgRNA sequences (see Material and Methods ). In the WT LbuCas13a, there is robust activation of the nuclease activity with a tgRNA, while the presence of an at-gRNA results in a small decrease in the apparent cleavage rates over time (Figure <ref type="figure">5 C</ref>), albeit the amount of end-point cleavage product was only slightly reduced relative to the tgRNA (Figure <ref type="figure">5 D</ref>). In our LbuCas13a variants, N378A and R973A maintained a robust activity for the tgRNA, the R377A variant suffered a reduction in cleavage efociency and R963A was not active at all (Figure <ref type="figure">5 D</ref>). In the presence of an atgRNA, none of the variants displayed any signiocant nuclease activation, suggesting that these variants are more sensitive to anti-tag containing RNAs than the WT LbuCas13a (Figure <ref type="figure">5 C</ref>, <ref type="figure">D</ref>). To further validate these observations, we obtained crRNA-tgRNA (Figure <ref type="figure">5 E</ref> (PDB:7DMQ) ( 20 ). Our cleavage assays showed a similar activation pattern to our data above, where our LbuCas13a variants show increased inhibition in the presence of an atgRNA (Figure <ref type="figure">5 G</ref>, <ref type="figure">H</ref>).</p><p>Taken together, these experimental data, using a 28 nt. spacer similar to the simulations, reveal that the WT Lbu-Cas13a maintains its activity irrespective of tgRNA or at-gRNA binding. On the other hand, our variants still cleave RNA with tgRNA bound, while hampered nuclease activation in the presence of an extended tag-anti-tag complementarity is observed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Cas13a variants reorg aniz e the allosteric communication network</head><p>To evaluate how the signalling transfer is affected by the mutations we made, we collected a &#8764;15 &#181;s ensemble for each of our four Cas13a variants bound to a tgRNA and atgRNA and compared them with the corresponding WT Cas13a complexes. Analysis of the JSD and S AS A distributions reveal that the conformational dynamics of the HEPN1-2 catalytic cleft are perturbed by single mutations at the HEPN1(I)-2 interface compared to the WT Cas13a ( Supplementary Figure <ref type="figure">S10</ref> ). We then analysed the signalling transfer by comparing the maximal SNR detected in the variants with that of the WT Cas13a, considering all signals sourcing from the critical 8switch9 region, and establishing a consistent scale for comparison (Figures 6 A, S11-12). The highest SNR across different path lengths indicates the most favoured communication route in the system, irrespective of path lengths. We thereby computed the ratio between the maximal SNR in the variants and in the WT Cas13a ( SNR ratio = SN R max -variant / SN R max -wt ). This comparison indicates whether point mutations at the HEPN1(I)-2 interface impact the strength of the communication between the crRNA 8switch9 and the catalytic residues compared to the WT.</p><p>We observe that in the tgRNA-bound systems, the R377A, N378A, and R973A variants maintain a high SNR ratio , as evidence of efocient communication compared to the WT Cas13a (Figure <ref type="figure">6 A</ref>). On the other hand, R963A reduces the signal with respect to the WT, indicating altered communication. Upon atgRNA binding, the SNR ratio reaches 40-60% of perturbations in all variants with respect to the WT, with R973A maintaining a SNR ratio approximately within 30% of the WT. This suggests that, in the atgRNA-bound variants, the signalling from the 8switch9 to the HEPN1-2 catalytic core is reduced. This is in line with the experimental activity, showing that none of the variants displays a signiocant nuclease activation in the presence of an atgRNA (Figure <ref type="figure">5 C</ref>, <ref type="figure">D</ref>).</p><p>To further understand the signalling transfer in the tgRNAbound variants, and the observation of a reduced SNR ratio in the inactive R963A mutant, we analysed the interactions of the crRNA repeat bases and the proximal HEPN1(I)-2 interface. In the WT tgRNA-Cas13a, the A(-3) base forms stable contacts with R377, N378, and R963, along with several other HEPN2 residues (Figure <ref type="figure">4 D</ref>). These interactions are preserved in the tgRNA-bound mutants except, as expected, for the mutated residue ( Supplementary Figure <ref type="figure">S13</ref> ). Nevertheless, in the tgRNA-bound R963A mutant, the A(-3) base is more nexible and is frequently extruded from the HEPN1(I)-2 interface (Figure <ref type="figure">6 B</ref>). The A(-3) conformations are monitored on a polar plot reporting the distance d , which describes the displacement of A(-3) with respect to the C &#945; atom at position 963, and the dihedral angle &#952;, reporting the rotation of the A(-3) purine base with respect to the crRNA backbone (Figure <ref type="figure">6</ref> C, S14). The polar plot evidences the higher nexibility of A(-3) in the R963A mutant, compared to the remaining systems, and its extrusion from the HEPN1(I)-2 interface. This observation can be ascribed to the loss of interaction between the R963 guanidinium side chain and the crRNA phosphate backbone that, on the other hand, is maintained in the other systems ( Supplementary Figure <ref type="figure">S15</ref> ). We recall that the R963A substitution hampers the tgRNA-Cas13a activity (Figure <ref type="figure">5</ref> ). This suggests that the positioning of the A(-3) base, and the dynamics of the crRNA repeat region at the HEPN1(I)-2 interface, critically affects the transmission of the signal from the 8switch9 to the catalytic core, as evidenced by lower SNR with respect to the WT.</p><p>To characterize the allosteric communication in our atgRNA-bound variants, and how it compares to the tgRNA-Cas13a, we inspected the pathways communicating the 8switch9 to the HEPN1-2 catalytic core. In the atgRNAbound WT Cas13a, allosteric pathways mainly involve the HEPN1(I), HEPN1(II), and HEPN2 interfaces and the tgRNA (Figures <ref type="figure">6 D</ref>, <ref type="figure">S16</ref>), at odds with the direct routes passing through the crRNA repeat and HEPN1(I)-2 observed in the tgRNA-Cas13a (Figure <ref type="figure">4</ref> B and S6). When introducing our point mutations in the tgRNA-bound Cas13a, the major routes of communication are similar to those of the WT system, with the addition of a few residues from HEPN1(II) ( Supplementary Figure <ref type="figure">S17 A</ref>). In the atgRNA-bound variants, allosteric pathways are more sensitive to perturbations, compared to the tgRNA-bound variants. In the presence of an at-gRNA, the variants increase the number of crRNA bases involved in the communication with respect to WT atgRNA-Cas13a, with the N378A and R963A variants also losing the tgRNA communication channel ( Supplementary Figure <ref type="figure">S17 B</ref>). An analysis of the interactions at the major interfaces along the allosteric pathways (i.e. HEPN1(I), HEPN1(II) and HEPN2, Figure <ref type="figure">6 E</ref>) also shows that the mutations result in a reduction of stable contacts at the HEPN1(I)-HEPN2 interface with respect to the WT Cas13a, while gained back at the HEPN1(I)-HEPN1(II) interface. These rearrangements at the critical HEPN1(I)-(II)-HEPN2 interfaces are the basis of the altered signalling pathways observed in the atgRNA-bound variants ( Supplementary Figure <ref type="figure">S17</ref> ).</p><p>In summary, analysis of the allosteric communication in our mutants reveals that the inactive mutants reduce the signal with respect to the WT. This altered communication can be attributed to the dynamics of the crRNA repeat region in the tgRNA-bound systems, and to an overall perturbation of the crosstalk pathways in the presence of an atgRNA. All in all, alterations of the protein-RNA interactions at the and ( i i ) the dihedral angle &#952; between the C39@A(-4), C39@A(-3), C8@A(-4) and C2@A(-4) atoms (angular coordinate, in degrees), computed from the simulated ensembles of the WT tgRNA-bound Cas13a and its mutants. HEPN1(I)-2 interface in the mutants with respect to the WT, point for the allosteric regulation of the activity.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head><p>Here, extensive MD simulations were combined with graph theory and experimental assays to characterize the allosteric activation mechanism in Cas13a, a newly emerged CRISPR-Cas system, which is being developed as a powerful tool for RNA cleavage, detection and imaging.</p><p>Multiple&#181;s simulations reveal that the binding of a target RNA (tgRNA) at the recognition lobe impacts the dynamics of the spatially distant HEPN1-2 catalytic core, resulting in altered dynamics with respect to the crRNA-bound form (Figure <ref type="figure">2 A-B</ref>), and in the opening of the catalytic cleft (Figure <ref type="figure">2 C</ref>). This is in line with biochemical data, suggesting that tgRNA binding allosterically activates HEPN1-2 to form a composite active site <ref type="bibr">( 15 ,18 )</ref>. In the presence of an extended tag-anti-tag pairing (atgRNA), the HEPN1-2 catalytic cleft increases its nexibility, compared to both the crRNA-and tgRNA-Cas13a, indicating that the length of the complementarity also affects the HEPN1-2 dynamics.</p><p>Analysis of mutually coupled motions shows that tgRNA binding induces a shift in the system9s dynamics (Figure <ref type="figure">3 A</ref>), evidenced by increased inter-domain correlations. Coupled dynamics are also found in the presence of an atgRNA, mainly involving the crRNA and the Cas13a domains. This 8shift in dynamics9 is typical evidence of an allosteric response <ref type="bibr">( 32 ,43 )</ref>, reinforcing the notion of the tgRNA as an effector of Cas13a9s allosteric function.</p><p>To better understand how the observed differences in the systems9 dynamics impact the nux of information from the sites of tgRNA binding to the catalytic core, we estimated the crosstalk efociency across the studied systems. We analysed how the communication signal from the crRNA (site of tgRNA binding) to the HEPN1-2 cleft emerges over the remaining pathways (i.e. the noise) using a signal-to-noise ratio ( SNR ) measure. We found that, upon tgRNA binding, the signal from the 8switch9 region of the crRNA amplioes over the noise, providing evidence of efocient crosstalk (Figure <ref type="figure">3 B</ref>). The communication sourcing from the 8switch9 is also maintained upon atgRNA binding. These observations underscore the critical role of this region in the allosteric signalling between the sites of effector binding and RNA cleavage. We recall that, in line with our observations, biochemical data have noted the 8switch9 region to be crucial in triggering the allosteric activation of HEPN1-2 <ref type="bibr">( 18 )</ref>.</p><p>Analysis of the allosteric pathways also shows that the 8switch9 region efociently communicates with the HEPN1-2 catalytic core through the crRNA bases of the repeat region (i.e. A(-5)-C(-1), Figure <ref type="figure">4 B</ref>). This pinpoints a cardinal role for the crRNA repeat region, which prompted us to analyse its interactions with Cas13a. We examined the interactions with the proximal HEPN1(I)-2 interface, notably located distally with respect to the catalytic cleft (Figure <ref type="figure">4 C</ref>). We observe that in the tgRNA-Cas13a, the A(-3) base of the crRNA repeat penetrates the HEPN1(I)-2 interface, decreasing inter-domain interactions compared to the crRNA-Cas13a where A(-3) is extruded. The extended tag-anti-tag complementarity also sequesters A(-3) from the HEPN1(I)-2 interface. The interaction network further reveals several polar / charged residues critical for HEPN(I)-2 binding, that exhibit increased interac-tions with A(-3) upon tgRNA biding (Figure <ref type="figure">4 D-E</ref>). In detail, N378, R963A, R973 and R377A rearrange their interactions at the HEPN(I)-2 interface upon tgRNA binding, which could be critical in mediating the allosteric information transfer process.</p><p>To experimentally test the role of these polar / charged residues in the allosteric activation of Cas13a, we performed mutagenesis and RNA cleavage experiments. Alanine substitution of N378, R377, and R973 maintains a robust activity for the tgRNA, while R963A is not active (Figure <ref type="figure">5</ref> ). In the presence of an atgRNA, none of the variants displays signiocant nuclease activation. This concludes that our N378A, R377A, and R973A variants can discriminate tgRNA for cleavage over atgRNA. Our computational analysis and the experimental assays thereby disclose polar / charged residues that are pivotal for the allosteric signalling and whose mutation in alanine can push the preference for tgRNA-activated cleavage vs. atgRNA-activated cleavage.</p><p>To understand the mechanistic function of our mutants, we performed additional multi&#181;s long simulations of our variants. Analysis of the allosteric signal sourcing from the 8switch9 region to the HEPN1-2 catalytic cleft, shows that our variants mostly perturb the SNR upon atgRNA binding (with respect to the WT atgRNA-Cas13a, Figure <ref type="figure">6</ref> A), while the tgRNA-bound variants preserve efocient signalling compared to the WT tgRNA-Cas13a. Interestingly, the tgRNAbound R963A Cas13a, which is experimentally shown to be inactive (Figure <ref type="figure">5 D</ref>), reduces the signal with respect to the WT, indicating altered communication. It is also notable that R963 is among the highest-occurring residues in the allosteric routes of the WT Cas13a ( Supplementary Figure <ref type="figure">S7</ref> ), while it does not appear in the allosteric pathways of the tgRNA-bound R963A mutant ( Supplementary Figure <ref type="figure">S17 A</ref>). Hence, mutating a residue belonging to the WT allosteric pathway has a signiocant impact on the activity, while other residues not included in the path show a less dramatic impact, in particular when the allosteric effector is bound (tgRNA). This further supports that the allosteric pathway connecting the sites of tgRNA binding with the catalytic cleft controls the activity in the WT Cas13a. In the R963A-tgRNA system, the A(-3) base is frequently extruded from the HEPN1(I)-2 interface (Figure <ref type="figure">6 C</ref>), suggesting that the positioning of the A(-3) base (and the dynamics of the crRNA repeat region) critically affects the transmission of the signal from the 8switch9 to the catalytic core, as evidenced by lower SNR with respect to the WT.</p><p>Analysis of the allosteric pathways also shows that our variants preserve the communication connecting the 8switch9 to the catalytic cleft in the tgRNA-bound system ( Supplementary Figure <ref type="figure">S17 A</ref>). On the other hand, in the presence of a tag-anti-tag, our mutations perturb and reduce the conservation of the signalling pathways observed in the WT atgRNA-Cas13a ( Supplementary Figure <ref type="figure">S17 B</ref>). This observed perturbation of the allosteric crosstalk pathways, observed in the atgRNA-bound variants, along with altered SNR compared to the WT, is qualitatively consistent with the experimentally observed inactivity of the atgRNA-bound variants (Figure <ref type="figure">5</ref> ). Overall, the inactive mutants (i.e. the atgRNAbound variants and the tgRNA-bound R963A) reduce the signal with respect to the WT. This reduced signalling can be ascribed to the dynamics of the crRNA repeat region in the tgRNA-bound systems, and to an overall perturbation of the crosstalk pathways in the presence of an atgRNA.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>In summary, our study characterizes the allosteric activation mechanism of Cas13a and discloses critical point mutations able to promote tgRNA-mediated over an extended tag-anti-tag-mediated complementarity, for RNA cleavage activation. We show that the binding of a tgRNA acts as an allosteric effector of the spatially distant HEPN1-2 catalytic cleft, by amplifying the allosteric signals that connect the sites of tgRNA binding with the HEPN1-2 catalytic site. By introducing a novel graph theory-based analysis-signal-to-noise ratio ( SNR ) of communication efociency-we show that the allosteric signal stands out over the dynamical noise when passing through the crRNA repeat region. Critical residues at this interface (R377, N378, and R973) rearrange their interactions upon tgRNA binding and are experimentally shown to select tgRNA, over an extended atgRNA, for RNA cleavages. Considering this selectivity, we speculate here that alanine mutation of R377, N378, and R973 could improve (or alter) the selectivity of Cas13a. This hypothesis is conormed in our companion paper by , showing that our computational approach can guide the development of more selective RNA cleavage and detection strategies. Taken together, our ondings offer a fundamental understanding of the CRISPR-Cas13a mechanistic function, and pave the way for harnessing innovative computational approaches for engineering innovative RNA-based cleavage and detection tools.</p></div></body>
		</text>
</TEI>
