<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Parallel degradome-seq and DMS-MaPseq substantially revise the miRNA biogenesis atlas in Arabidopsis</title></titleStmt>
			<publicationStmt>
				<publisher>Nature</publisher>
				<date>07/01/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10566252</idno>
					<idno type="doi">10.1038/s41477-024-01725-9</idno>
					<title level='j'>Nature Plants</title>
<idno>2055-0278</idno>
<biblScope unit="volume">10</biblScope>
<biblScope unit="issue">7</biblScope>					

					<author>Xingxing Yan</author><author>Changhao Li</author><author>Kaiye Liu</author><author>Tianru Zhang</author><author>Qian Xu</author><author>Xindi Li</author><author>Jiaying Zhu</author><author>Ziying Wang</author><author>Anikah Yusuf</author><author>Shuqing Cao</author><author>Xu Peng</author><author>James J Cai</author><author>Xiuren Zhang</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[This comprehensive study thus substantially revises the current model of pri-miRNA processing from different perspectives in plants.
Nature Plants
Resource]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>MicroRNAs (miRNAs) are produced from highly structured primary transcripts (pri-miRNAs) and regulate numerous biological processes in eukaryotes. Due to the extreme heterogeneity of these structures, the initial processing sites of plant pri-miRNAs and the structural rules that determine their processing have been predicted for many miRNAs but remain elusive for others. Here we used semi-active DCL1 mutants and advanced degradome-sequencing strategies to accurately identify the initial processing sites for 147 of 326 previously annotated Arabidopsis miRNAs and to illustrate their associated pri-miRNA cleavage patterns. Elucidating the in vivo RNA secondary structures of 73 pri-miRNAs revealed that about 95% of them differ from in silico predictions, and that the revised structures offer clearer interpretation of the processing sites and patterns. Finally, DCL1 partners Serrate and HYL1 could synergistically and independently impact processing patterns and in vivo RNA secondary structures of pri-miRNAs. Together, our work sheds light on the precise processing mechanisms of plant pri-miRNAs.</p><p>MicroRNAs (miRNAs) regulate numerous biological processes in plants and animals <ref type="bibr">[1]</ref><ref type="bibr">[2]</ref><ref type="bibr">[3]</ref> . Mature miRNAs are loaded into Argonaute (AGO) protein to form RNA-induced silencing complex (RISC), which fulfils slicing or translational repression of the target transcripts on the basis of sequence complementarity <ref type="bibr">4,</ref><ref type="bibr">5</ref> . The production of functional miRNAs entails precise and efficient processing of primary miRNAs (pri-miRNAs) by Microprocessor. Pri-miRNAs are characterized by stem-loop structures consisting of flanking single-stranded (ss) basal segments, a lower stem, a double-stranded (ds) duplex of miRNA and its complementary strand (miRNA/*), an upper stem, and a terminal loop <ref type="bibr">6</ref> . In metazoans, the structures of pri-miRNAs are relatively uniform <ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref> . By contrast, plant pri-miRNAs display remarkable diversity in shapes and sizes, with variable positioning of miRNA/* duplexes, even within the same MIRNA families <ref type="bibr">6</ref> . Due to structural heterogeneity, plant pri-miRNAs can be processed from base-to-loop (BTL) and sequential BTL (SBTL) or from loop-to-base (LTB) and sequential LTB (SLTB) <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref><ref type="bibr">[15]</ref> . Notably, terminal-loop-branched pri-miRNAs can be processed bidirectionally from either BTL or LTB, frequently resulting in productive</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Degradome-seq defines the first cutting sites of pri-miRNAs</head><p>To systematically pinpoint the first cleavage sites of pri-miRNAs, we transformed native-promoter-driven Flag-4Myc (FM) tagged semi-active forms of DCL1-E1507Q and DCL1-E1696Q into dcl1-9 heterozygotes (Fig. <ref type="figure">1b</ref> and Supplementary Fig. <ref type="figure">1a</ref>,<ref type="figure">b</ref>). Since dcl1-9 harbours a transfer DNA insertion in the second dsRNA-binding domain (exon 19) and its homozygotes are embryonic lethal, we initially assumed that the introduction of DCL1 E1507Q and DCL1 E1696Q into dcl1-9 -/-would generate hypomorphic alleles. Among hundreds of complementation lines, we were able to obtain numerous transgenic lines with clear developmental defects mimicking dcl1-9 -/-lines. However, all these plants were genotyped as dcl1-9 +/-;DCL1 E1507Q and dcl1-9 +/-;DCL1 E1696Q lines, which still harboured a copy of the wild-type (WT) DCL1 allele (Fig. <ref type="figure">1c</ref>). Furthermore, the severity of the plant growth abnormality was correlated with the expression level of the semi-active DCL1 variants and reduced levels of miRNAs in the transgenic lines (Supplementary Fig. <ref type="figure">1a</ref>,<ref type="figure">b</ref>). The semi-active variants of DCL1 thus served as dominant-negative forms to compete with the WT DCL1 protein for pri-miRNAs, leading to the accumulation of a series of intermediately processed forms of pri-miRNAs with descending orders of amounts in the transgenic lines (Supplementary Fig. <ref type="figure">1c</ref>). From these materials, we recovered 3&#8242; ends of cleaved pri-miRNA segments, performed degradome-seq, and obtained in-depth and high-quality datasets (Methods) (Fig. <ref type="figure">1d</ref> and Supplementary Fig. <ref type="figure">1d</ref>,<ref type="figure">e</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Bona fide miRNAs based on the cleavage activity of DCL1</head><p>We extended the reads 200 bp upstream and downstream of the annotated pri-miRNAs, aiming at capturing the possible cleavage sites in a broader context flanking the pri-miRNAs. Mapping of sequence reads onto the extended regions revealed that the patterns of reads' distributions are remarkably diversified among different pri-miRNAs (Fig. <ref type="figure">1e-g</ref>). For two canonical pri-miRNAs, pri-miR864 and pri-miR162a, we were able to detect some remnants of DCL1-processed intermediates in WT plants (Fig. <ref type="figure">1e</ref>,f, top). By contrast, we clearly observed several abrupt steps of read accumulation in pri-miRNA regions (the purple and yellow-brown arrows in the shaded transcript areas) in the complementation lines expressing DCL1-E1507Q and DCL1-E1696Q, respectively (Fig. <ref type="figure">1e</ref>,f, middle two panels). The abrupt increase of reads indicates that intermediate pri-miRNA products, resulting from a series of abortive processing steps by the DCL1 complex, were substantially accumulated in DCL1 semi-active mutants, enabling us to track sequential activities of DCL1 on pri-miRNAs. The steep patterns of read accumulations along pri-miRNAs could also serve as a benchmark to identify bona fide substrates of DCL1 and resultant miRNAs in vivo. With this criterion, we surveyed all 326 previously annotated pri-miRNAs from the and abortive processing of miRNAs, respectively <ref type="bibr">6</ref> . Plant Microprocessor canonically cuts pri-miRNAs at a site 15-17 nucleotides (nt) from a reference ssRNA-dsRNA junction region <ref type="bibr">6,</ref><ref type="bibr">16</ref> (Fig. <ref type="figure">1a</ref>). Paradoxically, numerous pri-miRNAs lack the reference region within the expected distance according to in silico RNA modelling. Also, RNA secondary structures (RSS) of pri-miRNAs are dynamic and can be remodelled by RNA helicases, as exemplified by CHR2 in vivo <ref type="bibr">17</ref> . How these pri-miRNAs are recognized and precisely processed by Microprocessor in vivo remains elusive. In addition, due to the RSS complexity of pri-miRNAs and challenges in identifying the processing patterns, the bona fide identities of numerous annotated miRNAs are still inconclusive <ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref> . Furthermore, many pri-miRNAs contain extended hairpin structures, and their first cutting sites and the reference structures determining the initial cleavages remain to be determined.</p><p>Arabidopsis Microprocessor comprises Dicer-like1 (DCL1) and two core cofactors, zinc-finger protein Serrate (SE) and a dsRNA-binding protein called hyponastic leaves 1 (HYL1) <ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref> . Microprocessor sequentially processes pri-miRNAs to precursors of miRNAs (pre-miRNAs) and finally to miRNA/* duplexes <ref type="bibr">24</ref> . DCL1 harbours two RNase III domainsthe RIII a and RIII b domains. We have previously identified two key residues, Glu1507 and Glu1696, located in RIII a and b, respectively, as critical to the enzymatic activity of this protein <ref type="bibr">6</ref> . The substitution of glutamine for glutamate at either of the residues leads to two semi-active DCL1 point mutants (E1507Q and E1696Q, respectively) that only cut one strand of pri-miRNA duplexes and fail to proceed to further processing, resulting in partially processed pri-miRNA intermediates. This abortive processing event would create an opportunity to pinpoint the first cleavage sites by Microprocessor in vivo. Both HYL1 and SE have been reported to stimulate cleavage of the precursors by DCL1 and to improve its accuracy <ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref> . This notwithstanding, their contributions to the molecular ruler of plant Microprocessor and impacts on RSS of pri-miRNAs are poorly understood.</p><p>Here we pinpointed the first cleavage sites of pri-miRNAs via degradome sequencing (degradome-seq) of lines expressing semi-active DCL1 variants. We clarified the 147 bona fide substrates of DCL1 from the 326 annotated pri-miRNAs and then comprehensively re-sorted the processing patterns of the true pri-miRNAs. In parallel, we conducted DMS-MaPseq <ref type="bibr">28</ref> to decipher the in vivo RSS of pri-miRNAs. Notably, 69 of 73 detectable pri-miRNAs (~95%) displayed RSS that deviated to varying degrees from the in silico predicted RSS, providing a better interpretation of the initial cleavage sites for numerous pri-miRNAs. Surprisingly, whereas numerous pri-miRNAs contain the canonical reference sites to direct Microprocessor for cleavage, they tend to harbour internal loops or bulges that are 9-11 nt away from the initial cleavage sites and might be meaningful in plants. Furthermore, approximately 77% of pri-miRNAs underwent their first cuttings at internal loops or bulge regions, providing a new guideline for artificial miRNA design. We also found that SE and HYL1 imposed varying impacts on the effectiveness and precision as well as in vivo RSS of different pri-miRNAs. BTL, SBTL, LTB, SLTB and bidirectional processing. In each case, DCL1 initially cleaves pri-miRNAs at a site that is ~15-17 nt away from a reference ssRNA-dsRNA junction (15-17-nt molecular ruler). The blue and pink regions in the pri-miRNA diagrams represent miRNA/* duplexes. Black arrowheads and text in the cartoon indicate the cleavage direction from base to loop, while grey arrowheads and text represent the cleavage pattern from loop to base. b, The domain arrangement of DCL1 (WT) and point alterations in semi-active DCL1 mutants (E1507Q and E1696Q). The nuclear localization signal (NLS), helicase domain, DUF283, Platform, PAZ, RNase III a and III b, and dsRNA-binding domain (dsRBD) are shown in black, yellow, green, pink, dark orange, yellow-brown, purple and blue, respectively. c, Phenotypes of three-week-old Col-0, dcl1-9 +/-and transgenic dcl1-9 +/-;P DCL1 -FM-DCL1 E1507Q and dcl1-9 +/-;P DCL1 -FM-DCL1 E1696Q plants. Scale bars, 1 cm. d, Scheme for construction of the degradome-seq libraries. RT, reverse transcription. e-g, Exemplified cleavage patterns of DCL1-dependent pri-miR864 (BTL direction) (e), DCL1-dependent pri-miR162a (LTB direction) (f) and DCL1-independent sRNA5630a (g). The yellow-brown and purple arrows represent RNase III a and III b cleavage sites, respectively. The percentages indicate the relative cutting ratios. Note that the previously claimed pri-miR5630a does not have a clear cleavage site by DCL1 in the WT or the DCL1 E1507Q and DCL1 E1696Q lines. h, The fraction of pri-miRNAs that have been verified to be DCL1 dependent. The total number of Arabidopsis pri-miRNAs is taken from miRBase Release 22.1. Note that only 147 of the 326 previously annotated pri-miRNAs are bona fide miRNAs on the basis of the cleavage activity of DCL1. The processing patterns for 35% of the true pri-miRNAs could be re-validated, whereas the other 30% display processing patterns deviating from the published literature and have now been re-annotated. Additionally, the processing patterns of the other 35% of pri-miRNAs were newly identified here. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>G U U U U G C U U C A G G U A U G A U U G A C U U C A A A A A</head><p>C G C C A U U 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 142</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 A miRBase Release 22.1 and found that a total of 147 pri-miRNAs displayed clear processing patterns in the two DCL1 semi-active mutants. These pri-miRNAs were thus confirmed to be genuine targets of DCL1, and their products are bona fide miRNAs (Type I: DCL1-dependent miRNAs) (Fig. <ref type="figure">1h</ref> and Supplementary Table <ref type="table">1</ref>). Intriguingly, we found that 81 of the 326 annotated pri-miRNAs, exemplified by pri-miR5630a, did not show any DCL1 cleavage activity in the DCL1 semi-active mutants and WT plants, as the read accumulations are gradually distributed along the reference sequence (Fig. <ref type="figure">1g</ref>,<ref type="figure">h</ref>). This group of small RNAs (sRNAs) should thus be re-annotated to DCL1-independent sRNAs. In addition, 98 pri-miRNAs remained to be re-annotated because the read counts of the processed pri-miRNAs were extremely low (less than 15 reads) or barely detectable in this study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Algorithm for classifying the processing patterns of pri-miRNAs</head><p>We next pinpointed the first cleavage positions for 147 true pri-miRNAs. We calculated the ratios of cleavages at the sites of individual nucleotides by dividing the accumulated reads at each position by the total accumulated reads across the extended pri-miRNAs (Supplementary Fig. <ref type="figure">1f</ref>). Because the dominant-negative form of DCL1 yielded the largest amount of abortive processing products after the first cleavage that could not proceed to the next round of processing, the nucleotides with the highest cutting ratios should be defined as the first cleavage sites along the pri-miRNAs. For instance, pri-miR864 displayed a predominant cutting (86%) at one position and a satellite cutting at a nearby site (7%) along the extended pri-miRNAs in the DCL1 E1696Q transgenic line. This position should be counted as the first cleavage site on one strand of the pri-miR864 duplex. In parallel, a predominant cutting site (46%) and its neighbourhood with a 16% cutting ratio were detected in the DCL1 E1507Q line, and this position should be the first cut on the other strand of pri-miR864. Two minor cutting events (13% and 5%) were also detected in the DCL1 E1507Q line, clearly resulting from the activity of residual WT DCL1 protein in the dominant-negative line (Fig. <ref type="figure">1e</ref>, bottom). Concomitant alignment of the two predominant cutting sites on the two strands of the pri-miR864 duplex placed the first cleavage sites close to the base region relative to the miR864/* duplex. This type of processing pattern indicates that Microprocessor processes pri-miR864 from a BTL direction, as reported previously <ref type="bibr">12,</ref><ref type="bibr">14,</ref><ref type="bibr">15</ref> .</p><p>Pri-miR162a displayed the highest cutting frequency of 56% at one site in the DCL1 E1507Q line, representing the first cleavage on one strand of the pri-miR162a duplex. The second-largest cleavage event (5%) took place 21 nt downstream due to the activity of the WT DCL1 protein in the line. Similarly, we detected the highest cutting frequency of 28.5%, indicative of the first processing site on the other strand of the pri-miR162a duplex, in the DCL1 E1696Q line. Again, two additional cutting sites with ratios of approximately 10.1% and 11.7% with a 21-nt internal distance on one strand of the duplex were observed due to one copy of WT DCL1 protein in the line, also because only the poly(A)-harboured debris of pri-miRNAs processed from Microprocessor could be recovered in the assay (Fig. <ref type="figure">1f</ref>, bottom). Since the two cleavage sites with the highest cutting frequency were proximal to the top region relative to the miR162/* duplex, and the secondary cleavage sites were distal from the top region, this type of processing is considered to be a standardized LTB pattern, as observed in previous studies <ref type="bibr">14,</ref><ref type="bibr">15</ref> .</p><p>We also found that DCL1 RNase domain b first cleaved at the 5&#8242; arm of pri-miRNA in the DCL1 E1507Q line, while DCL1 RNase domain a fulfilled the first cleavage at the 3&#8242; arm in the DCL1 E1696Q line for the BTL-type pri-miRNAs. By contrast, the DCL1 RNase domains a and b cut the 5&#8242; and 3&#8242; arms, respectively, for the LTB-type pri-miRNAs. This finding aligns with the in vitro conclusion from our previous work <ref type="bibr">6</ref> . Taken together, our degradome analysis of pri-miR864 and pri-miR162a served as a benchmark for annotating the processing patterns and exploring new features of all 147 bona fide pri-miRNAs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>The processing atlas of 147 pri-miRNAs</head><p>Pri-miRNAs (37) with the BTL pattern. We found that 37 of 147 pri-miRNAs adhered to the BTL processing mode (Fig. <ref type="figure">2a-c</ref> and Supplementary Fig. <ref type="figure">2</ref>). Among these, 21 pri-miRNAs were previously demonstrated <ref type="bibr">12,</ref><ref type="bibr">14,</ref><ref type="bibr">15</ref> and were re-validated unambiguously here (Fig. <ref type="figure">2a</ref> and Supplementary Fig. <ref type="figure">2a</ref>,b(i)-(iii)). These pri-miRNAs include pri-miR161; pri-miR165a; pri-miR166f and pri-miR166g; pri-miR167a, pri-miR167c and pri-miR167d; pri-miR168b; pri-miR169a; pri-miR170; pri-miR172b; pri-miR393b; pri-miR395b and pri-miR395f; pri-miR397a and pri-miR397b; pri-miR399a, pri-miR399d and pri-miR399f; pri-miR827; and pri-miR864.</p><p>Pri-miR173, pri-miR399b and pri-miR156f were re-annotated to the BTL pattern. Notably, pri-miR173 predominantly generates the canonical miR173/* as reported previously <ref type="bibr">14,</ref><ref type="bibr">15</ref> (Fig. <ref type="figure">2b</ref> and Supplementary Fig. <ref type="figure">2c</ref>). However, we observed an additional cutting site with a 5-nt shift at a considerable ratio (14%), leading to a new species of miRNA/* that is clearly enriched in AGO2 complexes <ref type="bibr">29,</ref><ref type="bibr">30</ref> (Supplementary Fig. <ref type="figure">2d</ref>). In line with this, a target (AT4G20460) of the new miRNA could be readily recovered from our degradome-seq data (Supplementary Fig. <ref type="figure">2e</ref>). We thus designated the new species of miRNA/* as 5p-2 in the 5&#8242; arm and 3p-2 in the 3&#8242; arm (red brackets), whereas we refer to the previously annotated products as 5p-1 in the 5&#8242; arm and 3p-1 in the 3&#8242; arm (blue brackets) (Fig. <ref type="figure">2b</ref>). Similarly, pri-miR399b and pri-miR156f also exhibited two distinct starting sites for cleavage and generated two pairs of functional miRNA/*s (5p-1/3p-1 in blue and 5p-2/3p-2 in red; Supplementary Fig. <ref type="figure">2f</ref>,<ref type="figure">g</ref>).</p><p>We also report the BTL processing pattern for 13 species of pri-miRNAs whose processing directions were unknown. They are pri-miR158b, pri-miR776, pri-miR777, pri-miR780a, pri-miR831, pri-miR847, pri-miR851a, pri-miR857a, pri-miR859, pri-miR867, pri-miR4240, pri-miR5013 and pri-miR5020b (Fig. <ref type="figure">2c</ref> and Supplementary Fig. <ref type="figure">2h</ref>-j(i),(ii)). Most of these pri-miRNAs showed clear and f, Pri-miR160a is 1 of 13 examples whose canonical LTB processing patterns are fully validated here. g, Pri-miR395d is one of five examples that are revised to be processed through LTB. h, Pri-miR4245 is 1 of 20 examples whose processing patterns are newly identified to follow the LTB direction. i, Pri-miR159a is one of five examples whose canonical SLTB processing patterns are fully validated here. j, Pri-miR839a is one of eight newly identified examples following the SLTB pattern. k, Pri-miR166a is 1 of 24 canonical pri-miRNAs with bidirectional processing patterns. l, Pri-miR396a is one example of five re-annotated bidirectional pri-miRNAs with new productive and abortive products. m, Pri-miR825 is one example of 14 re-annotated bidirectional pri-miRNAs with new productive products. It can follow BTL processing to produce a non-canonical but abundant pair of miRNA/* (5p-2/3p-2) besides the previously reported LTB pattern that produces canonical miR825/* (5p-1/3p-1). The cleavage site marked with an asterisk was also discovered in the WT <ref type="bibr">14</ref> . In a-m, the percentages indicate the relative ratios of cleavage sites of individual pri-miRNAs by two semi-active DCL1 variants. Black dotted arrows denote the intended cutting positions that are not detectable in our system. Thicker arrows indicate higher cutting ratios. The pri-miRNA names in orange, blue and green indicate re-validated, re-annotated and newly identified ones, respectively. The five patterns (BTL, SBTL, LTB, SLTB and bidirectional) are summarized in the middle.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Resource</head><p><ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> </p><p>19% 5% 5p-2 3p-2 5p-1 3p-1 40% 26% 14% 5p-2 5p-1 68% 3p-2 11.9% 3p-1 94.7% 5.8% 88% 12.6% 69.3% 40% 1.9% 3p-2 5p-1 3p-1 5p-2 * 19% 24% 1.3% 81% 3p-2 5p-1 3p-1 5p-2 5p-3 3p-3 23.7% 78% 5.7% 62.1% 5p-2 5p-1 3p-1 Pri-miR167a Pri-miR173 Pri-miR780a Pri-miR166a Pri-miR396a Pri-miR825 BTL (37) LTB (38) SLTB (13) SBTL (16) Bidirectional (43) 77% 50% 3p-2 5p-2 5p-1 3p-1 32% 16.7% 16.7% 27.3% 17.6% 95.1% 16% 68.5% 70% 53% 12% 71% Pri-miR169j Pri-miR823 Pri-miR160a Pri-miR395d Pri-miR4245 49.5% 18.6% 94.7% 4.3% 7.7% 4.9% 30% 34% 3.3% 4.6% 2.5% 3% 17.3% 17.5% Pri-miR159a Pri-miR839a c d e f g h i j b a m l k Nature Plants Resource <ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> </p><p>unique first cleavage sites at the base regions and generated uniform pre-miRNAs for the next round of processing. However, there were some exceptions. Pri-miR780a showed two initial cutting sites proximal to the base region: the predominant one, which produced a large amount of miR780/* (5p-1/3p-1, in blue), and a minor one that generated a new species of miRNA/* (5p-2/3p-2, in red). Both products were functional, because sufficient read counts could be recovered from AGO1-RISC (Fig. <ref type="figure">2c</ref> and Supplementary Fig. <ref type="figure">2h</ref>,<ref type="figure">i</ref>). Similarly, three other pri-miRNAs, pri-miR831, pri-miR847 and pri-miR851a, also displayed two or more types of the BTL processing pattern and new functional or abortive forms of miRNA/*s (Supplementary Fig. <ref type="figure">2j(i)</ref>).</p><p>Pri-miRNAs ( <ref type="formula">16</ref>) with the SBTL pattern. Certain pri-miRNAs have extended stems and undergo sequential cleavages before reaching the positions of miRNA/*s. Approximately 13 pri-miRNAs have been reported to exhibit the SBTL processing pattern <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref><ref type="bibr">[15]</ref> and were re-validated here. These pri-miRNAs include pri-miR163; pri-miR169b, pri-miR169c, pri-miR169f, pri-miR169g, pri-miR169i, pri-miR169j, pri-miR169l, pri-miR169m and pri-miR169n; pri-miR394a; pri-miR402; and pri-miR447b (Fig. <ref type="figure">2d</ref> and Supplementary Fig. <ref type="figure">3a</ref>,b(i),(ii)). We also extended the list to include pri-miR823, pri-miR822 and pri-miR845a (Fig. <ref type="figure">2e</ref> and Supplementary Fig. <ref type="figure">3c</ref>,<ref type="figure">d</ref>).</p><p>Pri-miRNAs (38) with the LTB pattern. A total of 13 pri-miRNAs have been reported to undergo the LTB mode of processing <ref type="bibr">12,</ref><ref type="bibr">14,</ref><ref type="bibr">15</ref> and were fully validated in our study (Fig. <ref type="figure">2f</ref> and Supplementary Fig. <ref type="figure">4a</ref>,b(i),(ii)). These pri-miRNAs include pri-miR156a-pri-miR156e and pri-miR156h, pri-miR160a-pri-miR160c, pri-miR162a, pri-miR171b and pri-miR171c, and pri-miR400. We re-annotated the LTB processing pattern for pri-miR395d, pri-miR159c, pri-miR390b, pri-miR408 and pri-miR779a (Fig. <ref type="figure">2g</ref> and Supplementary Fig. <ref type="figure">4c</ref>,<ref type="figure">d</ref>).</p><p>We deciphered the LTB processing mode for 20 new species of pri-miRNAs (Fig. <ref type="figure">2h</ref> and Supplementary Fig. <ref type="figure">4e</ref>,f(i)-(iii)). They include pri-miR775, pri-miR833a, pri-miR841a, pri-miR848a, pri-miR849a, pri-miR852, pri-miR853, pri-miR1888a, pri-miR2112, pri-miR3440b, pri-miR4245, pri-miR5012, pri-miR5017, pri-miR5021, pri-miR5026, pri-miR5028, pri-miR5634, pri-miR5637, pri-miR5640 and pri-miR5654. A representative case is pri-miR4245, which produced two sets of miRNA/*s: one was the canonical 5p-1/3p-1 (in blue), while the other was the newly identified 5p-2/3p-2 (in red); they could both be recovered from AGO-RISC (Fig. <ref type="figure">2h</ref> and Supplementary Fig. <ref type="figure">4e</ref>). Similarly, pri-miR775, pri-miR848a, pri-miR3440b and pri-miR5017 (Supplementary Fig. <ref type="figure">4f</ref>(i),(ii)) all showed two or three kinds of the LTB processing pattern that generated new species of miRNA/*s in addition to the previously reported miRNA/*s. Notably, the processing site for the canonical miR848/* (5p-1/3p-1, in blue) was not detectable in our study, although very low reads of the miRNA/* could be recovered from AGO1-RISC. In contrast, the new set of miRNA/* (5p-2/3p-2, in red) can be recovered from AGO1-RISC even with more reads (Supplementary Fig. <ref type="figure">4f(i)</ref>).</p><p>Pri-miRNAs (13) with the SLTB pattern. Five pri-miRNAs, pri-miR159a and pri-miR159b and pri-miR319a-pri-miR319c, were previously reported to have the SLTB pattern and were further validated in our study (Fig. <ref type="figure">2i</ref> and Supplementary Fig. <ref type="figure">5a</ref>,<ref type="figure">b</ref>). We also extended this pattern to eight new pri-miRNAs including pri-miR835a, pri-miR839a, pri-miR840a, pri-miR856, pri-miR869a, pri-miR1888b, pri-miR5024 and pri-miR5656 (Fig. <ref type="figure">2j</ref> and Supplementary Fig. <ref type="figure">5c</ref>,<ref type="figure">d</ref>). Interestingly, pri-miR869a could produce a new set of miRNA/* (5p-2/3p-2, in red) that was even more abundant than the annotated 5p-1/3p-1 (in blue) (Supplementary Fig. <ref type="figure">5d</ref>). Pri-miR1888b produced two sets of miRNA/*s (5p-1/3p-1, in blue; 5p-2/3p-2, in red) which could be detected in sRNA-seq data. However, neither of them is identical to the predicted miRNA/*, which should be re-annotated (Supplementary Fig. <ref type="figure">5d</ref>). Additionally, pri-miR5656 did not produce the annotated duplex (5p-1/3p-1, in blue) but produced a new miRNA/* product, highlighted by red brackets (5p-2/3p-2), which can be recovered from AGO1-RISC (Supplementary Fig. <ref type="figure">5d</ref>). (43) with the bidirectional processing pattern. Many plant pri-miRNAs showed the bidirectional processing pattern due to the presence of terminal branched loops <ref type="bibr">6</ref> . One example is pri-miR166a, which could undergo BTL processing to produce the canonical miR166/* (5p-1/3p-1) or an LTB mode to produce an abortive product, 5p-2 (Fig. <ref type="figure">2k</ref> and Supplementary Fig. <ref type="figure">6a</ref>). In fact, this scenario could be extended to an additional 23 pri-miRNAs (Supplementary Fig. <ref type="figure">6b</ref>(i)-(iv)). Surprisingly, many founding pri-miRNAs such as pri-miR158a, pri-miR164b and pri-miR164c, pri-miR165b, and pri-miR166b displayed bidirectional processing. These pri-miRNAs do not have terminal branched loops, but they have big internal loops that serve as triggers for bidirectional processing (Supplementary Fig. <ref type="figure">6b(i)</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Pri-miRNAs</head><p>Pri-miR396a displayed a unique pattern of processing (Fig. <ref type="figure">2l</ref> and Supplementary Fig. <ref type="figure">6c</ref>): it underwent major BTL processing and produced the annotated miR396/* (5p-1/3p-1, in blue) that was loaded into AGO1-RISC. This pri-miRNA could also be processed in an LTB direction, yielding a new set of productive products (5p-2/3p-2, in red). The miR396/* (5p-2/3p-2) was sorted into AGO2 with considerable read numbers and had a new target (AT2G29340) for silencing (Supplementary Fig. <ref type="figure">6d</ref>,<ref type="figure">e</ref>). Moreover, this pri-miRNA could also be cut right in the middle of the miR396/* duplex with a moderate frequency (24%) but yielded a set of abortive products (5p-3/3p-3, in light green) (Fig. <ref type="figure">2l</ref>). This pattern with three initial processing sites was also detected for pri-miR162b, pri-miR172c, pri-miR167b and pri-miR844a (Supplementary Fig. <ref type="figure">6f</ref>(i),(ii)).</p><p>We also found that 14 pri-miRNAs could employ a bidirectional processing pattern but produce two sets of miRNA/*s that are both functional. For instance, pri-miR825 could be cleaved in an LTB direction, producing the annotated miR825/* (blue bracket) (Fig. <ref type="figure">2m</ref> and Supplementary Fig. <ref type="figure">6g</ref>). This pri-miRNA could also be processed in a BTL manner, producing the new functional 5p-2/3p-2, which was sorted into AGO2 (red brackets) (Fig. <ref type="figure">2m</ref> and Supplementary Fig. <ref type="figure">6h</ref>). The other 13 pri-miRNAs with such a pattern included pri-miR157c, pri-miR169d, pri-miR171a, pri-miR393a, pri-miR396b, pri-miR398a, pri-miR403, pri-miR472a, pri-miR824, pri-miR868a, pri-miR3933, pri-miR5014a and pri-miR8183 (Supplementary Fig. <ref type="figure">6i</ref>(i)-(iv)).</p><p>Taken together, we have validated the previously reported processing patterns for 52 pri-miRNAs <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref><ref type="bibr">[15]</ref><ref type="bibr">31</ref> , while re-annotating the processing modes for 43 pri-miRNAs and reporting the processing patterns of 52 new pri-miRNAs that were not revealed before (Fig. <ref type="figure">2</ref> and Supplementary Table <ref type="table">1</ref>). We sorted all 147 pri-miRNAs into five categories according to their cleavage patterns and present these new productive miRNA/*s as well as their potential targets in Supplementary Table <ref type="table">2</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>SE and HYL1 act in defining the first cutting sites of pri-miRNAs</head><p>We next assessed the influence of HYL1 and SE on the determination of the initial processing sites on pri-miRNAs. To this end, we re-sorted the 147 bona fide pri-miRNAs into three patterns: BTL-, LTB-and sequential-processed pri-miRNAs (including SBTL and SLTB; Methods). Since the sites with the highest cutting ratios in the DCL1 E1507Q and DCL1 E1696Q lines represent the first cleavage positions in pri-miRNAs, we designated positions as '0' for all pri-miRNAs processed in either the BTL or LTB mode. We then extended from the '0' positions to 100 nt upstream and downstream at 10-nt intervals and counted the number of initial cleavages that fell at each interval of pri-miRNAs in Col-0, DCL1 E1507Q , DCL1 E1696Q , hyl1-2 and se-2 (Fig. <ref type="figure">3a</ref>,<ref type="figure">b</ref>). Importantly, ensemble analysis of all pri-miRNAs did not reveal significant accumulation of the initial cutting sites across both BTL and LTB pri-miRNAs in hyl1-2 and se-2. These results suggest that the first cut selections of most pri-miRNAs are dependent on HYL1/SE (Fig. <ref type="figure">3a</ref>,<ref type="figure">b</ref>). Alternatively, these pri-miRNAs might not be accessible to DCL1 for cleavage in the mutants.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Resource</head><p><ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> </p><p>Certain pri-miRNAs, however, indeed exhibited the first cutting sites at position '0' in the se and hyl1 mutants, implying that the absence of SE or HYL1 does not impact the selection of the first cutting sites for these pri-miRNAs (Fig. <ref type="figure">3a</ref>,<ref type="figure">b</ref>). The numbers of such pri-miRNAs in the se and hyl1 mutants are lower than those in Col-0, indicating that the selection of first cleavages for the missing pri-miRNAs can be dependent on SE and HYL1 (Fig. <ref type="figure">3a</ref>,<ref type="figure">b</ref>). In this scenario, we re-sorted pri-miRNAs on the basis of the impact of SE/HYL1 on the selection of the initial cleavage sites. First, certain pri-miRNAs did not exhibit any cutting events in their transcript regions in the se or hyl1 mutants. These pri-miRNAs could be genuinely classified as SE/HYL1-dependent, as their initial processing fully relies on SE or HYL1. With this definition, regions in the pri-miRNA diagrams represent the miRNA/* duplex. The data are from two biological replicates for each sample. c, The numbers (top) and processing patterns (bottom) of pri-miRNAs that are impacted by SE and/or HYL1 proteins. Note that only the first cutting sites of BTL patterns are indicated by black arrowheads. d, Venn diagram showing overlapping of miRNAs that are dependent on or independent of SE and HYL1. e, List of 11 pri-miRNAs for which SE and HYL1 exert opposite impacts on their processing.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head><p>Resource <ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> </p><p>approximately 41 and 29 of 147 pri-miRNAs are fully dependent on SE and HYL1, respectively (Fig. <ref type="figure">3c</ref>). By contrast, one group of pri-miRNAs (20 or 25 of 147 pri-miRNAs, respectively) exhibited identical or even higher cutting ratios at position '0' in the se or hyl1 mutants compared with those of Col-0 (Fig. <ref type="figure">3c</ref> and Supplementary Fig. <ref type="figure">7a</ref>,<ref type="figure">b</ref>). This pattern strongly indicates that the initial processing of these pri-miRNAs is genuinely independent of either SE or HYL1. Further analysis identified two other groups that fell between dependence on and independence of SE/HYL1. Approximately 31 or 30 pri-miRNAs could undergo the first cleavages into pri-miRNA regions in the se or hyl1 mutants, but the first cutting sites could be in the lower stem, miRNA/* duplex, upper stem and even terminal loop regions for both BTL and LTB pri-miRNAs (Fig. <ref type="figure">3a-c</ref> and Supplementary Fig. <ref type="figure">7a</ref>,<ref type="figure">b</ref>). This pattern indicates that SE/ HYL1 affects the processing accuracy of pri-miRNAs, and the absence of SE/HYL1 leads to random cutting throughout the pri-miRNAs. This pattern also suggests that SE/HYL1 might act as a molecular ruler for this type of pri-miRNAs. Finally, one group of pri-miRNAs could be also cleaved at position '0' but with lower cutting ratios in se/hyl1 mutants, indicative of a significant decrease in cleavage efficiency. This type of pri-miRNAs could be classified as SE/HYL1-partially-dependent, and the numbers are approximately 16 or 18 of 147 pri-miRNAs (Fig. <ref type="figure">3a-c</ref> and Supplementary Fig. <ref type="figure">7a</ref>,<ref type="figure">b</ref>). This group of pri-miRNAs might require relaying through SE/HYL1 to DCL1 to be processed. Next, we investigated the impact of SE/HYL1 on the initial processing of 28 pri-miRNAs following SBTL/SLTB processing patterns. By designating the first nucleotide of the miRNA/* duplex as position '0', we observed that the processing of the pri-miRNAs relied on SE/HYL1 to different extents considering the correct selection of the initial cutting sites and high cutting ratios in the se/hyl1 mutants compared with Col-0 (Supplementary Fig. <ref type="figure">7c</ref>,<ref type="figure">d</ref>). Together, these findings are well in line with the previously published sRNA data <ref type="bibr">17,</ref><ref type="bibr">32</ref> , which demonstrate that not all miRNA levels are significantly decreased in se or hyl1 mutants. Of course, the temporal/spatial expression difference between MIR loci and SE/HYL1 might also contribute to the variation in the miRNAs' dependency on the proteins.</p><p>In summary, we reclassified pri-miRNAs into four categories on the basis of their initial cleavage's dependence on the two proteins: SE/HYL1-fully-dependent, SE/HYL1-dependent accuracy-affected, SE/HYL1-dependent efficiency-affected and SE/HYL1-independent (Fig. <ref type="figure">3c</ref>). Among the 116 pri-miRNAs with enough detectable reads, 41, 31, 16 and 20 fell into these four categories regarding SE's dependence, respectively. In parallel, 29, 30, 18 and 25 of 116 pri-miRNAs displayed varied dependences on HYL1, respectively (Fig. <ref type="figure">3c</ref>). When combining all SE-or HYL1-fully-dependent or partially dependent pri-miRNAs, we observed that 68 pri-miRNAs are reliant on both SE and HYL1, exemplified by pri-miR398b, as their cleavage sites were not detectable at all (Fig. <ref type="figure">3d</ref> and Supplementary Fig. <ref type="figure">7e</ref>), whereas 15 pri-miRNAs are entirely independent of SE and HYL1, exemplified by pri-miR319a, because their processing patterns were exactly the same in se/hyl1 mutants as in the DCL1 semi-active lines (Fig. <ref type="figure">3d</ref> and Supplementary Fig. <ref type="figure">7f</ref>). Moreover, 11 pri-miRNAs showed different requirements regarding SE or HYL1 for processing (Fig. <ref type="figure">3e</ref>). Specifically, 2 of the 11 pri-miRNAs, pri-miR779a and pri-miR864a, required HYL1 but not SE. Conversely, the remaining 9 pri-miRNAs required SE but not HYL1 for their miRNA biogenesis, including pri-miR165b, pri-miR167a, pri-miR168a, pri-miR169e, pri-miR171a, pri-miR390a, pri-miR395b, pri-miR398c and pri-miR5014a (Fig. <ref type="figure">3e</ref> and Supplementary Table <ref type="table">3</ref>). The processing of these 9 pri-miRNAs showed independence of HYL1, probably due to the functional redundancy of five DRB proteins in Arabidopsis <ref type="bibr">33,</ref><ref type="bibr">34</ref> . This notwithstanding, these pri-miRNAs, whether dependent on or independent of SE or HYL1, did not exhibit a preference for cleavage patterns (BTL or LTB) (Supplementary Fig. <ref type="figure">7g</ref> and Supplementary Table <ref type="table">3</ref>). In summary, our degradome-seq of the se and hyl1 mutants revealed distinct impacts of SE and HYL1 on the first cutting sites for pri-miRNA processing and offers detailed insights into the molecular mechanisms underlying their contributions to the processing of specific pri-miRNAs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>In vivo and in silico RSS of pri-miRNAs largely differ</head><p>We next investigated how the initial cutting sites and processing patterns of individual pri-miRNAs are determined in vivo. To this end, we determined the RSS of pri-miRNAs in vivo using our optimized DMS-MaPseq method <ref type="bibr">28</ref> . Briefly, we treated three-week-old plants and flower tissues from Col-0, dcl1-9, se-1 and hyl1-2 with dimethyl sulfate (DMS), a chemical that can methylate unpaired adenines (As) and cytosines (Cs) located at ss or loop regions of RNA molecules. The DMS-induced lesions are decoded in reverse transcription by the thermostable group II intron reverse transcriptase (TGIRT) enzyme, which can induce random mismatches for the methylated As and Cs (Fig. <ref type="figure">4a</ref>). Bioinformatic analysis revealed that all biological replicates of Col-0, dcl1-9, hyl1-2 and se-1 largely exhibited consistently higher mismatch ratios of As and Cs than their mock-treated counterparts, indicating the reliability of the experimental strategy (Extended Data Fig. <ref type="figure">1a</ref>).</p><p>For 147 bona fide pri-miRNAs, we were able to obtain sufficient read counts (&#8805;500 reads) for 73, 77, 76 and 78 pri-miRNAs in Col-0, dcl1-9, se-1 and hyl1-2, respectively, but not for the rest of the pri-miRNAs due to primer specificity and/or extremely low expression levels. First, 22 nt were extended both upstream and downstream from the first cleavage sites (position '0'), which covers the lower stem regions for the BTL-type pri-miRNAs and the upper stem regions for the LTB-type ones, as well as the miRNA/* duplex regions. Interestingly, for the BTL-patterned pri-miRNAs, the distal regions of the lower stems, or the basal regions, exhibited higher DMS reactivities than those of the duplex regions (Fig. <ref type="figure">4b</ref>). This pattern indicates the presence of more unpaired nucleotides in the distal regions of the lower stems, which is in line with the structural features of pri-miRNAs. Conversely, for the LTB-patterned pri-miRNAs, the upper stems and terminal loop regions displayed significantly higher DMS reactivities than those of the duplex regions, indicative of single-strandedness in the upper regions of pri-miRNAs (Fig. <ref type="figure">4c</ref>). The duplex regions of LTB-patterned pri-miRNAs different structures in vivo and in silico are further divided into four categories on the basis of detailed structure differences. f,g, BTL-processed pri-miR170 and pri-miR397b (f) and LTB-processed pri-miR160a and pri-miR160c (g) are four representative examples of the 69 pri-miRNAs that show different structures between in vivo DMS-reactivity-based modelling (right) and in silico modelling (left). The annotated miRNA/* regions are shaded with the outlines in blue and pink. The black dotted arrow denotes an intended cutting position that was not detectable in our system. The black and grey arrows indicate the first cleavage sites for BTL-and LTB-patterned pri-miRNAs, respectively. The structural differences are highlighted in the blue dashed boxes. The residues with the top 5%, 25% and 70% DMS activities are labelled with red, yellow and cyan backgrounds, respectively. The remaining residues with mismatch ratios below 0.01% are labelled with a white background. Guanines (G) and uracils (U) are marked with a grey background. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head><p>Resource <ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> tended to display higher overall DMS reactivities, indicating more open-structured duplex regions than the BTL-patterned pri-miRNAs. This pattern is also found in pri-miRNAs with sequential processing patterns (Extended Data Fig. <ref type="figure">1b</ref>).</p><p>Next, we modelled the in vivo RSS of pri-miRNAs in Col-0 samples on the basis of an algorithm that combines both minimum free energy and DMS reactivity as constraints. We found that ~5.5% of pri-miRNAs (4 of 73) exhibited the same structures between DMS-reactivity-based structures (DRS) and RNAfold-predicted structures (RPS) (Fig. <ref type="figure">4d</ref>). These pri-miRNAs were pri-miR156a, pri-miR168a, pri-miR844a and pri-miR856a (Extended Data Fig. <ref type="figure">1c</ref>). In contrast, the vast majority, ~94.5% (69 of 73) demonstrated distinct structures between DRS and RPS. The most common differences involved the presence of more and larger internal loops or bulges in DRS than in RPS (Fig. <ref type="figure">4d</ref>,<ref type="figure">e</ref>, Extended Data Figs. <ref type="figure">2</ref><ref type="figure">3</ref><ref type="figure">4</ref><ref type="figure">5</ref><ref type="figure">6</ref><ref type="figure">7</ref><ref type="figure">8</ref>and Supplementary Table <ref type="table">4</ref>). For instance, BTL-type pri-miR170 and pri-miR397b harboured three internal loops (bulges) in the basal region according to DRS, instead of one big internal loop as predicted from in silico RNAfold (Fig. <ref type="figure">4f</ref>). LTB-processed pri-miR160a showed a larger terminal loop and one additional internal loop at its basal region in DRS compared with RPS (Fig. <ref type="figure">4g</ref>, left). Another LTB-cleaved example, pri-miR160c, obtained one additional internal loop (mismatch) in the top region and four internal loops (bulges) in the basal region in DRS, instead of having one big bulge in RPS (Fig. <ref type="figure">4g</ref>, right). Collectively, these findings highlight a substantial disparity between the in silico and in vivo structures that could help us understand the complexity of pri-miRNA processing in vivo.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>In vivo RSS better explains the first cutting sites of pri-miRNAs</head><p>Since Microprocessor tends to cut 15-17 nt away from the ssRNA-dsRNA reference junctions, and since a vast majority of plant pri-miRNAs seem to lack such structures in silico, we performed a systemic survey of RSS of DMS-reacted pri-miRNAs in vivo. Indeed, there were internal loops or bulges in both BTL-and LTB-patterned pri-miRNAs, and their distances from the first cleavage sites peaked at 15-17 nt with roughly normal distributions (Fig. <ref type="figure">5a</ref>).</p><p>Systemic assessment of all 73 pri-miRNAs with detectable in vivo RSS revealed that an additional ~5% of pri-miRNAs adapted to the optimized length of the molecular ruler (15-17 nt) for BTL-patterned processing on the basis of in vivo RSS compared with in silico folding (Extended Data Fig. <ref type="figure">9a</ref> and Fig. <ref type="figure">5a</ref>, left). For instance, RNAfold predicted that pri-miR397a displayed 6-nt and 19-nt distances from the internal loop and ss basal segment to the initial processing site, respectively, while in vivo RSS showed that the pri-miRNA obtained a new bulge (10 nt away), and a new internal loop (14 nt away) in the lower stem that is closer to 15 nt and thus could more efficiently guide BTL processing (Fig. <ref type="figure">5b</ref>, <ref type="figure">left</ref>). For another BTL-type pri-miR851a, two internal loops in the lower stem were detected from in vivo RSS, serving as preferable reference sites (16 nt and 9 nt) for Microprocessor versus in silico ones (9 nt and 3 nt) (Fig. <ref type="figure">5b</ref>, <ref type="figure">right</ref>).</p><p>For the LTB-patterned pri-miRNAs, we found that an additional ~13% contained new structural elements that could serve as more optimal molecular rulers (15-17 nt) on the basis of in vivo RSS compared with RPS (Extended Data Fig. <ref type="figure">9b</ref> and Fig. <ref type="figure">5a</ref>, right). For example, in vivo RSS showed that pri-miR390b now exhibited a new small bulge in the upper stem and thus introduced a 16-nt ruler to guide LTB processing. Moreover, another 10-nt distance between a new internal loop and the initial cleavage appeared in DRS. However, both the 16-nt and 10-nt lengths are obscure in the RPS (Fig. <ref type="figure">5c</ref>, left). Another LTB-type pri-miR4245 presented a new bulge in the upper stem that served as a more optimal 15-nt reference for a new species of miRNA/* from DRS than the 20-nt distance in RPS for one set of cuttings. In addition, this same new bulge could introduce an 11-nt length to another set of cuttings, instead of the 7-nt distance in RPS (Fig. <ref type="figure">5c</ref>, right). Taken together, approximately 5% and 13% more BTL-and LTB-patterned pri-miRNAs presented more optimal reference sites for DCL1 in in vivo RSS analysis than in silico, facilitating the activity of Microprocessor (Fig. <ref type="figure">5b</ref>,<ref type="figure">c</ref>, Extended Data Fig. <ref type="figure">9a</ref>,b and Supplementary Table <ref type="table">5</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>New features of internal loops/bulges and DCL1's first cuts</head><p>Intriguingly, both BTL-and LTB-patterned pri-miRNAs tended to harbour additional internal loops or bulges nearby that are approximately 9-11 nt away from the first cleavages (Fig. <ref type="figure">5d</ref>,<ref type="figure">e</ref>). This characteristic is more pronounced in LTB-processed pri-miRNAs, as nearly the same numbers of pri-miRNAs contained the two loops/bulges that are 9-11 or 15-17 nt away from the initial processing sites with 23 species in common. Similarly, the corresponding numbers for BTL-patterned pri-miRNAs were 23 and 40, respectively, with 20 in common (Extended Data Fig. <ref type="figure">9c</ref>). These results indicate that Microprocessor might have more flexibility to adapt to different locations of internal loops as observed in vitro <ref type="bibr">6</ref> . Of course, whether additional internal loops/bulges serve as new hidden or additional molecular rulers or regulatory elements awaits future investigation.</p><p>Furthermore, we explored the structural features of the first cutting positions for all 73 pri-miRNAs with DRS in Col-0. Overall, approximately 77% of pri-miRNAs had their first cleavage sites located in an unpaired region, comprising small internal loops or bulges. Around 16% of pri-miRNAs had their first cutting sites situated near internal loops, while only 7% had their first cleavages at fully paired regions (Fig. <ref type="figure">5f</ref>, <ref type="figure">left</ref>). Further characterization of the first cutting sites based on different processing patterns revealed clear differences between BTLand LTB-processed pri-miRNAs. For the BTL pattern, the first cleavage sites were predominantly located in unpaired regions (82%), 16% had their first cutting sites situated near internal loops and only 2% showed their first cleavage at paired regions (Fig. <ref type="figure">5f</ref>, <ref type="figure">middle</ref>). These results indicate that Microprocessor predominantly prefers to place the first cleavage sites at unpaired regions of BTL pri-miRNAs. By contrast, for LTB pri-miRNAs, the proportion of first cutting sites located in unpaired regions is lower (69%, right), and the proportion of first cutting sites in Fig. <ref type="figure">5</ref> | DRS provides more meaningful interpretation for determination of the initial cleavages of pri-miRNAs than RPS. a, Profiling of distances from the reference dsRNA-ssRNA junction sites derived from both RNAfold and DMS-MaPseq to the first cleavage sites revealed that ~15-17 nt is the predominant molecular ruler length for BTL-processed (left) and LTB-processed (right) pri-miRNAs. b, Pri-miR397a (left) and pri-miR851a (right) are two representative BTLtype examples that have more optimal distances for DCL1 processing between the internal reference regions and the first cleavage sites (black arrows) detected in DRS (right) than predicted from RPS (left). The black and grey brackets show 15-17-nt and 9-11-nt lengths from the reference loops, shown in black and grey dashed boxes, to the first cutting sites, respectively. c, Pri-miR390b (left) and pri-miR4245 (right) are two representative LTB-type examples that have more optimal distances for DCL1 processing between the internal reference regions and the first cleavage sites (grey arrows) detected in DRS (right) than predicted from RPS (left). The black dotted arrow denotes an intended cutting position that was not detectable in our system. d, Both RNAfold and DMS-MaPseq showed that pri-miRNAs typically harboured extra internal loops or bulges positioned approximately 9-11 nt away from their initial cleavage sites for BTL-processed (top) and LTB-processed (bottom) pri-miRNAs. e, Reanalysis of the cryogenic electron microscopy density map of the DCL1-pri-miR166f complex from published data <ref type="bibr">24</ref> suggests the presence of new binding pockets for additional internal loops/bulges that might be 9-11 nt away from the first cleavage sites (black arrowhead). Dark blue represents positively charged surfaces of DCL1. The blue lines indicate three different bulges in the lower stem of pri-miR166f. The colour scheme for the different domains of DCL1 is the same as in Fig. <ref type="figure">1b</ref>. f, Parallel DMS-MaPseq and degradome-seq analyses show that the first cleavage sites are predominantly located at the unpaired regions for BTL-processed pri-miRNAs, whereas the pattern is less pronounced for LTB-typed pri-miRNAs. Solid and hollow circles represent paired and unpaired nucleotides, respectively. Solid blue and pink circles represent the partial miRNA/* duplex. paired regions is higher (17% for near internal loops and 14% for paired regions) (Fig. <ref type="figure">5f</ref>, right). Again, Microprocessor appears to have less preference in selecting the first cleavage sites for LTB-type pri-miRNAs than for BTL-type ones. These results indicate that artificial miRNA design entails more stringent selection of the first cutting site when BTL-patterned backbones are used over LTB-patterned backbones.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DCL1, SE and HYL1 act in shaping the structures of pri-miRNAs</head><p>We next assessed whether and how DCL1, SE and HYL1 impact pri-miRNAs' structures. We also computed the overall DMS reactivities for all common detectable BTL and LTB pri-miRNAs in Col-0, dcl1-9, se-1 and hyl1-2. Interestingly, ensemble DMS reactivities for both BTL (Fig. <ref type="figure">6a</ref>) and LTB (Fig. <ref type="figure">6b</ref>) patterns in all three mutants were obviously elevated in comparison with Col-0. Furthermore, we calculated the Gini index for pri-miRNAs, where higher values indicate larger structureomes. The Gini index of pri-miRNAs in hyl-2 and se-1 showed a significant decrease, and that in dcl1-9 showed a slight decrease, compared with those in Col-0 (Extended Data Fig. <ref type="figure">10a</ref>). These results suggest that the structures of pri-miRNAs are more open in these mutants than in Col-0. Since the prevailing view is that ribonucleoprotein complexes do not seem to impact the impermeability and reactivity of DMS with bound RNA <ref type="bibr">35</ref> , the more open structures or heterogeneity of pri-miRNAs in the mutants might result from conformational dynamics or from lack of function of helicases that are recruited through SE or HYL1, or DCL1 itself. Subsequently, we modelled RSS of selected pri-miRNAs in the three mutants using RNAstructure software constrained by DMS reactivity. Pri-miR395f, a representative of the BTL-processed pri-miRNAs, displayed a compact lower stem, a 2-nt internal loop in the miRNA/* duplex and two internal loops in the upper stem in Col-0. In contrast, this pri-miRNA exhibited an enlarged internal loop in the lower stem and a small but new internal loop in the proximity of miRNA/*, but no internal loop in the miRNA/* in dcl1-9. On the other end, two small but new internal loops occurred in dcl1-9, replacing one internal loop observed in Col-0. Similarly, pri-miR395f also displayed distinct structures in the hyl1-2 and se-1 mutants, reflected by loose-shaped upper stems among other changes in the lower stems and miRNA/* regions (Fig. <ref type="figure">6c</ref>).</p><p>Pri-miR156a, a representative of the LTB-patterned pri-miRNAs, displayed similar folding in dcl1-9 and Col-0 despite the seemingly increased DMS reactivity in the basal areas. This result suggests that there might be no helicase directly involved in this situation, leading to the lack of obvious structural changes in dcl1-9. However, this pri-miRNA clearly exhibited more unpaired regions in the basal regions and/or upper stem in hyl1-2 and se-1 than in Col-0 (Fig. <ref type="figure">6d</ref>). Additional examples including SBTL-processed pri-miR447b, SLTB-processed pri-miR319a and bidirectionally processed pri-miR166a displayed varying degrees of structural changes in dcl1-9, hyl1-2 and se-1 compared with Col-0 (Extended Data Fig. <ref type="figure">10b-d</ref>). These findings collectively suggest that DCL1, SE and HYL1 directly or indirectly modulate the RSS of pri-miRNAs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head><p>Understanding miRNA biogenesis has been a major challenge for plant miRNAs, as their substrates display profound heterogeneity in structures and lengths <ref type="bibr">[36]</ref><ref type="bibr">[37]</ref><ref type="bibr">[38]</ref> . Here we managed to identify the precise first cutting sites for all annotated pri-miRNAs through a unique strategy of degradome-seq of DCL1 dominant-negative lines. We were also able to draw a comprehensive atlas of processing patterns of all pri-miRNAs in Arabidopsis. One unexpected finding is that only 147 of 326 annotated miRNAs can now be unambiguously validated as bona fide species at this stage. Importantly, we clarified that nearly 25% (81 of 326) of pri-miRNAs are not bona fide substrates of DCL1. Additionally, 98 pri-miRNAs do not have reads in degradome-seq, and their authenticity awaits future clarification (Fig. <ref type="figure">1h</ref>).</p><p>Precise identification of the first cutting sites for all bona fide pri-miRNAs enabled us to survey their processing patterns. While we validated the processing patterns of 52 pri-miRNAs (35%) from earlier work in the field <ref type="bibr">12,</ref><ref type="bibr">14,</ref><ref type="bibr">15,</ref><ref type="bibr">31</ref> , we re-annotated or newly reported the processing modes for 95 pri-miRNAs (65%) (Fig. <ref type="figure">2</ref>). Systematic investigation of 147 true pri-miRNAs now places them into BTL, SBTL, LTB, SLTB and bidirectional processing types, whose shares are 37, 16, 38, 13 and 43, respectively (Fig. <ref type="figure">7a</ref>). Importantly, 28 pri-miRNAs can produce new species of productive miRNA/*s as they are born by Microprocessor and loaded into AGOs. We have also pinpointed the cognate mRNA targets for 10 of the 28 newly identified miRNAs (Supplementary Figs. <ref type="figure">2</ref> and <ref type="figure">6</ref>). The physiological implications of these new species of miR-NAs, especially their impact on canonical miRNA homeostasis and functional connections between two sets of miRNA/*s from the same pri-miRNAs and between two sets of targets, would be exciting topics for further studies.</p><p>Furthermore, DMS-MaPseq largely re-shaped our understanding of plant pri-miRNA processing from both structural and sequence perspectives. First, in vivo RSS exhibited significant differences from the in silico predicted structures. These differences are underscored by the fact that approximately 95% of 73 pri-miRNAs with detectable RSS demonstrate a prevalence of larger or more loops/bulges in vivo than the in vitro RPS (Figs. <ref type="figure">4</ref> and <ref type="figure">7b</ref>). This observation is reminiscent of human pre-miRNAs that also display larger terminal loops in icSHAPE-MaP-determined RSS than in the theoretical model from miR-Base <ref type="bibr">39</ref> . Like their animal counterparts, plant pri-miRNAs possess more unpaired nucleotides in their RSS in vivo. Second, approximately 78% and 68% of BTL-and LTB-patterned pri-miRNAs are processed through the canonical molecular ruler (15-17 nt). These in vivo RSS features largely clarified ~5% additional BTL-and ~13% additional LTB-processed cleavage patterns that could not be previously explained on the basis of RNAfold structures. This notwithstanding, we also noticed that approximately 43% of BTL-and 61% of LTB-type pri-miRNAs harbour additional internal loops and bulges that are ~9-11 nt away from the initial cleavages (Fig. <ref type="figure">7b</ref> and Extended Data Fig. <ref type="figure">9a</ref>,<ref type="figure">b</ref>). These additional internal loops and bulges seem to fit into the new pockets of DCL1 found through the reanalysis of DCL1's cryogenic electron microscopy structure with pri-miR166f (Fig. <ref type="figure">5e</ref>) <ref type="bibr">24</ref> . DCL1 clearly uses a binding pocket in the PAZ domain to recognize the canonical reference internal loops of pri-miRNA that are 15-17 nt away from the cleavage site. However, DCL1 also harbours an ample and adaptable space that could accommodate the internal loops/bulges that are 9-11 nt away from the cleavage site (Fig. <ref type="figure">5e</ref>). This DCL1 structural feature underscores our observation that pri-miRNAs exhibit more and larger internal loops or bulges in DRS than in RPS (Fig. <ref type="figure">4d</ref>-g and Extended Data Figs. <ref type="figure">2</ref><ref type="figure">3</ref><ref type="figure">4</ref><ref type="figure">5</ref><ref type="figure">6</ref><ref type="figure">7</ref><ref type="figure">8</ref>). Third, we a b c d 5' 21 18 15 12 9 6 3 0 -3 -6 -9 -12 -15 -18 -21 -21 -18 -15 -12 -9 -6 -3 Relative position (nt) Relative position (nt) Relative position (nt) Relative position (nt) 0 3 6 9 12 15 18 21 -21 -18 -15 -12 -9 -6 -3 0 3 6 9 12 15 18 21 21 18 15 12 9 6 3 0 -3 -6 -9 -12 -15 -18 -21 0.075 0.050 0.025 0 0.050 DMS reactivity DMS reactivity 0.025 0 0.075 0.050 0.025 0 0.075 0.050 0.025 0 0.075 0.050 0.025 0 0.075 0.050 0.025 0 Col-0 dcl1-9 hyl1-2 se-1 Col-0 dcl1-9 hyl1-2 se-1 0.075 0.050 0.025 0 0.075 0.050 0.025 0 0.100 0.075 0.050 0.025 DMS reactivity DMS reactivity 0 0.100 0.075 0.050 0.025 0 0.100 0.075 0.050 0.025 0 0.100 0.075 0.050 0.025 0 0.100 5' Col-0 dcl1-9 Pri-miR395f hyl1-2 se-1 Col-0 dcl1-9 hyl1-2 se-1 Pri-miR156a Proportion of mismatch ratio (%) 100 95 70 0 Nature Plants Resource <ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> </p><p>observed that up to 77% of pri-miRNAs had their first cleavage sites at unpaired regions (Figs. <ref type="figure">5f</ref> and <ref type="figure">7b</ref>). This ratio is significantly higher than the earlier reported number (approximately 40%) <ref type="bibr">40</ref> , although the preference of DCL1 to process unpaired regions in the stem of pri-miRNAs has been appreciated before <ref type="bibr">6,</ref><ref type="bibr">41</ref> . These discoveries have provided new guidelines for designing artificial miRNAs: first, it will be more effective if the cleavage sites targeted by DCL1 are located on the internal unpaired regions; and second, the backbones of LTB-type pri-miRNAs seem to tolerate more mismatches or unpaired sequences in their duplex regions than the BTL-type ones. Taking pri-miR159a as an example <ref type="bibr">42,</ref><ref type="bibr">43</ref> , we propose to modify an amiR-backbone that resembles its DRS instead of its RPS (Extended Data Fig. <ref type="figure">10e</ref>). Parallel degradome-seq and DMS-MaPseq enabled us to revisit the role of SE and HYL1 in pri-miRNA processing in a physiological context. Whereas SE and HYL1 concertedly impact the initial cleavage sites, they each have independent functions. We discovered that SE and HYL1 have completely opposite effects on 11 pri-miRNAs, which had not been reported before (Fig. <ref type="figure">3e</ref>). HYL1-independent pri-miRNAs are more abundant than SE-independent ones, and this discrepancy may be offset by the presence of HYL1 homologues in plants. Furthermore, we did not observe the preference of SE or HYL1 for the processing of specifically patterned pri-miRNAs (such as BTL or LTB). It has been well established that SE and HYL1 impact miRNA biogenesis through various genetic and biochemical pathways <ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">25,</ref><ref type="bibr">26</ref> . We have recently reported that SE could recruit SWI2/SNF2 ATPase CHR2 to remodel pri-miRNA RSS to inhibit miRNA production <ref type="bibr">17</ref> . Here we can strengthen the idea that RSS can serve as a critical regulatory layer to control pri-miRNA processing. This concept can be highlighted by the observation that pri-miRNAs in dcl1-9, se-1 and hyl1-2 displayed higher DMS reactivity than those in Col-0, indicative of flexible RSS in vivo (Fig. <ref type="figure">6</ref> and Extended Data Fig. <ref type="figure">10</ref>). This result further suggests that RSS changes might also contribute to the inaccurate or abnormal processing of pri-miRNAs observed in se and hyl1 mutants. It is generally assumed at this stage that protein-RNA interaction negligibly impacts DMS activities on bound substrates <ref type="bibr">35,</ref><ref type="bibr">44,</ref><ref type="bibr">45</ref> . In this scenario, the alteration of RSS by DCL1 can be attributed to its inherent helicase function, while the impact of SE and HYL1 on RSS is probably indirect and occurs through the activities of their interacting helicases exemplified by CHR2. Otherwise, the current dogma that protein binding does not affect the chemical probing of RNA structures in vivo needs to be revisited.</p><p>In summary, parallel degradome-seq and DMS-MaPseq have provided a comprehensive atlas of pri-miRNA processing steps and the structural determinants for such processing. The dissection of connections between RSS features and first cleavage sites provides new guidelines for designing artificial miRNAs for more precise and efficient silencing of targeted transcripts. As a side outcome, these were different from RNAfold-predicted structures (RPS). The DRS better explains why DCL1 selects the first cutting sites (black arrows) that are 15-17 nt (black lines) away from the internal loops or bulges. DRS also detects additional internal loops or bulges that are 9-11 nt (grey lines) away from the first cleavage sites by DCL1. See 'Discussion' for details.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nature Plants</head><p>Resource <ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> findings unambiguously clarify the identities of the bona fide miRNAs. The ideas and approaches in this study can thus be readily adopted to different organisms.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Plant materials and growth conditions</head><p>Arabidopsis thaliana ecotype Columbia (Col-0), dcl1-9 (CS3828), se-1 (CS3257), se-2 (SAIL_44_G12) and hyl1-2 (SALK_064863) were used for this study as described previously <ref type="bibr">43</ref> . The pBA002a-P DCL1 -FM-gDCL1 E1507Q and pBA002a-P DCL1 -FM-gDCL1 E1696Q constructs were transformed into the dcl1-9 +/-background and screened by genotyping. The primers used for PCR identification are listed in Supplementary Table <ref type="table">6</ref>. All plants were grown on soil ( Jorry Gardener/LP5) or standard Murashige and Skoog (PhytoTech Labs, catalogue number M524) medium plates in a 16-hour light/8-hour dark cycle at 22 &#177; 1 &#176;C.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Vector construction</head><p>The pBA002a-P DCL1 -FM-gDCL1 E1507Q and pBA002a-P DCL1 -FM-gDCL1 E1696Q used for this study were constructed as follows: pBA-Flag-Myc-DCL1 E1507Q and pBA-Flag-Myc-DCL1 E1696Q (described previously <ref type="bibr">6</ref> ) were digested by BamHI (New England BioLabs, catalogue number R3136S)/PmeI (New England BioLabs, catalogue number R0560S), and the resultant fragments were ligated into the BamHI/PmeI-treated pBA002a-P DCL1 -FM-gDCL1 (ref. 17) to yield the pBA002a-P DCL1 -FM-gDCL1 E1507Q and pBA002a-P DCL1 -FM-gDCL1 E1696Q constructs; these were confirmed by sequencing using seq-F: AAGGATGAGGCAACAAATGG and seq-R: ACCTTCCAAGCAGCAGTTGT. They were then transformed into the dcl1-9 +/-background via the floral dip transformation method <ref type="bibr">46</ref> .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Western blot assays</head><p>Western blot analysis was performed with ten-day-old plants that were grown on Murashige and Skoog plates or three-week-old plants that were grown on soil. The extraction and experimental procedures were performed as described previously <ref type="bibr">47</ref> . The blots were detected with primary antibodies against Myc (Sigma-Aldrich, catalogue number C 3956, 1:10,000 dilution) and actin (Sigma-Aldrich, catalogue number A0480, 1:10,000 dilution). The secondary antibodies were anti-rabbit immunoglobulin G (Cytiva, catalogue number NA934v, 1:5,000 dilution) and anti-mouse immunoglobulin G (Cytiva, catalogue number NA931v, 1:5,000 dilution).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>sRNA blot assays</head><p>Total RNAs from ten-day-old seedlings were separated in 15% denaturing Urea-PAGE gels, followed by semi-dry transferring onto Hybond-N+ hybridization membranes (GE Healthcare, RPN303B). The membranes were UV crosslinked and hybridized with miRNA probes labelled with [&#947;-32 P] ATP (PerkinElmer), with U6 serving as a loading control. The membranes were covered with a phosphor imaging plate (GE Healthcare), and signals were detected with a Typhoon FLA7000 (GE Healthcare) as described previously <ref type="bibr">47</ref> . The probes used for sRNA blot are listed in Supplementary Table <ref type="table">6</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Library construction for degradome-seq</head><p>The RNA was extracted from three-week-old or flower tissues of Col-0, dcl1-9 +/-;DCL1 E1507Q , dcl1-9 +/-;DCL1 E1696Q , hyl1-2 and se-2 using TRIzol reagent (Sigma-Aldrich, catalogue number T9424) according to the manufacturer's instructions. We isolated poly(A)-RNA from total RNA using oligo(dT) dynabeads; then, an amount of 5 &#181;g of DNase-treated poly(A)-RNA was ligated to the 5&#8242; RNA adaptor (rGrUrUrCrArGrAr-GrUrUrCrUrArCrArGrUrCrCrGrArCrGrArUrC) using T4 RNA ligase 1 (New England BioLabs, catalogue number M0204S) according to the manufacturer's instructions. The ligation products were purified with RNAClean XP beads (Beckman Coulter, reference number A63987). Then, chimeric RNA was reverse transcribed by Superscript III Reverse Transcriptase (Invitrogen, CA) using the 3&#8242; adaptor with oligo(dT) (CCTTGGCACCCGAGAATTCCATTTTTTTTTTTTTTTTTTVN) and amplified by PCR with adaptors including index primers (Supplementary Table <ref type="table">6</ref>) for Illumina sequencing.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Degradome-seq analysis</head><p>The raw data (PE150) were initially filtered for quality control requiring a quality score &gt;25 at the 3&#8242; end and trimmed to remove the adaptor sequences by Cutadapt <ref type="bibr">48</ref> . Only Read1 reads from the PE150 data have cleavage information; these were retained for the following analysis. Clean reads over 50 nt were retained and aligned using hisat2 (ref. 49)  to the Arabidopsis Araport11 genome reference <ref type="bibr">50</ref> . The software feature-Counts <ref type="bibr">51</ref> was used to count the read number located in pri-miRNA with a 200-bp extension in both the upstream and downstream directions. Degradome-seq recovered roughly 13-21 million clean reads for each replicate, among which more than 74% could be uniquely mapped to the reference genome. A Pearson correlation between replicates exceeded 0.92, indicating high-quality and reproducible datasets. Data from two biological replicates were merged for the following analyses due to high repeatability.</p><p>To calculate the cutting ratio for each position of all pri-miRNAs, the read count difference for each nucleotide was first determined by subtracting the read count of the upstream position from that of the current position. The cutting ratio was calculated by dividing the read count difference from the total reads mapped to the current pri-miRNA. The highest cutting ratio position was defined as the first cleavage site for each pri-miRNA, and the processing directions were sorted into five different types on the basis of the location of the first cutting site.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Prediction of the in vitro RSS of pri-miRNA via RNAfold</head><p>The secondary structures were predicted from the RNAfold web server (<ref type="url">http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi</ref>) on the basis of minimum free energy and partition function. The annotated miRNA/* duplex sequences of pri-miRNAs collected from the miRBase website (<ref type="url">https://www.mirbase.org/browse/results/?organism=ath</ref>) were coloured with blue and pink backgrounds, respectively, on structure models using VARNAv3-93 (<ref type="url">https://varna.lisn.upsaclay.fr</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Determination of productive and abortive pri-miRNA products</head><p>We downloaded and repurposed previously published databases, which include regular sRNA-seq and AGOs-IP sRNA-seq data from both our laboratory and others <ref type="bibr">29,</ref><ref type="bibr">30,</ref><ref type="bibr">52</ref> . To ascertain whether these newly discovered products are functional (productive) or non-functional (abortive), we established the following criterion: if there are sufficient read counts (&#8805;20) supporting these new products, they are categorized as productive processing. If the read counts are not evident (&lt;20), these products are considered abortive cleavage.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Target prediction for new species of productive miRNAs</head><p>The command-line version of psRNATarget <ref type="bibr">53</ref> (<ref type="url">https://github.com/  jtremblay/mirnatarget</ref>) was used to predict targets for 28 newly discovered miRNAs against the Araport11 cDNA reference (<ref type="url">https://www.  arabidopsis.org</ref>). Predicted target regions with no more than one mismatch in the seed region were preserved for subsequent validation. Cutting sites from Col-0 degradome-seq were employed to ensure the prediction accuracy. Only predicted target regions overlapping with cutting sites (with read coverage &#8805;20 and cutting ratio &#8805;0.1) were considered as authentic targets of the new miRNAs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Assessing the impact of SE and HYL1 on pri-miRNAs' first cutting sites</head><p>The 147 bona fide pri-miRNAs were re-sorted into three patterns: 72 BTL-processed pri-miRNAs, 46 LTB-processed pri-miRNAs and 28 sequential-processed pri-miRNAs (including SBTL and SLTB). Pri-miR169d was excluded from the classification due to the presence Nature Plants Resource <ref type="url">https://doi.org/10.1038/s41477-024-01725-9</ref> of two cutting ratio sites with equal proportions (50%), which could introduce ambiguity in our computational analysis. All 42 bidirectionally processed pri-miRNAs were re-assigned to either (S)BTL (34 of the cases) or LTB (8 of the cases) depending on the directions the canonical miRNAs are produced by. The positions with the highest cutting ratios in the DCL1 E1507Q and DCL1 E1696Q lines were designated as '0' for all pri-miRNAs processed in either the BTL or LTB mode. The first cutting sites of RNase b, reflected in the DCL1 E1507Q line, are centred at position '0' in the top panel of Fig. <ref type="figure">3a</ref>, while the initial cutting sites of RNase a, reflected in the DCL1 E1696Q line, are illustrated in the bottom panel of Fig. <ref type="figure">3a</ref>. These results are well aligned with our in vitro results that the RNase b and a domains process the 5&#8242; and 3&#8242; arms of BTL-patterned pri-miRNAs, respectively <ref type="bibr">6</ref> . By contrast, an inverted pattern was shown for LTB-patterned pri-miRNAs, in which the RNase b and a domains preferably cleave the 3&#8242; and 5&#8242; arms of pri-miRNAs, reflected by the locations of the highest cutting ratios in the DCL1 E1507Q and DCL1 E1696Q lines, respectively (Fig. <ref type="figure">3b</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Pri-miRNA-specific DMS-MaPseq library preparation</head><p>For three-week-old plants, about 5 g of Col-0, dcl1-9, hyl1-2 and se-1 plants grown on soil were collected within one hour and completely covered in 40 ml of 1&#215; DMS reaction buffer (40 mM HEPES pH 7.5, 100 mM KCl and 0.5 mM MgCl 2 ) in a clean 50 ml Corning tube. DMS (Sigma, catalogue number D186309) was added to a final concentration of 1% as described previously <ref type="bibr">28</ref> . Mock treatment was performed by adding the same volume of deionized water. The samples were treated in the DMS reaction buffer or mock solution at room temperature under a vacuum for 15 min. To stop the reaction, a final concentration of 20% &#946;-mercaptoethanol (Sigma-Aldrich, catalogue number M6250) was added, and the mixture was incubated for 5 min under vacuum. After being washed three times with 50 ml of cold deionized water, the samples were immediately frozen with liquid N 2 and ground into powder.</p><p>For flower tissues, the same plants were grown on soil until flowering, and then samples were collected and treated with or without DMS using the same conditions as above except the incubation time under vacuum. Different time courses for DMS treatment including 14 min, 28 min, 42 min and 56 min were also tested; eventually, 56 min was chosen for the assay after comparison, because incubation times less than 56 min are not sufficient to distinguish the DMS-treated samples from the mock-treated samples, and total RNA appeared to start decaying by 56 min.</p><p>Total RNA was isolated using TRIzol reagent according to the manufacturer's instructions and then treated with TURBO DNase (Invitrogen by Thermo Fisher, catalogue number AM2239). For each sample, 2 &#181;g of DNase-treated RNA was mixed with gene-specific reverse transcription primers (2 pmol of each primer, up to ten gene-specific primers in one reaction). Then, 1 &#181;l of 10 mM dNTP and DEPC-H 2 O was added to a total volume of 13 &#181;l and incubated at 65 &#176;C for 5 min, then immediately put on ice for 2 min. Then, 4 &#181;l of 5&#215; First-Strand buffer (250 mM Tris-HCl pH 8.3, 375 mM KCl and 15 mM MgCl 2 ), 1 &#181;l of 0.1 M DTT (prepared fresh), 1 &#181;l of RNase inhibitor (Thermo Fisher, catalogue number AM2696) and 1 &#181;l of TGIRT-III (Ingex, catalogue number TGIRT50) were added. The mixture was incubated at 42 &#176;C for 30 min, and then reverse transcription proceeded at 60 &#176;C for 1.5 h. The reaction was stopped by adding 2.3 &#181;l of 1 M NaOH and heating the mixture at 98 &#176;C for 15 min, then immediately putting it on ice. After neutralization by 2.5 M HCl, H 2 O was added to a total volume of 30 &#181;l for each sample, followed by purification with 39 &#181;l of AMPure XP beads (Beckman Coulter, reference number A63881), which is intended for performing DNA cleanup and efficient removal of unincorporated dNTPs, primers, salts and other contaminants.</p><p>The cleaned cDNA was precipitated and resuspended in deionized water. Pri-miRNAs were then amplified using KOD hot start DNA polymerase (Millipore Sigma, catalogue number 71086) with gene-specific primers (Supplementary Table <ref type="table">5</ref>). PCR bands were gel purified and normalized according to band intensity before library construction. PCR products were mixed equally and fragmented into 50-200 bp using a sonication machine (Diagenode SA Plcoruptor) following the manufacturer's protocol. After purification using AMPure XP beads, the fragments were subjected to end repair, adenylation and adaptor ligation using Illumina adaptors (Supplementary Table <ref type="table">6</ref>), mainly following the published protocol. The fragments were barcoded through adaptor ligation. The ligation products were purified again by two steps of cleanup with AMPure XP beads. Next, the purified barcoded libraries were enriched by 11 cycles of PCR using KOD hot start DNA polymerase. Finally, the PCR products were cleaned using AMPure XP beads and sent for sequencing by NovaSeq PE150.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Pri-miRNA-specific DMS-MaPseq analysis</head><p>The raw fastq files were initially filtered for quality control requiring a quality score &gt;25 in the 3&#8242; end and trimmed to remove the adaptor sequences by Cutadapt <ref type="bibr">48</ref> . To further guarantee high sequence quality, Fastq_quality_filter from the Fastx-toolkit (<ref type="url">http://hannonlab.cshl.edu/  fastx_toolkit</ref>) was used to filter reads with low quality with the parameters q = 25 and p = 80, meaning that 80% of the nucleotides had a base call accuracy of more than 99.7%. High-quality reads over 50 nt were retained and mapped to the Arabidopsis Araport11 genome reference <ref type="bibr">50</ref> by tophat2 (ref. 54) with the parameters N = 15; read-gap-length, 10; read-edit-dist, 15; max-insertion-length, 5; max-deletion-length, 5; g, 3&#8242; (ref. 35). Only uniquely mapped reads were kept for mismatch calling. A homemade Python script called CountMismatch2Bed.py (<ref type="url">https://  github.com/changhaoli/TAMU_02RSS</ref>) was used to call mismatches. DMS reactivity was calculated as the ratio between the mismatch count and the total read count for each nucleotide. The average mismatch ratios including A/C/G/U were calculated and plotted in a bar plot using the R package ggpubr (<ref type="url">https://cran.r-project.org/web/packages/  ggpubr/index.html</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DMS-reactivity-guided modelling of bona fide pri-miRNAs' RSS</head><p>On the basis of the DMS signal, the secondary structures of pri-miRNAs were modelled by RNAstructure (version 6.3) <ref type="bibr">55</ref> . The following threshold was used to separate adenine and cytosine bases into paired and unpaired nucleotides to produce the best-fitting model for our experimental data. For each pri-miRNA, in Col-0 samples, we set the highest 5% of DMS reactivities of As or Cs as red, which fit to the unpaired region very well; 25% mismatch ratios were coloured with yellow, mainly located in unpaired regions; and the rest with signals higher than 0.0001 were cyan. For mutants, the same pri-miRNA follows the exact same threshold constraint to model its secondary structure. On the basis of miRBase annotation, all mature miRNA sequences are outlined in blue, while miRNA/* sequences are outlined in pink.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Selection of the reference loops/bulges for molecular rulers</head><p>The canonical molecular ruler is approximately 15-17 nt; thus, the ssRNA-dsRNA junction regions that are more than 14 nt away, but close to 15-17 nt, from the first cutting sites are considered as the reference sites (related to Fig. <ref type="figure">5a</ref>). For the potential non-canonical 9-11-nt ruler, the junction regions that are more than 8 nt but less than 14 nt from the first cleavage sites were selected in this study (related to Fig. <ref type="figure">5d</ref>). Anti-Myc</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Extended Data</head><p>21 nt 21 nt 21 nt 21 nt 0.58 0.69 0.39 0.71 1.00 0.42 0.41 0.81 0.38 0.84 1.00 0.09 0.42 0.83 0.39 0.75 1.00 0.08 0.29 0.87 0.26 0.84 1.00 0.21 d c l1 -9 + /- ; D C L 1 E 1 6 9 6 Q 25,513,406 20,128,595 78.89% 14,129,966 70.20% 12,072,577 85.44% Col-0-R2</p><p>27,198,136 21,555,925 79.26% 14,562,422 67.56% 10,898,849 74.84% DCL1 E1507Q -R1 17,042,317 13,295,876 78.02% 9,097,992 68.43% 8,284,781 91.06% DCL1 E1507Q -R2 20,746,753 16,353,047 78.82% 10,325,815 63.14% 8,281,325 80.20% DCL1 E1696Q -R1 20,618,385 15,567,303 75.50% 11,235,541 72.17% 9,898,617 88.10% DCL1 E1696Q -R2 22,836,135 18,725,777 82.00% 11,609,173 62.00% 9,277,891 79.92% hyl1-2-R1</p><p>17,236,546 13,147,056 76.27% 9,294,680 70.70% 8,582,609 92.34% hyl1-2-R2</p><p>21,771,518 17,675,401 81.19% 11,156,419 63.12% 9,269,543 83.09% se-2-R1</p><p>19,490,467 15,161,272 77.79% 10,831,012 71.44% 9,908,659 91.48% se-2-R2</p><p>26,097,085 21,405,171 82.02% 13,767,965 64.32% 11,517,248 83.65%</p><p>d b miR166 DCL1 E1507Q -R1 DCL1 E1507Q -R2 DCL1 E1696Q -R2 DCL1 E1696Q 0 200 400 7500 5000 2500 0 Col-0 E1507Q E1696Q 80 60 40 20 0 2000 1500 1000 500 0 94.7% 88% 5.8%</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>pri-miR167a a</head><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 397</p><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 195 pri-miR167d: 55.6% 82.8%</p><p>pri-miR168b: pri-miR169a:</p><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 251 58% 52%</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 146 74% 100% 100% pri-miR167c:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 144 10% 20%</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 173</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 144 26.7%</p><p>30 40 50 60 70 80 90 100 110 120 130 140 149 pri-miR161: pri-miR165a: 26.3% 7.5% 13.5% 8.7% 9% 25.2% 6.3% 15% 5p-1 3p-1 5p-2 3p-2 15% 97% 41% pri-miR166f: 29.2% b(i) (20 re-validated)</p><p>pri-miR166g:</p><p>Relative position (nt)</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 142</p><p>pri-miR395b:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 83%</p><p>pri-miR395f:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 139 13% 50%</p><p>pri-miR397a:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 pri-miR399a: 43% 86.5%</p><p>pri-miR399d:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 153</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 146</p><p>pri-miR397b:</p><note type="other">1 10 20 30 40 50 60</note><p>70 80 90 100 110 120 130 131 pri-miR170: 37.5% 45.8%</p><p>pri-miR172b:</p><p>U A C A A A U A C C U A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 141 2.2% 83.9% 15.8% Supplementary Fig. 2 b(ii) 32% 1.1%</p><p>pri-miR393b:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 pri-miR827:</p><p>pri-miR864:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 158</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 137</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 142</p><p>U C A G C U U A A U U A A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 157 30 40 50 60 70 80 90 100 110 120 130 </p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 168 G C A C A A U</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>G C G G A C A G U G C U A C A A G A A G A G A U U U A U A G A G U C U U A A G A G U C U C A A G A A G C G U A C A A G G A G A U G A G G A G U G A A U C A C C A A A</head><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 257 100% 15% 10% 7% 11% 7.5% 5p-1 5p-2 3p-1 3p-2 5p-3</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 pri-miR158b:</p><p>pri-miR776:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 139</p><p>30 40 50 60 70 80 90 100 110 120 130 140 pri-miR847: AGO1-IP small RNA-seq AGO1-IP small RNA-seq 5p-1(~70 reads) 3p-1(~310 reads) 3p-2(2 reads)</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 92.3% 75% 5p-1 3p-1 3p-2 Relative position (nt) Reads DCL1 E1507Q DCL1 E1696Q page 4 63% pri-miR859: pri-miR867: 5.6% 3% 17% pri-miR857a: G U U U C U A U U A</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>U U U U C G A C U C C U A C A A C A A A C U U U C A C C A U A C A A A A U A A U G A A G A A A A C A C C C A A</head><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 399</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 148</p><p>C 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 301 pri-miR5013: pri-miR5020b: 90% 60% 10% 5p-2 3p-1 100% 94% 5p-1</p><p>pri-miR4240:</p><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 205</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 159 One functional miRNA/* is the canonical miR399/*, designated as the 5p-1 and 3p-1 products (blue brackets) while the other is a new set of miRNA/* duplex in red that could be recovered in sRNA-seq and that we referred as 5p-2 and 3p-2. The reads of the recovered miRNA/*s (5p-1/3p-1; 5p-2/3p-2) are shown on the right panels. Of note, pri-miR399b also underwent the third mode of processing that is shifted 7 nt toward the canonical miR399/* duplex, producing a 3p-3 abortive product in the 3' arm (light green brackets). The presence of multiple starting processing sites for pri-miR399b is likely due to its complicated terminal loop structure. The black dotted arrows denoted the intended cutting positions but were not identified in our data. The cleavage site marked with an asterisk (*) was also discovered in WT and fiery1 14 .</p><p>(g) Pri-miR156f is now re-annotated to the two types of BTL processing, resulting in miR156f/* (5p-1/3p-1) and a newly productive miRNA/* species (5p-2/3p-2) (left panel) that could be recovered from sRNA-seq. Be noted that 5p-1 targets mRNA (AT1G53160), 3p-2 cleaves mRNA (AT5G63290) according to the degradome-seq (right panel).</p><p>(h) IGV graphs of reads' numbers and relative ratios at detected cleavage sites of pri-miR780a in Col-0 and semi-active DCL1 expressing lines. Related to Fig. <ref type="figure">2c</ref>.</p><p>(i) IGV graphs of two species of miRNA/*s from pri-miR780a, both (5p-1/3p-1) and (5p-2/3p-2) are sorted into AGO1. Related to Fig. <ref type="figure">2c</ref>.</p><p>(j) Newly identified BTL patterns for 12 pri-miRNAs with some new targets included. Be noted that there are two pages for this panel (2j(i) and 2j(ii)). Pri-miR158b had an alternative starting cleavage site that is in the middle of miR158/*, likely resulting in an abortive product, which is not recovered in RISC. Pri-miR831 does not generate the products (5p-1/3p-1) as predicted from miRBase. Rather, the initial cleavage position shifted 3 nt away from the predicted site and generated a new pair but functional duplex (5p-2/3p-2, in red). Pri-miR847 follows two kinds of BTL modes, producing functional 5p-1/3p-1 recovered from AGO1-RISC (in blue), but abortive 3p-2 (in pink). The processing of pri-miR851a is even more heterogeneous as it has three different sites for initial cleavages with comparable ratios: one produces miR851a/* as predicted (5p-1/3p-1, in blue); another way shifted 3 nt toward the base region and generated a new pair but productive form of miRNA/* (enclosed by red brackets). Importantly, 5p-1 targets AT4G21350 whereas the new 5p-2 cleaves AT2G34750.</p><p>Additionally, the third processing shifted 6 nt onto the canonical miRNA/* duplex but yielded an abortive product (marked by a light green bracket). Finally, pri-miR4240 displays two ways of BTL processing: one produces the canonical miRNA/* (in blue), but the other generates abortive product with only one strand of sRNA that could be recovered in RISC but with low read counts (labelled in pink).</p><p>Extended Data Fig. 3 200 150 100 50 0 0 200 400 600 60 40 20 0 40 30 20 10 0 17.6% 70% Col-0 E1507Q E1696Q</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>pri-miR169j a</head><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 226</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 207 pri-miR169f: 95% 31.3% 12.5% 15.6%</p><p>pri-miR169g: pri-miR169i:</p><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 235</p><p>pri-miR169l: 25% 69% Supplementary Fig. <ref type="figure">3</ref> A</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 321</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 191</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 383 pri-miR163: pri-miR169b: 8% 8% 100% 100% b(i) (12 re-validated) 50% 100% pri-miR169c: Relative position (nt) Reads DCL1E1507Q DCL1E1696Q page 1</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 196</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 383 c pri-miR822: pri-miR845a: 100% 68.6% 63.5% 9.7% 9.8% pri-miR823 d 200 150 100 50 0 0 200 400 600 1000 750 500 250 0 400 300 200 100 0 Col-0 E1507Q E1696Q 16% 95.1% 68.5%</p><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 309</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 212</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 pri-miR169m: 25.8% Extended Data Fig. 3 b(ii</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 235 pri-miR169n: 42% 91% pri-miR394a: 18% 1.3% 38%</p><p>pri-miR402: pri-miR447b: </p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 183 pri-miR156a: pri-miR156b: pri-miR156c: 13% 25% 13% 24% 62% 100% 19% 2.3% 20% 8.3% 50% b(i) (12 re-validated)</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 159</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 146 1.5% 4.3% 43.1%</p><p>pri-miR156d:</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 pri-miR156e:</p><p>30 40 50 60 70 80 90 100 110 120 130 140 147 52.2% Extended Data Fig. 4 100 75 50 25 0 250 200 150 100 50 0 60 40 20 0 Col-0 E1507Q E1696Q 53% 71% 12% pri-miR160a: 0 100 200 300 400 500 a pri-miR160b:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 135 pri-miR156h:</p><p>20 30 40 50 60 70 80 90 100 110 120 130 136 8.9% Supplementary Fig. 4 Relative position (nt) Reads DCL1E1507Q DCL1E1696Q page 1 c pri-miR395d 0 100 200 300 400 2.0 1.5 1.0 0.5 0 40 20 0 0.050 0.025 0 Col-0 E1507Q E1696Q 50% 77% pri-miR160c: pri-miR162a: Extended Data Fig. 4 b(ii) 20.1%</p><p>68.5%</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>47.1%</head><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 141 28.5% 10.1% 56% 11.7% 5%</p><p>30 40 50 60 70 80 90 100 110 120 130 140 pri-miR171c: 13.1% 13.9% 80.5% 5.1% 14.4% 21.9% 16.1% 77.5% pri-miR171b: pri-miR400: 43% 29% 25% 17.5% 17.5%</p><p>20 30 40 50 60 70 80 90 100 110 120 130 137</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 158</p><p>30 40 50 60 70 80 90 100 110 120 130 132 Supplementary Fig. 4 Relative position (nt) Reads DCL1E1507Q DCL1E1696Q page 2 e AGO1-IP small RNA-seq AGO2-IP small RNA-seq AGO1-IP small RNA-seq AGO2-IP small RNA-seq 5p-1(~100 reads) 5p-2(~340 reads) 3p-1(~80 reads) 3p-2(~5 reads)</p><p>pri-miR4245:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 225</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 158</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 145 pri-miR390b: 22% 18.6% 20% 40%</p><p>pri-miR408:</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 214 pri-miR159c: Extended Data Fig. 4 d (4 re-annotated) 100% 25% 75% * * *</p><p>Supplementary Fig. <ref type="figure">4</ref> d (4 re-annotated)</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 195</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 191 pri-miR775: f(i) (19 newly discovered) pri-miR841a: 7.8% 7.6% 0.8% 2% 6.3% 9% 14.2% 4% 5.5% 5p-1 5p-3 3p-1 3p-3 3p-2 5p-2 20% 5% 10%</p><p>pri-miR848a:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 162 page 3</p><p>U 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 208 pri-miR849a: 55.6%</p><p>pri-miR852:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 169 pri-miR853:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 pri-miR1888a:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 137 9.1% 27.8%</p><p>pri-miR2112:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 189 2.7% 7.7% 1.5%</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>2.1%</head><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 149</p><p>30 40 50 60 70 80 90 100 110 120 130 135 pri-miR5012: pri-miR5017: 27.3% 50% 14.3% 8.6% 4.5% 4.3% 13.5% 5.7% 5p-1 3p-1 3p-2 5p-2 pri-miR3440b: 30% 30% 2.1% 4.1% 5.5% 8.2% 6% 3% 5p-1 3p-1 3p-2</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 178 pri-miR5021: 33.3% 33.3% 66.7%</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 171 16% 15%</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 203 pri-miR5026:</p><p>Supplementary Fig. <ref type="figure">4</ref> f(ii) page 4</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 223 A U U U U A U U</p><p>U C C G C A G C C C U C C A A C G A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 181 pri-miR5028: 25% 25% f(iii) pri-miR5634: 1.8% 1% 1.9% 1% 1.4% 3.4% 1% 2.3% 1%</p><p>pri-miR5637:</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 400 404</p><p>pri-miR5640: pri-miR5654:</p><p>10 20 30 40 50 60 70 80 90 100 110 119 0 200 400 600 15000 10000 5000 0 1500 1000 500 0 600 400 200 0 Col-0 E1507Q E1696Q 49.5% 94.7% 18.6% 4.3% 7.7% 4.9% pri-miR159a: c pri-miR839a 0 200 400 600 1000 500 0 1500 1000 500 0 1000 500 0 Col-0 E1507Q E1696Q 30% 34% 3.3% 2.5% 17.3% 4.6% 3% 17.5%</p><p>C 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 224</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 222 pri-miR159b:</p><p>pri-miR319a:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 13.3% 31.5% 7.5% 11.8% 8.2% 55.2% 4.8% 13.4% 12% 14.1% 12% 22.4% 12% 13.4% 10.9% 3.3% 2.2% 1.5% pri-miR319b: 2.6% 1.6% 1.1% 4.2% 3.3% 10.8% 33.1% 82.1% 30% pri-miR319c: 20% 20% 20% 40% 97.4% b (4 re-validated)</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 219 pri-miR159a pri-miR839a a c Supplementary Fig. 5 Relative position (nt) Reads Relative position (nt) Reads DCL1E1507Q DCL1E1696Q DCL1E1507Q DCL1E1696Q page 1</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 400</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 226 pri-miR835a: pri-miR856:</p><p>30.3% 3.2% 3.4% 20% 16.5% 22.5% 1.3% 1.4% 27.5% 15.4% 2.5% 8%</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 271 pri-miR840a: 5.6% 7.2% 14% 12% 31% 32%</p><p>pri-miR869a: </p><p>U C A A U 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 184 pri-miR5024: pri-miR5656: 40% 11% 22% 33% 5p-1 3p-1 3p-2 5p-2 </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>G G A U A U U G A A C C G A A G A U C G A C G G U U G A G A U G A C A A G G C C A A G A U A U A A C A A A A A G A U A U U A U U C U A G A A G A U</head><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 224 pri-miR1888b: 5p-1 (~1400 reads) 5p-2(2 reads) 3p-1(~80 reads) 3p-2(~9000 reads) AGO1-IP small RNA-seq AGO2-IP small RNA-seq AGO1-IP small RNA-seq AGO2-IP small RNA-seq 22% 15% 3p-2 5p-2 5p-1 3p-1 3p-1(~27 reads) 3p-2(~110 reads) 0 200 400 600 4000 3000 2000 1000 0 500 400 300 200 100 0 60 40 20 0 Col-0 E1507Q E1696Q 62.1% 78% 5.7% 23.7%</p><p>pri-miR166a:</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 187 pri-miR166b: pri-miR166e:</p><p>47.8% 7.3% 29% 24.8% 5p-1 3p-1 5p-2 67.9% 26.7% 8.9% 8.9% 1.1% 9.5% 5p-1 3p-1 5p-2 5p-3</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 pri-miR166a a Supplementary Fig. 6  </p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 203</p><p>20 30 40 50 60 70 80 90 100 110 120 127 pri-miR158a: pri-miR164b: pri-miR164c:</p><p>14% 6.4% 44% 7.4% 11% 4.8% 5p-1 3p-1 5p-2 b(i) (23 with abortive products) 13% 1.3% 26% 1.5% 61% 5p-1 3p-1 3p-2 5p-2 73% 67% 5p-1 3p-1 5p-2 pri-miR165b: 12.9% 65.3% 20.6% 3.2% 1</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 212 77.4% * * * Relative position (nt) Reads DCL1E1507Q DCL1E1696Q page 1</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 217</p><note type="other">10 20 30 40 50 60 70</note><p>80 90 100 110 120 130 140 150 160 167 72% 50% 36% 1.5% 3% 3p-2 5p-1 3p-1 5p-2 5p-3 3p-3 pri-miR168a: pri-miR169e: </p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 155 G</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 143 pri-miR172a: pri-miR172d: pri-miR172e: 2% 40% 7.8% 5p-1 3p-1 5p-2 23% 32% 7.0% 3p-2 5p-1 3p-1 5p-2 8.3% 91.6% 7</p><p>30 40 50 60 70 80 90 100 110 120 130 134</p><p>20 30 40 50 60 70 80 90 100 110 120 130 A</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 191 A C A C A U</p><p>30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 186 pri-miR391: pri-miR390a: 19.1% 7.7% 93.9% 43.3% 3p-2 5p-1 3p-1 5p-2 77.7% 82.4% 5.4% 2.9% 3p-2 5p-1 3p-1 5p-2 pri-miR394b: 18% 4.8% 4.2% 9.0% 5.3% 2.8% 3p-2 5p-1 3p-1 5p-2 pri-miR398c: pri-miR771a: 11.4% 53% 1.3% 38.5% 3p-2 5p-1 3p-1 5p-2 10.2% 13.3% 3p-2 5p-1 3p-1 </p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 155 * pri-miR828: </p><p>30 40 50 60 70 80 90 100 110 120 130 140 145 U C A U C A U C U C U C U U</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 214 pri-miR395c: 18% 17% 82% 3p-2 5p-1 3p-1</p><p>pri-miR398b: </p><p>30 40 50 60 70 80 90 100 110 120 130 140 147</p><note type="other">1 10 20 30 40 50 60 70 80 90 100 110 120 121</note><p>pri-miR395a:</p><p>10 20 30 40 50 60 70 80 90 100 110 120 130 132 pri-miR5995b: 70% 10% 3p-2 5p-1 5p-2 3p-1 pri-miR2111a: b</p><note type="other">10 20 30 40 50 60 70 80 90</note><p>100 110 120 130 140 150 160 170 180 190 192 14% 40% 20% 40% 29% 3p-2 5p-1 5p-2 3p-1</p><p>30 40 50 60 70 80 90 100 110 120 130 135 f-1 (4 with both productive and abortive products) 5p-1(~1800 reads) 5p-2(~140 reads) small RNA-seq small RNA-seq 3p-1(~4100 reads) 3p-2(~530 reads) C U A U C A U C A A C C  </p><p>C A U U G A U G A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 153 * C A C A U  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>G C A A C U A G A G G A A G G A U C C A A A G G G A U C G C A U U G A U C C</head><p>U C C U C U U C G A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 167 pri-miR393a: 64% 69.2% 15.4% 22% 5% 3p-2 5p-2 5p-1 3p-1</p><p>Cleavage site on target (AT4G03190) of 5p-1(miR393a) Cleavage site on target (AT1G05170) of 5p-2(miR393a) * pri-miR396b:</p><p>U U C C A U U U 1 10 20 30 40 50 60 70 80 90 100 110 120 130 132 pri-miR398a:</p><p>4.2% 8% 11.3% 12% 55.7% 3p-2 5p-2 5p-1 3p-1 47.4% 21.1% 66.7% 3p-2 5p-2 5p-1 3p-1</p><p>Cleavage site on target (AT2G36400) of 5p-1(miR396b) Cleavage site on target (AT5G59700) of 3p-2(miR396b)</p><p>20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 176 * pri-miR403: pri-miR472a: 45% 68% 30% 2.3% 3p-2 5p-2 5p-1 3p-1 9.1% 21% 3p-2 5p-2 5p-1 3p-1 i(iii) G A U U G U C A U U A G A A G A G U C G U A U U A C A U G U U U U G U G C U U G A A U C U A A U U C A A C A G G C U U U A U G U A A G A G A U U C U U U A A C A A U U C C U A U A A U C U U U G U</p><p>U C G A U U U U U A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 147</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 218 * pri-miR824: 86% 30% 37% 5.6% 3p-2 5p-2 5p-1 3p-1 U A U C A C C A U U U G U A C U U G A G U U G U C U C U C A U G U C U A G A C C A U U U G U G A G A A G G G A G U U U U U G U U U A C A C C A A U A C C C C C C A G U C U C U A A A U U U G U A A G A A G U A U U A U G C U C A A U U A A G G A A C A U A G C U A G U U C G A C A U A C C A U A C U A C C C U U U U U A A A A C U A U C C U A U A U</p><p>A U U C C C U U C U C A U C G A U G G U C U A G A U G U G C G A G G U G A C U C U C A U G G A G G U A A A G A A C A A U G G U G A U 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 400 410 420 430 440 450 460 470 480 490 500 510 520 530 540 550 560 570 580 590 600 610 620 630 640 650 660 670 680 689 pri-miR3933: 25% 33% 14% 8% 3p-2 5p-2 5p-1 3p-1 39% 32% 63% 8.6% 3p-2 5p-2 5p-1 3p-1</p><p>pri-miR868a:</p><p>A A U U U G A U U A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 235</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>G G C G A U U A C A C U U A U A G U A A A A C U U A G A A G A A G C A A A A U G A C G A C U C G G A C A A C A A U A A U</head><p>C G A A A A A G A A A 1 10 20 30 40 50 60 70 80 90 100 110 120 130 136 page 7 Supplementary Fig. 6 C A U G U A  pri-miR5014a: pri-miR8183:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C A C A U A A A A C U A G U A U A U U G C C U U G A A C A U G G U U U A U U A G G A A A G A C A U G A G A A C</head><p>5.7% 3% 12.5% 5.7% 7.5% 7.5% 3% 3p-2 5p-2 5p-1 3p-1 3p-3 5p-3 60% 20% 30% 3p-2 5p-2 5p-1 3p-1 Extended Data Fig. 6 i(iv)</p><p>1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 141 A C G A A G A G A U U U C A U G A</p><p>C U A C U U C C G C C G 1 10 20 30 40 50 60 70 80 90 100 110 120 130 140 143 5' cutting ratio 1.0 0.8 0.6 0.4 0.2 0.0 -40 -35 -30 -25 -20 -15 -10 -5 0 5 10 15 20 25 30 35 40 c 0 -5 -10 -15 -20 -25 -30 -35 -40 40 35 30 25 20 15 10 5 0.0 0.2 0.4 0.6 0.8 1.0 cutting ratio d 5' -40 -35 -30 -25 -20 -15 -10 -5 0 5 10 15 20 25 30 35 40 cutting ratio 1.0 0.8 0.6 0.4 0.2 0.0 0 -5 -10 -15 -20 -25 -30 -35 -40 40 35 30 25 20 15 10 5 cutting ratio 0.0 0.2 0.4 0.6 0.8 1.0 pri-miR398b pri-miR319a 0 200 400 40 20 0 Col-0 DCL1 E1507Q DCL1 E1696Q 40 30 20 10 0 60 40 20 0 125 100 75 50 25 0 20 15 10 5 0 se-2 hyl1-2 0 100 200 300 400 200 150 100 50 0 750 500 250 0 200 150 100 50 0 300 200 100 0 150 100 50 0 e f g BTL LTB 62.5% 37.5% 60% 40% 52% 48% 68% 32% S E d e p e n d e n t S E in d e p e n d e n t H Y L 1 -d e p e n d e n t H Y L 1 -in d e p e n d e n t 100 75 50 25 0 percentage(%) 100 75 50 25 0 Chi-Squared Test</p><p>p=1 p=0.24 Relative position (nt) Reads Relative position (nt) Reads Relative position (nt) Relative position (nt) Relative position (nt) Relative position (nt) Relative position (nt) Relative position (nt) Relative position (nt) Relative position (nt) Col-0 DCL1 E1507Q DCL1 E1696Q hyl1-2 se-2 Col-0 DCL1 E1507Q DCL1 E1696Q hyl1-2 se-2 Col-0 DCL1 E1507Q DCL1 E1696Q hyl1-2 se-2 Col-0 DCL1 E1507Q DCL1 E1696Q hyl1-2 se-2</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>&#169; The Author(s), under exclusive licence to Springer Nature Limited</p></note>
		</body>
		</text>
</TEI>
