<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>How intrinsically disordered proteins order plant gene silencing</title></titleStmt>
			<publicationStmt>
				<publisher>Elsevier Ltd.</publisher>
				<date>01/01/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10492488</idno>
					<idno type="doi">10.1016/j.tig.2023.12.009</idno>
					<title level='j'>Trends in Genetics</title>
<idno>0168-9525</idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Baoshuan Shang</author><author>Changhao Li</author><author>Xiuren Zhang</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Intrinsically disordered proteins (IDPs) and proteins with intrinsically disordered regions (IDRs) possess low sequence complexity of amino acids and display non-globular tertiary structures. They can act as scaffolds, form regulatory hubs, or trigger biomolecular condensation to control diverse aspects of biology. Emerging evidence has recently implicated critical roles of IDPs and IDR-contained proteins in nuclear transcription and cytoplasmic post-transcriptional processes, among other molecular functions. We here summarize the concepts and organizing principles of IDPs. We then illustrate recent progress in understanding the roles of key IDPs in machineries that regulate transcriptional and post-transcriptional gene silencing (PTGS) in plants, aiming at highlighting new modes of action of IDPs in controlling biological processes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IDPs are pivotal regulators of diverse biological processes</head><p>IDPs are a group of proteins that lack a fixed tertiary structure. Frequently, proteins that harbor IDRs while having globular structures in other domains are also classified as IDPs. Unlike wellfolded proteins, IDPs possess a biased and heterogeneous amino acid composition, with more hydrophilic and fewer hydrophobic residues <ref type="bibr">[1]</ref>. These residues constitute the driving force for IDP interactions via electrostatic interactions (charge-charge, charge-&#960;, cation-&#960;, &#960;-&#960;, and hydrogen bonds) and/or hydrophobic interactions of nonpolar amino acid residues with hydrophobic side chains (Figure <ref type="figure">1A</ref>) <ref type="bibr">[2]</ref>. The amino acid composition and molecular features of IDPs enable them to serve as scaffolds or hubs for multivalent and/or transient interactions with their partners for various functions <ref type="bibr">[3]</ref>. IDPs often undergo liquid-liquid phase separation (LLPS), also termed biomolecular condensation, via their IDRs and/or low-complexity sequences. LLPS is a feature of numerous membraneless organelles in eukaryotic cells and allows the compartmentalization and organization of ribonucleoprotein (RNP) macromolecules to fulfill specific cellular activities <ref type="bibr">[4]</ref>.</p><p>Although IDPs have been discovered for decades, their pivotal biological roles and regulatory mechanisms have been only appreciated until very recently. IDPs participate in cell proliferation, polarity, and metabolism in bacteria <ref type="bibr">[5]</ref>. In mammals, dysregulation of IDPs can cause brain disorders such as Parkinson's and Alzheimer's diseases, as well as cancers, making IDPs promising targets for therapy <ref type="bibr">[6,</ref><ref type="bibr">7]</ref>. In plants, IDPs are engaged in hormone signaling <ref type="bibr">[8]</ref>, seed germination <ref type="bibr">[9]</ref>, intra-chloroplast cargo sorting <ref type="bibr">[10]</ref>, flowering <ref type="bibr">[11,</ref><ref type="bibr">12]</ref>, plant immune response <ref type="bibr">[13]</ref>, and temperature and light perception <ref type="bibr">[14,</ref><ref type="bibr">15]</ref>, among others. In this review we first summarize basic biochemical features of IDPs and then specifically address their roles in gene silencing.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Biochemical characteristics of IDPs</head><p>IDPs are frequently involved in formation of biological condensates via LLPS and can also undergo phase transition to less reversible gel-like and solid aggregates <ref type="bibr">[16]</ref>. IDP-triggered condensation can enrich substrates, thus facilitating biochemical reactions. Conversely, the</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Highlights</head><p>The genome of Arabidopsis thaliana encodes thousands of intrinsically disordered proteins (IDPs). Several IDPs, exemplified by H2B.8, LHP1, ADCP1, and EMB1579, regulate transcription and epigenetic silencing in various condensation-dependent manners. Diverse IDPs such as FCA, FLL2, FRI, and/or their associated long non-coding RNAs, regulate the transcription of FLC via phase separation. SE promotes Microprocessor activity via its condensation. The protein can be hijacked by SAID1 and 2 to form nuclear condensates, where SAID1/2 sequester primary miRNA precursors (pri-miRNAs) and inhibit Microprocessor activity. SAID1/2 promote phosphorylation of SE, which triggers its destruction by the 20S proteasome and reduces its binding affinity for HYL1, while isomerization of SE promotes its condensation and miRNA production.</p><p>SGS3 constitutes a regulatory hub to control the production of 21-22 nt small interfering RNAs (siRNAs) via phase separation. SGS3 condensation is critical for the recruitment of RDR6 and RNA substrates. However, SGS3 association with its RNA substrates is enhanced by FVE, which also bridges SGS3/RDR6 and DRD4/DCL4 to facilitate siRNA production.</p><p>condensates may also sequester biomolecules from other niches to stimulate or suppress the reactions of the sequestered factors. Proteins in the condensates may either be prevented from, or be prone to, degradation depending on biophysiological conditions. Biological involved in macromolecular complex assembly or condensation via multivalent or transient interactions with their partners. IDPs recruit their partners, such as nucleic acids and folded proteins, via electrostatic or hydrophobic interactions, to trigger macromolecular complex assembly, disassembly, condensation, and/or aggregation. The interactions are mostly multivalent and transient, and are determined by thermodynamic factors of IDPs or their partners, such as protein concentration, ATP level, temperature, balance of net charge, cellular physiological conditions, and post-translational modifications (PTMs). The features of IDPs and their assembly/condensation underlie their roles in cellular compartmentation, conformational change, accessibility to PTM modifications, sequestering and scavenging, among others, to fine-tune reaction activities, storage, and/or cleavage of proteins. (B-E) Overlap of the top 30% of proteins predicted from five computational analyses to retain LLPS capacity (B and C) and their Gene Ontology (GO) enrichment (D,E). The top 30% of proteins with LLPS capacity were predicted by SaPS, PdPS, catGRANULE, PLAAC, and PScore in A. thaliana (B) and rice (C). GO enrichment for the 1036 and 252 overlapping proteins with LLPS capacity in A. thaliana (D) and rice (E). Abbreviations: BP, biological process; CC, cellular component; MF, molecular function.</p><p>condensates or aggregates can also enable cellular compartmentalization, concentration of cellular components, rapid assembly and disassembly, and rapid switchable protein-protein and protein-nucleic acid interactions, and/or facilitate the dynamic formation of local reaction centers with spatiotemporal specificity (Figure <ref type="figure">1A</ref>) <ref type="bibr">[4]</ref>. Overall, the aforementioned features of IDPs are evolutionarily adaptive strategies that enable protective and energy-saving processes for the recovery of cellular proteins without widespread misfolding or degradation.</p><p>IDPs have the potential to bind to multiple partners to attain distinct conformations that enable them to interact with different targets and display flexible conformational preferences <ref type="bibr">[17]</ref>. Generally, IDRs, short linear motifs (SLiMs), or low-complexity domains (LCDs) of IDPs mediate their multivalent interactions <ref type="bibr">[4]</ref>. One type of LCD is a prion-like domain (PrLD) that resembles the yeast prion domain in that it is enriched in polar, uncharged amino acids such as Asn, Gln, Tyr, Ser, and Gly <ref type="bibr">[18]</ref>. Upon binding to their partners, many IDPs often undergo conformational changes, depending on their foldability and microenvironment <ref type="bibr">[19,</ref><ref type="bibr">20]</ref>. This exposes variable surfaces of different recognition motifs and/or sites of post-translational modifications (PTMs), thus increasing the total repertoire of structures resulting from an IDP sequence and allowing binding promiscuity and plasticity spatial-temporally and transiently <ref type="bibr">[21]</ref>. The assembly and disassembly of IDPs depend on the local microenvironment, including ionic concentration, temperature, pH, and other parameters. These processes are also determined by the concentration of the IDPs in aqueous solution and their features of length, hydrophobicity, and charge distribution (Figure <ref type="figure">1A</ref>) <ref type="bibr">[22]</ref>.</p><p>Multiple datasets including PONDR (<ref type="url">http://www.pondr.com</ref>) and DisProt (<ref type="url">https://www.disprot.org</ref>) have been developed for the prediction of IDPs <ref type="bibr">[23]</ref>. The propensity of IDPs to undergo phase separation can also be predicted through bioinformatic tools, although the results can vary considerably <ref type="bibr">[24]</ref>. Five distinct methods (SaPS and PdPS <ref type="bibr">[25]</ref>, catGRANULE <ref type="bibr">[26]</ref>, PLAAC <ref type="bibr">[27]</ref>, and PScore <ref type="bibr">[28]</ref>) have been popular for this purpose. Comparative computational analyses reveal that ~1036 and 252 proteins from the dicotyledon and monocotyledon model organisms Arabidopsis thaliana and rice (Oryza sativa), reflecting the top 30% overlapping proteins from the five methods, tend to undergo phase-separation events (Figure <ref type="figure">1B</ref>,C and Tables <ref type="table">S1</ref> and<ref type="table">S2</ref> in the supplemental information online). Gene Ontology (GO) analysis implicates these proteins in macromolecule metabolism, mRNA metabolism, and developmental regulation in A. thaliana. The proteins are over-represented with molecular functions in DNA binding, protein dimerization, kinase activity, RNA helicase, and translation initiation (Figure <ref type="figure">1D</ref>). Similarly, in rice, the proteins are enriched in RNA metabolic processes, transcription, and developmental regulation, and have molecular functions in nucleic acid binding as well as pyrophosphatase, hydrolase, and purine NTP-dependent helicase activities (Figure <ref type="figure">1E</ref>). Notably, many of the computationally predicted phase separation-prone proteins in A. thaliana could also be recovered experimentally via biotinylated isoxazole microcrystals that tend to precipitate proteins with random coil polypeptides <ref type="bibr">[29]</ref>, highlighting the power and complementarity of the two strategies to predict plant phase-separating proteins at a proteome level.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IDPs participate in transcriptional regulation</head><p>The transcription apparatus in eukaryotes is dynamically constituted of RNA polymerases, transcription factors (TFs), mediators, chromatin-remodeling complexes, and numerous epigenetic regulators. Recent studies have shown that a large fraction of the transcription apparatus comprises IDPs, and the resultant condensates of distinct compositions may drive consecutive transcription cycles. The Cterminal domain (CTD) of RNA polymerase II (Pol II) has an IDR that retains multiple heptapeptide YSPTSPS repeats ranging from 26 in yeast to 52 repeats in human <ref type="bibr">[30]</ref>. <ref type="bibr">Boehning et al.</ref> showed that the human and yeast RNA Pol II CTDs possess the capacity to form condensates. Moreover, the length of the CTD influences the stability and dynamics of droplets, as an extended CTD can lead to stronger CTD-CTD interactions and more stable droplets in vitro <ref type="bibr">[31]</ref>. While the RNA Pol II CTD triggers the formation of various transcription-dependent condensates with distinct components, phosphorylation of the protein alters its conformation in subsequent transcription apparatus assemblies <ref type="bibr">[32]</ref>. Unphosphorylated RNA Pol II is recruited to promoters via LLPS of the CTD and the transcription initiation machinery <ref type="bibr">[31]</ref>. Subsequently, RNA Pol II is released from transcription initiation condensates with RNA accumulation and CTD phosphorylation, and is relayed to elongation or splicing condensates <ref type="bibr">[32]</ref>. In addition, IDRs of TFs can determine their binding specificity to target DNA regions through multivalent determinants distributed within the IDR sequences <ref type="bibr">[33]</ref>.</p><p>In A. thaliana, the FLOWERING LOCUS C (FLC) locus encodes a master transcriptional regulator that controls flowering time <ref type="bibr">[34]</ref>. Recent progress has shed light on the importance and mechanism of IDPs in regulating FLC transcription <ref type="bibr">[11,</ref><ref type="bibr">12]</ref>. Briefly, FLC protein is a MADS box TF that represses the expression of genes in the downstream flowering pathway, leading to delayed flowering <ref type="bibr">[34]</ref>. The expression of FLC itself is enhanced by the transcriptional factor, FRIGIDA (FRI), an IDP with two coiled-coil motifs <ref type="bibr">[12]</ref>. FRI can recruit transcriptional coactivators and histone modifiers such as histone H3 lysine 36 trimethylation (H3K36me3) and H3K4me3 methyltransferases to the FLC promoter under warm conditions, leading to activation of FLC transcription. However, this process can be reverted by vernalization <ref type="bibr">[35]</ref>. While FRI is unstable in a warm environment, prolonged cold exposure results in accumulation of the protein and promotes FRI to form nuclear condensates that depend on its C-terminal IDR and two coiled-coil domains. Importantly, the condensation can stabilize FRI but sequester the protein away from FLC promoter, leading to repression of FLC expression (Figure <ref type="figure">2</ref>), a prerequisite for flowering transition after vernalization <ref type="bibr">[12]</ref>.</p><p>Transcription of FLC is also regulated by the antisense long non-coding RNA COOLAIR from the FLC locus <ref type="bibr">[34]</ref>. Notably, the RNA-binding protein FCA is an IDP with a prion-like domain and can form condensates. The size and number of the FCA condensates can be further stimulated by another coiled-coil IDP protein, FLL2. This can promote 3&#8242; processing of COOLAIR RNA and cause chromatin silencing of FLC (Figure <ref type="figure">2</ref>) <ref type="bibr">[11]</ref>. In addition, Zhu et al. reported that cold exposure can alter FRI interaction with COOLAIR and other partners, and can promote FRI condensation, as shown in root cells and tobacco leaf nuclei. Consequently, the condensate sequesters FRI away from FLC promoter and triggers the repression of FLC transcription. Deletion of the COOLAIR promoter significantly alters the overall size distribution and reduces the number of cold-induced FRI condensates, leading to elevated FRI occupancy at the FLC promoter and inefficient repression of FLC transcription under cold exposure <ref type="bibr">[12,</ref><ref type="bibr">36]</ref>.</p><p>The abovementioned model has recently been challenged because Zhang et al. argue that COOLAIR is neither required for FRI condensation nor for FLC repression under constant cold exposure <ref type="bibr">[37]</ref>. They reported that, following deletion of the promoter cold-responsive cis elements, there was no significant change in the size or number of FRI condensates, either before or after cold exposure, between functional pFRI:FRI-GFP and the cold-insensitive pFRI:FRI-GFP; &#916;COOLAIR lines in which the COOLAIR promoter was deleted. Furthermore, transcriptional repression of FLC following cold exposure was not affected in pFRI:FRI-GFP; &#916;COOLAIR lines which lack COOLAIR expression <ref type="bibr">[37]</ref>. Interestingly, Zhu et al. revisited the experiments later and insisted on the correctness of their model <ref type="bibr">[36]</ref>. The conflicting results might be due to differences in growth conditions and seedling densities in the two laboratories. Clearly, more experiments will be necessary to clarify the regulatory mechanism of the FRI transcriptional condensates in controlling FLC locus expression. Given that IDP-triggered condensates are context-specific and condition-sensitive, it will be crucial to directly quantitate how cold exposure might alter the</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Trends in Genetics</head><p>interaction of FRI with COOLAIR within FRI condensates and the consequences thereof. In addition, it will be crucial to analyze the underlying mechanism of FRI condensation in aerial parts under physiological conditions, even though this is technically more challenging than in root tip cells.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IDPs regulate transcriptional silencing</head><p>Heterochromatin compartmentalization acts to repress transcription, and IDPs are involved in chromatin condensation in both animals and plants. Chromatin compaction and accessibility </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Trends Trends in in Genetics Genetics</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Trends in Genetics</head><p>are regulated by histone variants and modifications <ref type="bibr">[38]</ref>, DNA methylation <ref type="bibr">[39]</ref>, and chromatin remodeling (Figure <ref type="figure">2</ref>) <ref type="bibr">[40]</ref>. In a study to explore the mechanism of sperm chromatin condensation in flowering plants, <ref type="bibr">Buttress et al.</ref> reported that the N-terminal IDR of the A. thaliana histone variant H2B.8 mediates its phase separation which drives chromatin compaction of sperm nuclei. Whereas dysfunction of H2B.8 results in enlarged sperm nuclei with decondensed chromatin, ectopic expression of H2B.8 in somatic cells triggers smaller nuclei with compacted chromatin. Intriguingly, the authors found that H2B.8 condensates mainly associate with and compact transcriptionally inactive AT-rich euchromatin, thus facilitating nuclear compaction while retaining active gene expression <ref type="bibr">[41]</ref>.</p><p>Chromatin compaction is also regulated by histone methylation which can be mediated through the activities of the evolutionally conserved Polycomb group (PcG) complexes, including Polycomb repressive complex 1 (PRC1) and PRC2, in both animals and plants (Figure <ref type="figure">2</ref>). PRC2 catalyzes mono-, di-, and tri-methylation of histone lysine 27 (H3K27me1-3) that are hallmarks of silent chromatin. Canonical PRC1 facilitates histone H2A ubiquitination (H2AK119/ 121ub) and chromatin compaction, and this process relies on PRC2-deposited H3K27me3 marks. By contrast, the noncanonical PRC1 promotes these processes independently of PRC2 activity <ref type="bibr">[42,</ref><ref type="bibr">43]</ref>. New insights are gained on the activity and recruitment of Polycomb complex, with the identification of IDPs involved in chromatin condensation <ref type="bibr">[44]</ref>. Mammalian CHROMOBOX 2 (CBX2) and its homologs may act as IDPs that interact and condense with PRC1, thus facilitating chromatin compaction <ref type="bibr">[45]</ref>. LIKE HETEROCHROMATIN PROTEIN 1 (LHP1) also contains an intrinsically disordered hinge region. This IDR can bind to RNA in vitro and is required for the formation of subnuclear foci in vivo that are reminiscent of Polycomb bodies. Notably, disruption of the IDR abolishes the function of LHP1 in H3K27me3 maintenance <ref type="bibr">[46]</ref>. The IDP EMBRYO DEFECTIVE 1579 (EMB1579) undergoes LLPS to recruit DNA DAMAGE-BINDING PROTEIN 1 (DDB1), CULLIN 4 (CUL4), and MULTIPLE SUPPRESSOR OF IRA 4 (MSI4) to promote PRC2-mediated H3K27me3 modification at the FLC locus <ref type="bibr">[47]</ref>. HETEROCHROMATIN PROTEIN 1 (HP1) recognizes H3K9me3, another hallmark of heterochromatin, and selfoligomerizes to drive chromatin condensation and compaction <ref type="bibr">[48]</ref>. In plants, the HP1 functional analog and H3K9me2 reader protein, AGENET DOMAIN-CONTAINING PROTEIN 1 (ADCP1), fulfils a similar function to drive heterochromatin formation <ref type="bibr">[49]</ref>. Chromatin remodelers utilize ATP hydrolysis to disrupt electrostatic interactions between the positively charged histones and negatively charged DNA <ref type="bibr">[50]</ref>. Several members of the SWI/SNF remolding components, namely BRM (BRAHMA), SWI3C (SWITCH/SUCROSE NONFERMENTING 3C), AN3 (ARATHNICTABA 3), and SWP73B (SWI/SNF-ASSOCIATED PROTEIN 73B), are predicted to contain PrLDs <ref type="bibr">[40]</ref>, and might assume their roles as IDPs in chromatin remodelling.</p><p>Like histone methylation, DNA methylation is also strongly associated with heterochromatin compaction and transcriptional silencing <ref type="bibr">[51]</ref>. In A. thaliana, whereas heterochromatic DNA methylation is facilitated by the chromatin remodelers DDM1 (DECREASED DNA METHYLATION 1) and CMT2 (CHROMOMETHYLASE 2) <ref type="bibr">[52]</ref>, interspersed DNA methylation within transposon-or repeatcontaining euchromatic regions is mediated by an RNA-directed DNA methylation (RdDM) pathway <ref type="bibr">[53]</ref>. The RdDM pathway is initiated by the production of short transcripts by DNAdependent RNA polymerase IV from transposable elements (TEs) and repetitive elements. The nascent RNAs are converted into double-stranded (ds)RNAs by RNA-DEPENDENT RNA PO-LYMERASE 2 (RDR2) in A. thaliana <ref type="bibr">[54,</ref><ref type="bibr">55]</ref>, and these are further processed by DICER-LIKE 3 (DCL3) to produce 24 nt siRNAs <ref type="bibr">[56,</ref><ref type="bibr">57]</ref>. The 24 nt siRNA-loaded ARGONAUTE 4 (AGO4) then directs RNA polymerase V to the correct loci and recruits DNA methyltransferases and additional epigenetic components to initiate DNA methylation <ref type="bibr">[58,</ref><ref type="bibr">59]</ref>. In addition, MICRORCHIDIA (MORC) family proteins are further recruited and interact with the DNA methylation-binding protein SUVH2/9, as well as with chromatin remodelers DRD1, DMS3, and RDM1, at RdDM loci to facilitate efficient establishment of RdDM and transcriptional silencing <ref type="bibr">[60]</ref>.</p><p>The biogenesis and action of the 24 nt siRNAs are linked to discrete nuclear speckles, called Cajal bodies, whose condensation relies on COILIN protein <ref type="bibr">[61]</ref>. COILIN comprises a N-terminal globular domain, a CTD with a Tudor-like structure, and a highly disordered central region that contains a nuclear localization signal (NLS) and an RG (Arg/Gly) box that is subject to Arg dimethylation. Unexpectedly, the N-terminal domain of COILIN mediates its multivalent interaction with partners and condensation of Cajal bodies <ref type="bibr">[62]</ref>. The assembly and activity of Cajal bodies are modulated by phosphorylation <ref type="bibr">[63]</ref> and symmetrical dimethylation of Arg residues <ref type="bibr">[64]</ref> in COILIN protein. Several other proteins, for instance AGO4, in the 24 nt siRNA pathway are predicted to be IDPs and have been reported to colocalize with Cajal bodies <ref type="bibr">[65,</ref><ref type="bibr">66]</ref>. In fact, a proportion of AGO4 localizes to AGO4/NRPD1b bodies, although the majority of AGO4 localizes in Cajal bodies (Figure <ref type="figure">2</ref>). Notably, disruption of COILIN affects the integrity of Cajal bodies and AGO4 colocalization <ref type="bibr">[67]</ref>. Both NRPD1b and AGO4 contain PrLDs <ref type="bibr">[68]</ref>, but how they condense and the role of condensation in RdDM remain to be studied.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IDPs regulate PTGS</head><p>PTGS is primarily guided by small RNAs (sRNAs) exemplified by miRNAs and small interfering RNAs (siRNAs). miRNAs and siRNAs are loaded into AGO proteins to form the RNA-induced silencing complex (RISC) that either cleaves target mRNAs or triggers translational inhibition followed by RNA decay <ref type="bibr">[69]</ref>. These small noncoding RNAs are universally present in eukaryotic organisms, and incongruity of sRNAs and dysregulation of the sRNA machinery can perturb normal development and fitness in both plants and animals. Hence, sRNA pathways afford promising targets for the development of therapeutics and for agronomic improvement <ref type="bibr">[70]</ref><ref type="bibr">[71]</ref><ref type="bibr">[72]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IDPs participate in miRNA biogenesis</head><p>The biogenesis of miRNAs is initiated by the transcription of the primary substrates of miRNAs (pri-miRNAs) by RNA Pol II. In metazoans, pri-miRNAs are processed by the nuclear Microprocessor complex that primarily consists of DROSHA and two DGCR8 proteins that generate the 60-75 nt miRNA precursors (pre-miRNAs) <ref type="bibr">[73,</ref><ref type="bibr">74]</ref>. Pre-miRNAs are further cleaved by the RNase III enzyme DICER1 to yield miRNA/* duplexes <ref type="bibr">[75]</ref>. Interestingly, Cui et al. recently reported that the animal DGCR8 is an IDP and its elevated accumulation can cause imbalanced DGCR8/ DROSHA protein stoichiometry, which in turn can drive irreversible aggregation of Microprocessor and impede pri-miRNA processing <ref type="bibr">[76]</ref>.</p><p>In plants, pri-miRNAs are sequentially processed into miRNA/* by the core Microprocessor complex composed of the RNase III enzyme DCL1, the multifunctional zinc-finger (ZF) protein SERRATE (SE), and the dsRNA-binding protein HYL1 <ref type="bibr">[77]</ref>. Once ectopically expressed, these components tend to be enriched in nuclear speckles, often called dicing bodies (D-bodies) (Figure <ref type="figure">3A</ref>) <ref type="bibr">[78]</ref>. Notably, some auxiliary factors that affect miRNA production often colocalize with the core Microprocessor complex when transiently overexpressed, whereas their mutants display compromised miRNA production accompanied by alteration of D-body number or size <ref type="bibr">[79,</ref><ref type="bibr">80]</ref>. Thus, one prevailing view is that D-body is the main factory site for miRNA production. However, increasing evidence has documented that initial processing of pri-miRNA occurs cotranscriptionally, and subsequent processing may take place in D-bodies or even in structures in the nucleoplasm that are not detectable at the resolution of classic microscopy <ref type="bibr">[81,</ref><ref type="bibr">82]</ref>.</p><p>The core Microprocessor component SE is an IDP that interacts with multiple partners, and has multiple functions in RNA metabolism including miRNA production <ref type="bibr">[83]</ref> and pri-miRNA decay <ref type="bibr">[84]</ref>. Excessive accumulation of SE could lead to the formation of intermediate Microprocessor complexes that are unable to efficiently produce miRNAs <ref type="bibr">[83]</ref>. However, these intermediate complexes appear to remain soluble, and their mode of action may differ from that of animal DGCR8/ Drosha aggregates <ref type="bibr">[76]</ref>. SE has generally been believed to act as a scaffold to recruit the core processing machinery including DCL1/HYL1 to the proper RNA substrates, or vice versa, to generate miRNAs <ref type="bibr">[77,</ref><ref type="bibr">85]</ref>. SE has also been recently reported to mediate LLPS to enhance the processing activity of Microprocessor (Figure <ref type="figure">3A</ref>) <ref type="bibr">[86]</ref>. Whereas SE harbors two major IDRs located in its N-terminal and C-terminal tails, the N-terminal IDR confers its propensity to form condensates <ref type="bibr">[86]</ref>.</p><p>PTMs of IDPs control the homeostasis and activity of the Microprocessor complex Eukaryotic proteins are mostly degraded through a ubiquitin-dependent 26S proteasome pathway. However, IDPs are prone to degradation independently of the process. Indeed, SE protein can undergo ATP-and ubiquitin-independent degradation, clearly reflecting to its inherent feature as an IDP (Figure <ref type="figure">3A</ref>). In vivo, excess SE impedes the assembly of macromolecular complexes and/or is not shielded from the complexes. Consequently, the protein is readily scavenged by PAG1 (PROTEASOME &#913; SUBUNIT 1) and channeled to the 20S proteasome for destruction <ref type="bibr">[83]</ref>. Subsequent work by Wang et al. revealed that the degradome signal for triggering SE degradation is protein phosphorylation via PRP4KA (PRE-MRNA PROCESSING 4 KINASE A) and its homologs. Hypophosphorylated SE variants display elevated binding affinity for HYL1 and are relatively less vulnerable to the activity of 20S proteasome than hyperphosphorylated SE counterparts <ref type="bibr">[87]</ref>. Phosphorylation of SE represents a regulatory mechanism at the post-translational level to rapidly clear excess SE and to maintain homeostasis of SE in vivo to secure its proper functions (Figure <ref type="figure">3A</ref>) <ref type="bibr">[87]</ref>. Because PTMs of IDPs can alter protein conformations, and phase behaviors <ref type="bibr">[88]</ref>, it could be hypothesized that phosphorylation might impact on the LLPS behavior of SE and thus compromise the assembly of Microprocessor complexes.</p><p>Prolyl isomerization can contribute to the regulation of phase separation. A recent report showed that the recruitment of peptidyl prolyl isomerase (PPIA) into IDP droplets triggers condensate dissolution via cis/trans interconversion of prolines in TAU and/or the PRO-ARG DIPEPTIDE REPEAT PROTEIN PR20 <ref type="bibr">[89]</ref>. CYP71 is a PPIA that could interact with Microprocessor components and promote droplet formation (Figure <ref type="figure">3A</ref>). Furthermore, this relies on CYP71 PPIase activity in vitro. Consistently, HYL1-YFP foci that serve as a proxy for D-bodies disappear in cyp71 mutants <ref type="bibr">[90]</ref>. Notwithstanding, the precise proline residue(s) of SE that are targeted by CYP71 remain to be pinpointed, and how proline isomerization failure might impact on SE function in vivo awaits future clarification.  <ref type="bibr">[91,</ref><ref type="bibr">92]</ref>. SAID1 and 2 act as negative regulators of SE stability and miRNA biogenesis, as said1; said2 loss-of-function mutants display increased assembly of Microprocessor and elevated accumulation of miRNAs. Mechanistically, SAID1 and 2 promote PRP4KA-mediated phosphorylation of SE, causing its degradation in vivo. In addition, SAID1/2 display strong binding affinity for pri-miRNAs and can sequester them from SE, leading to the direct inhibition of pri-miRNA processing by Microprocessor in vitro. Importantly, SAID1/2-mediated dual inhibition of miRNA production appears to take place in SE-centered condensates in vivo (Figure <ref type="figure">3A</ref>) <ref type="bibr">[91]</ref>. This notion is highlighted by the observation that SAID1/2 display clear nuclear puncta, suggestive of condensation in wild-type plants, whereas the condensates disappear in se mutants. Because SAID1/2 and SE are all IDPs, it is possible that they might mutually affect biochemical features and functions for each other through co-condensation <ref type="bibr">[91]</ref>. It is also possible that the IDPs might undergo conformational switches into structured proteins upon their association</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Trends in Genetics</head><p>or binding to pri-miRNAs. The latter hypothesis appears to be supported by artificial intelligence (AI) modeling (Figure <ref type="figure">3B</ref>) <ref type="bibr">[93]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RNA helicases regulate condensation of D-bodies</head><p>Recently, the DEAD-box RNA helicases RH6, RH8, and RH12 IDPs have been identified as HYL partners by immunoprecipitation and mass spectrometry (IP-MS) of the HYL1 interactome <ref type="bibr">[94]</ref>.</p><p>The purified RH6, RH8, and RH12 proteins can undergo phase separation in an RNA-and ATP-dependent manner in vitro. The proteins can also physically interact and colocalize with Microprocessor components in nuclear speckles resembling D-bodies in planta. Whereas HYL-YFP foci are eliminated in the knockdown lines of three helicases, the addition of RH12 elevates the size and number of SE droplets in vitro, indicative of the critical roles of these helicases in promoting SE phase separation and D-body assembly (Figure <ref type="figure">3A</ref>) <ref type="bibr">[94]</ref>.</p><p>RH6, RH8, and RH12 also contribute to the assembly and subcellular dynamics of processing bodies (P-bodies) and stress granules (SGs) in vivo <ref type="bibr">[95]</ref>. Because the homologs of these helicases in yeast have long been known to drive the formation and turnover of P-bodies and SGs through LLPS <ref type="bibr">[96]</ref>, RH6, RH8, and RH12 might mechanistically behave similarly to regulate the two RNP complexes in A. thaliana. In addition, RH6, RH8 and RH12 can be also translocated into another cellular compartmentation niche on the chloroplast periphery, called virus bodies because these are where virus replication actively takes place <ref type="bibr">[94]</ref>. Thus, these helicases seem to concurrently regulate the assembly and disassembly of multiple macromolecular complexes in both nuclei and cytoplasm to adapt to physiological changes. Whether all these functions are fulfilled via modulation of the condensation of the complexes remains unclear; however, the regulation is likely to involve RNA substrates given that the RH proteins are RNA helicases.</p><p>The SWI2/SNF2 ATPase CHR2/BRM canonically acts to remodel chromatin to regulate gene transcription. The protein can also act as an RNA helicase that interacts with SE to remodel the secondary structures of pri-miRNAs, thus impeding Microprocessor processing efficiency <ref type="bibr">[97]</ref>. Notably, CHR2 protein itself contains long stretch of low-complexity regions. It is possible that CHR2 and SE might form cocondensates to orchestrate transcription and cotranscriptional processing of transcripts including pri-miRNAs. In addition, SE has been reported to enhance the enzymatic activities of histone methyltransferases ATXR5 (ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5) and ATXR6 in vitro and H3K27me1 accumulation in vivo <ref type="bibr">[98]</ref>. A retrospective thought of this process is that the promotive function of SE might also be granted through its tendency to form condensates.</p><p>IDPs in the regulation of miRNA loading miRNAs are loaded onto AGO proteins to form RISCs. This also involves IDPs and phase separation. In metazoans, the scaffold protein TNRC6 (TRINUCLEOTIDE REPEAT-CONTAINING 6) sets up a bridge between AGO2 and the downstream deadenylase complex through multivalent interactions between the Gly/Trp (GW)-rich domain of TNRC6B, which is located in the IDR, and three evenly spaced Trp-binding pockets in the AGO2 PIWI domain, facilitating subsequent translational repression and decapping <ref type="bibr">[99]</ref>. Another IDP, YBX1, can form condensates to selectively recruit and sort miR-223 into exosomes to mediate intercellular communication <ref type="bibr">[100]</ref>.</p><p>Tomassi et al. identified CARP9, a large IDP, as a new partner of the miRNA pathway in A. thaliana. Mutations in CARP9 produce morphological alterations, a mild reduction in miRNA accumulation, and impaired RNA silencing <ref type="bibr">[92]</ref>. CARP9 mutants also present low accumulation of AGO1 protein and apparent depletion of miRNAs in the cytoplasm. Notably, CARP9 interacts with HYL1, but not with other Microprocessor components. CARP9 is colocalized in SE/HYL1-containing nuclear speckles, but not with DCL1 protein. Intriguingly, CARP9 also associates with AGO1 and HEAT SHOCK PROTEIN 90 (HSP90), as well as with mature miRNAs. This prompted the authors to propose that CARP9 might utilize its IDP feature to act as a scaffold protein to connect HYL1 to AGO1, thus promoting miRNA loading into the RISC before its export to the cytosol (Figure <ref type="figure">3A</ref>) <ref type="bibr">[92]</ref>.</p><p>IDPs in the regulation of 21-22 nt siRNA biogenesis in PTGS Different from miRNAs, siRNA production in plants starts with the conversion of single-stranded (ss)RNA substrate to dsRNA via a complex composed of RNA-DEPENDENT RNA POLYMER-ASE 6 (RDR6) and SUPPRESSOR OF GENE SILENCING 3 (SGS3) <ref type="bibr">[101,</ref><ref type="bibr">102]</ref>. In turn, dsRNAs are processed by the DCL2/4-DRB4 (DOUBLE-STRANDED RNA-BINDING PROTEIN 4) complex to generate 21-22 nt siRNAs <ref type="bibr">[103,</ref><ref type="bibr">104]</ref>. For the production of endogenous trans-acting sRNAs or phased siRNAs, SGS3 interacts directly with the 3&#8242; end of miRNAs in AGO1/AGO7 RISCs that also bind to the processed target transcripts, thus enhancing miRNA-mediated ribosome stalling and the amplification of secondary siRNAs <ref type="bibr">[105]</ref>.</p><p>SGS3 is an IDP and has been speculated to undergo LLPS to form siRNA bodies where PTGSdestined siRNAs are produced (Figure <ref type="figure">4</ref>). SGS3 contains two putative PrLDs located in its N and C termini. The protein also contains ZF, rice gene X and SGS3 (XS), and coiled-coil (CC) domains.</p><p>The XS domain is involved in RNA binding, whereas the CC domain is involved in protein-protein interactions. Indeed, Kim et al. observed that full-length SGS3 protein is distributed into cytoplasmic foci, whereas PrLD-deleted SGS3 protein is in the nucleus. Consistently, recombinant GFP-SGS3 protein forms liquid droplets in vitro, but the PrLD-deleted SGS3 does not exhibit such phase-separating behavior <ref type="bibr">[106]</ref>. Tan et al. carefully assessed the contributions of individual domains to SGS3 condensation. It turns out that the N-terminal PrLD domain is indispensable for SGS3 phase separation, whereas the XS domain is required for the fluid properties of SGS condensates. Notably, the CC domain of SGS3 is responsible for the recruitment of RDR6 into SGS3 condensates <ref type="bibr">[107]</ref>. Interestingly, Tan et al. also reported that both ssRNA and dsRNA can be incorporated into SGS3 condensates. They showed that SGS3 has a higher affinity for ssRNA than for dsRNA <ref type="bibr">[107]</ref>. However, this finding differs from the observations of two other groups <ref type="bibr">[108,</ref><ref type="bibr">109]</ref>. The contradiction might be explained by the fact that Tan et al. used a dual tagged recombinant SGS3 protein (GST-SGS3-GFP) that might interfere with the conformation of SGS3 in the assay, whereas the other teams used tag-free SGS3 in their electrophoretic mobility shift assays. Under physiological conditions, SGS3 protein might bind to ssRNA, but this binding likely entails additional RNA-binding protein(s) exemplified by FVE (FLOWERING LOCUS VE, discussed in the following section).</p><p>SGS3 is predicted to contain multiple phosphorylation sites in its N-terminal LCD <ref type="bibr">[110]</ref>. Han et al. selected two such sites (Ser36 and Ser37) for mutagenesis assays. They reported that the hypophophorylated SGS3 variant (Ser to Ala) undergoes droplet formation whereas the phosphomimetic (Ser to Asp) variant compromises phase separation in a fluorescence recovery assay (Figure <ref type="figure">4</ref>). Importantly, SGS3 phase separation is critical for plant antiviral immunity and fertility. For instance, the phosphomimetic (Ser to Asp) variant lacks antiviral defense capability, reminiscent of the sgs3 mutant, whereas the hypophosphorylated variant can restore the antiviral ability in the sgs3 mutant to the wild-type level <ref type="bibr">[111]</ref>. However, how the phase separation of SGS3 regulates antiviral immunity and fertility in plants awaits future clarification. Furthermore, whether SGS3 is phosphorylated at the two sites in planta and the identity of the kinase(s) that catalyze the phosphorylation of SGS3 remain unknown.</p><p>The behavior and function of SGS3 can be also modulated by cellular factors. Sun et al. recently performed a forward genetic screen aiming to recover mutants compromised in PTGS of transgenes. Surprisingly, a new allele of FVE was identified. FVE is a well-known component that regulates flowering time and cold response at an epigenetic level in A. thaliana <ref type="bibr">[112,</ref><ref type="bibr">113]</ref>. However, Sun observed that FVE could be localized in the nucleus and cytoplasm, while the cytoplasmic portion of FVE could fully rescue the defect of the fve mutant in PTGS, but not the defect in the flowering time. These results indicate that FVE might have a noncanonical role in cytoplasmic RNA silencing. Further mechanistic studies show that FVE targets SGS3 and promotes its homodimerization, which is prerequisite for its function. FVE can also bind to ssRNAs and dsRNAs with moderate affinities. These features might promote the launching of SGS3, which itself has a very low affinity for ssRNAs, onto the ssRNA substrates. FVE binding to RNA substrates can also promote the routing of SGS3/ RNA to DRB4/DCL2/4 complexes for further processing. Indeed, FVE can also interact with DRB4 and promote DRB4/DCL2/4 activity in generating siRNAs from dsRNA substrates. Thus, the very epigenetic factor FVE can synchronize the RDR6/SGS3-DRB4/DCL2/4 channel to promote siRNA production and transgene silencing (Figure <ref type="figure">4</ref>). This notwithstanding, as FVE harbors six tandemly repeated WD40 domains and a small portion of low-complexity regions. Whether FVE contributes to SGS3 dimerization and association with RNA substrates through altering its LLPS, or vice versa, would be an interesting topic for future studies <ref type="bibr">[109]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Concluding remarks and future perspectives</head><p>Recent studies have witnessed the increasingly important roles of IDPs in diverse biological processes in plants and animals. Many IDPs can serve as scaffolds for various RNP complexes such Trends in Genetics Studies on how PTMs alter IDP behaviors, and possibly their condensation, are likely to uncover new regulatory layers that control biological processes. Many IDPs may not harbor specific motifs or domains in their primary sequences, and might evolve neofunctions when bound to their substrates or partners. Furthermore, IDPs might act in multiple niches of condensates or complexes, and precise characterization of their specific functions in these niches will entail new ideas and approaches. as Cajal bodies, D-bodies, siRNA bodies, P-bodies, and SGs, among others, whereas other IDPs can act as regulatory hubs for molecular functions. Numerous IDPs tend to form condensates via phase separation to orchestrate transcription regulation and gene silencing, and among the proteins are FRI, SE, and SGS3 that are the landmarks of epigenetic regulation, miRNA, and siRNA pathways, respectively (Table <ref type="table">S3</ref> in the supplemental information online). These proteins can form condensates to recruit partners and substrates, thus impacting on enzymatic or processing events, or changing downstream regulatory cascades. The condensation behaviors of these proteins themselves can also be modulated by additional IDPs, substrates, or even PTMs. These might be the tip of the iceberg for the IDP field, and new regulatory modes of action are being revealed (see Outstanding questions). This notwithstanding, hundreds or even thousands of proteins have predicted to be IDPs according to different computational analyses, one outstanding issue would be how we can separate the wheat from the chaff and pinpoint the bona fide IDPs among the hypothetic candidates. Alphafold2 might be a powerful tool to complement current computational predictions to further screen potential IDPs in silico <ref type="bibr">[93]</ref>. A related question would be to precisely decode the compositions and interactomes of condensates in physiological contexts. Proximity labeling-based MS would greatly advance the field in this objective <ref type="bibr">[114]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Trends Trends in in Genetics Genetics</head><p>Second, although many of the identified IDPs appear to undergo LLPS or have been interpreted to participate in condensation, the inference that all IDPs have the potential to form condensates is probably an oversimplification. Behavioral characterization of IDPs entails comprehensive studies in vitro and in vivo and needs to be validated by multiple independent lines of evidence. A combination of in vitro quantitative biochemistry and biophysical studies with in vivo investigation via cell biology and genetics could provide a more complete picture of the candidate proteins. These include in vitro thermodynamic parameters of enthalpy and entropy, surface tension parameters of condensate viscosity, inverse capillary velocity, molecular diffusion, and binding affinity, among others. In parallel, integration of fluorescence recovery after photobleaching (FRAP) <ref type="bibr">[115]</ref>, single-photon microscopy <ref type="bibr">[116]</ref>, single-molecule F&#246;rster resonance energy transfer (FRET) <ref type="bibr">[1]</ref>, and proximity labeling <ref type="bibr">[117,</ref><ref type="bibr">118]</ref> would be powerful to decipher the propensities and organizing principles of condensates in vivo. Multiple stringent criteria and integration of advanced methodologies to dissect the composition and properties of biomolecular condensates quantitatively are greatly needed for our understanding of the biological functions of protein condensation and the underlying mechanisms. However, results from in vitro reconstitution systems frequently do not perfectly reiterate in vivo observations under different experimental or biological conditions. In these scenarios, physiological relevance is a predominant factor to reconcile inconsistencies. For instance, whether mutagenesis to disrupt condensation causes functional defects of the proteins in vivo, and whether the process can be rescued by exchange with known LCDs, will be crucial to determine whether condensation is a central feature of a particular protein.</p><p>Third, IDPs, like other folded proteins, are subject to various types of PTMs in response to physiological needs. Studies on how the PTMs alter the behaviors and fates of IDPs, including protein condensation, would be likely to uncover new regulatory layers in the control of biological processes. Several proteomic datasets including phosphorylation are available for the model organisms, and these will be invaluable resources for studying the PTMs of IDPs.</p><p>Fourth, numerous IDPs are crucial components of RNPs. Many IDPs may not be RNA-binding proteins or lack canonical RNA recognition motifs in their primary amino acid sequences. However, some IDPs exemplified by FVE and SAID1/2 appear to have strong affinity for RNA substrates <ref type="bibr">[91,</ref><ref type="bibr">109]</ref>. Whether and how these IDPs have evolved neofunctions in RNA processing or metabolism awaits future in-depth investigation.</p><p>Finally, given that several IDPs are known to be master players in epigenetic and PTGS pathways, manipulation of IDP-regulated processes to control gene-silencing pathways offers new strategies to manipulate biological processes for biotechnological applications and crop improvement. Future efforts in this direction are certain to be rewarding.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx 3</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_3"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_4"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx 7</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_5"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_6"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx 11</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_7"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx 13</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="16" xml:id="foot_8"><p>Trends in Genetics, Month 2024, Vol. xx, No. xx</p></note>
		</body>
		</text>
</TEI>
