<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Transcriptome and epigenome analyses of vernalization in Arabidopsis thaliana</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>08/01/2020</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10216075</idno>
					<idno type="doi">10.1111/tpj.14817</idno>
					<title level='j'>The Plant Journal</title>
<idno>0960-7412</idno>
<biblScope unit="volume">103</biblScope>
<biblScope unit="issue">4</biblScope>					

					<author>Yanpeng Xi</author><author>Sung‐Rye Park</author><author>Dong‐Hwan Kim</author><author>Eun‐Deok Kim</author><author>Sibum Sung</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Background: Vernalization accelerates flowering after prolonged winter cold. Transcriptional and epigenetic changes are known to be involved in the regulation of the vernalization response. Despite intensive applications of next-generation sequencing in diverse aspects of plant research, genome-wide transcriptome and epigenome profiling during the vernalization response has not been conducted.
Results: In this work, we present the first comprehensive analyses of transcriptomic and epigenomic dynamics during the vernalization process in Arabidopsis thaliana. Six major clusters of genes exhibiting distinctive features were identified. Temporary changes in histone H3K4me3 levels were observed that likely coordinate photosynthesis and prevent oxidative damage during cold. In addition, vernalization induced a stable accumulation of H3K27me3 over genes encoding many development-related transcription factors, resulting in either inhibition of transcription or a bivalent status of the genes. Lastly, FLC-like and VIN3-like genes were identified that appear to be novel components of the vernalization pathway.
Conclusions: Our work provides the first comprehensive assessment of transcriptome and epigenome dynamics during the vernalization process and indicates that multiple regulatory pathways are involved in promoting differentiation and phase transitions during vernalization in Arabidopsis.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Background</head><p>Temperature is an important environmental cue that, coupled with day length, signals plants to initiate flowering. For most winter-annual and biennial plants, prevention of flowering before winter and induction of flowering after winter are required for successful reproduction. Cold itself is not a sufficient signal, since temperature fluctuations in fall might be mistaken for the passing of winter. A timing mechanism is needed to distinguish long-term winter cold from short-term chilling stress. Therefore, the vernalization process evolved, which accelerates flowering only after prolonged cold exposure. In winter-annual Arabidopsis thaliana, vernalization is regulated by two major loci: FLOWERING LOCUS C (FLC) and FRIGIDA (FRI) <ref type="bibr">[1]</ref><ref type="bibr">[2]</ref><ref type="bibr">[3]</ref>. FLC encodes a MADS-box transcription factor that represses the expression of downstream targets <ref type="bibr">[4]</ref><ref type="bibr">[5]</ref><ref type="bibr">[6]</ref>. FRI acts with other proteins in a complex to upregulate FLC expression <ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref>. High levels of FLC and its clade members prevent flowering by repressing floral integrator genes such as FLOWERING LOCUS T (FT) and SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) <ref type="bibr">[6,</ref><ref type="bibr">[15]</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref>. Feedback regulation also operates between FLC and floral integrators <ref type="bibr">[21,</ref><ref type="bibr">22]</ref>, forming intricate regulatory networks that control flowering. 
FLC is stably repressed by prolonged winter cold, thereby enabling rapid induction of flowering under favorable day length in spring. The vernalization-triggered FLC repression is mitotically stable and is reset only during meiosis, ensuring that vernalization is required in the next generation <ref type="bibr">[23]</ref>. This "memory" of winter indicates the involvement of epigenetic regulation. Indeed, studies performed during the past decade have begun to elucidate the role of histone modification and chromatin structural dynamics in FLC repression <ref type="bibr">[24]</ref><ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref><ref type="bibr">[30]</ref><ref type="bibr">[31]</ref><ref type="bibr">[32]</ref><ref type="bibr">[33]</ref><ref type="bibr">[34]</ref>.</p><p>Before vernalization, FLC chromatin is enriched with active histone marks, including histone acetylation, H3K4me3, and H3K36me3, which are likely deposited by FRI complexes <ref type="bibr">[8,</ref><ref type="bibr">12,</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref><ref type="bibr">[37]</ref>. Early in vernalization, expression of antisense noncoding RNAs is induced at the FLC locus. Expression of these RNAs, termed COOLAIR (cold induced long antisense intragenic RNA), correlates with the reduction in expression of the FLC sense transcript, and COOLAIR physically associates with FLC chromatin, resulting in depletion of H3K36me3 <ref type="bibr">[38,</ref><ref type="bibr">39]</ref>. Recently, expression of VP1/ABI3-LIKE1 (VAL1) was shown to be necessary for vernalization-mediated reduction of histone acetylation at FLC. VAL1 is a B3 domain protein recruited to FLC through its direct binding to RY motifs within the nucleation region. 
VAL1 recruits histone deacetylase HDA19 to FLC chromatin <ref type="bibr">[33,</ref><ref type="bibr">40]</ref>.</p><p>In the late stage of vernalization, prolonged cold induces a sufficient amount of VERNALIZATION INSENSITIVE 3 (VIN3), a PHD-finger domain protein, which forms a heterodimer with VIN3-LIKE 1 (VIL1); together they recruit POLYCOMB REPRESSIVE COMPLEX 2 (PRC2) to the nucleation region in the first intron of FLC. This PHD-PRC2 complex catalyzes the tri-methylation of histone H3K27, a well-characterized repressive mark <ref type="bibr">[28]</ref>. At this stage, H3K27me3 modifications are confined within the nucleation region. Meanwhile, expression of another noncoding RNA, termed COLDAIR (cold assisted intronic noncoding RNA), is induced from the sense direction of the first intron of FLC. Loss of COLDAIR results in a vernalization-insensitive phenotype <ref type="bibr">[30]</ref>. COLDAIR interacts with CURLY LEAF (CLF), the enzymatic core of PRC2, to facilitate its sequence-specific binding at the FLC locus <ref type="bibr">[30,</ref><ref type="bibr">34]</ref>. When temperatures warm, VIN3 levels decline rapidly, but VIL1-PRC2 remains bound to the FLC locus. H3K27me3 spreads until it covers the entire genomic region of FLC. It is not clear how or why the spreading of repressive marks occurs only when the temperature warms. The accelerated enzymatic activity of histone modifying complexes at higher temperatures might explain this phenomenon. LIKE HETEROCHROMATIN PROTEIN 1 (LHP1) proteins are enriched at FLC following PRC2 action, and these proteins are necessary for stable maintenance of the epigenetically repressed state of FLC in warm conditions. VAL1 recruits LHP1 to FLC through direct protein-protein interactions <ref type="bibr">[40]</ref>. 
The repressive state of FLC is stably inherited through many cycles of cell division during subsequent growth and development.</p><p>In addition to changes in histone modifications, chromatin structural changes also occur at the FLC locus during vernalization. Vernalization induces physical clustering of FLC alleles in the nucleus, which requires Polycomb complex components VERNALIZATION INSENSITIVE 2 (VRN2) and VIL1, but not LHP1 <ref type="bibr">[31]</ref>. An interaction between the 5' and 3' regions of the FLC chromatin is formed before cold and is disrupted during the early stage of vernalization. The mechanism of formation of this loop is not clear. It is known that the transcriptional status of FLC is not relevant to this process and that the components of the PRC2 complex are not necessary <ref type="bibr">[32]</ref>. An intragenic chromatin loop is also induced by vernalization, which could be responsible for the vernalization-induced spreading of H3K27me3 marks along FLC chromatin <ref type="bibr">[34]</ref>. A non-coding RNA derived from the FLC promoter called COLDWRAP is involved in the formation of the intragenic chromatin loop.</p><p>Given the quantitative nature of the vernalization response, it would be helpful to have a comprehensive picture of the transcriptome and epigenome changes that occur during the vernalization process. To date, few vernalization-related next-generation sequencing datasets have been generated, and most come from food crops such as pak choi (Brassica rapa subsp. chinensis) and radish (Raphanus sativus L.) <ref type="bibr">[41,</ref><ref type="bibr">42]</ref>. The RNA-seq and ChIP-seq analyses collected at multiple time points during vernalization described in this work represent the first comprehensive profiling of the transcriptome and epigenome dynamics of vernalization in Arabidopsis thaliana.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results &amp; Discussions</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Transcriptional dynamics of vernalization in Arabidopsis thaliana</head><p>To capture genome-wide transcriptional dynamics during the vernalization process, seven samples were collected, termed NV (without cold exposure), V1h (1-hour cold), V1d (1-day cold), V10d (10-day cold), V20d (20-day cold), V40d (40-day cold), and T10 (40-day cold followed by 10 days at normal growth temperature). The well-known patterns of FLC repression and VIN3 induction were successfully captured by the RNA-seq (Fig. <ref type="figure">1A</ref>, 1B and Table <ref type="table">S1</ref>). FLC belongs to a small gene family that also includes the MADS AFFECTING FLOWERING genes MAF1 (also known as FLOWERING LOCUS M), MAF2, MAF3, MAF4, and MAF5. The RNA-seq data showed relatively similar dynamics of MAF1 and FLC that differed from the patterns of expression of MAF2 and MAF3 (Fig. <ref type="figure">1A</ref>). MAF4 and MAF5 were too low in abundance for a pattern of expression to be confidently resolved by RNA-seq. Of the VIN3 family members, VIL2 showed the highest expression, whereas VIL3 was barely detected. Levels of VIL1 and VIL2 were largely stable across vernalization (Fig. <ref type="figure">1B</ref>).</p><p>Differentially expressed genes (DEGs) were identified by comparison of vernalized samples to NV samples. All the time points, except V1h, showed similar numbers of up- and down-regulated genes (Fig. <ref type="figure">1C</ref>). Only 710 up-regulated and 306 down-regulated genes were identified in V1h samples, indicating that the downstream cascades of cold-regulated genes were initiated by a limited number of early responsive genes. V10d, V20d, and V40d shared 3,485 differentially regulated genes (Fig. <ref type="figure">1D</ref>), suggesting that expression of many cold-regulated genes was stably maintained regardless of the duration of cold. 
That 3,976 of the 5,580 genes differentially expressed at V1d were also differentially expressed at one or more of the V10d, V20d, and V40d time points indicates that long-term responses established within just one day of cold exposure are maintained (Fig. <ref type="figure">1E</ref>).</p><p>To fully explore the time-course dynamics, differentially expressed genes from all time points were clustered based on expression patterns. Six major clusters with distinct transcriptional dynamics were identified (Fig. <ref type="figure">2</ref>). Cluster 1 consisted of a small number of early responsive genes (545) that were up- or down-regulated within just 1 hour of cold treatment (Fig. <ref type="figure">2A</ref>). Gene Ontology (GO) analysis revealed that this cluster was enriched in hormone-related genes, including those related to ethylene, abscisic acid, cytokinin, and salicylic acid (Fig. <ref type="figure">2A</ref>), which is consistent with the fact that plant hormones are usually among the "first responders" to environmental changes and stresses. Members of cluster 2 (2,272 genes) and cluster 3 (1,744 genes) exhibited relatively constant up- and down-regulation, respectively, at time points V1d to V40d (Fig. <ref type="figure">2B</ref>, <ref type="figure">2C</ref>), indicating that these genes are stably regulated throughout cold exposure. GO analysis showed that up-regulated genes in cluster 2 were enriched in translation-related terms (Fig. <ref type="figure">2B</ref>), such as ribosome biogenesis, translation initiation, RNA secondary structure unwinding, and rRNA processing, suggesting that protein synthesis is boosted during prolonged cold, probably to compensate for the reduced enzymatic activity at low temperature.</p><p>Photosynthesis and lipid processing genes were enriched in cluster 3 (Fig. <ref type="figure">2C</ref>), indicating that in Arabidopsis photosynthesis is repressed during cold. 
In evergreen plants, winter cold inhibits the efficiency of photosynthetic CO2 assimilation, which could lead to over-excitation and increased photo-oxidative damage if plants continue to absorb light energy. Therefore, downregulation of light absorption balances the supply and utilization of energy during cold and protects plants from photo-oxidative damage <ref type="bibr">[43]</ref>. Indeed, the photosynthesis-related genes in cluster 3 mostly encode components of light harvesting complexes, suggesting that Arabidopsis utilizes a strategy similar to that of evergreens during winter cold.</p><p>Expression of genes in cluster 4 (911 genes) was gradually induced during cold, in contrast to the constant high levels observed for genes in cluster 2 (Fig. <ref type="figure">2D</ref>). This pattern resembles that of VIN3 during vernalization. Genes related to microtubule movement were present in this cluster. Genes in cluster 5 (1,828 genes) and cluster 6 (1,650 genes) had gradually increased or decreased expression during cold, respectively, and levels of these genes were maintained after the return to warm temperature (Fig. <ref type="figure">2E</ref>, <ref type="figure">2F</ref>). The pattern of expression of genes in cluster 6 resembled that of FLC during vernalization. No functional terms showed obvious enrichment in these two clusters.</p><p>To identify potential protein binding motifs enriched in the six major clusters, 3 kilobases of promoter sequence for each gene were extracted and analyzed using the MEME program for motif discovery and analysis. Five major motifs were discovered with distinct and overlapping enrichment among the clusters (Fig. <ref type="figure">2</ref>, far right; Table <ref type="table">1</ref>). Motif 1 (M1) was enriched in clusters 3, 4, and 6 and motif 2 (M2) in clusters 2 and 5. Motif 3 (M3) and motif 4 (M4) were both enriched in clusters 2 and 4, and motif 5 (M5) was only enriched in cluster 2. 
Overall, gene clusters upregulated during vernalization showed higher motif enrichment, suggesting that induction of genes was regulated by combinations of transcription factors, whereas repression might require distinct mechanisms. The transcription factors with binding motifs that match those enriched in genes differentially expressed during vernalization are listed in Table <ref type="table">2</ref>. Many of these transcription factors are involved in salt stress, hormone signaling, and flowering regulation. Motif 4 was of great interest since it is the binding motif for the ERF/AP2 transcription factors involved in hypoxia signaling <ref type="bibr">[44,</ref><ref type="bibr">45]</ref>, and VIN3 was reported to be induced by hypoxia <ref type="bibr">[46]</ref>. It is also noteworthy that a recent study showed that hypoxia stabilizes the VRN2-containing PRC2 complex to mediate the repression of FLC during vernalization <ref type="bibr">[47]</ref>, suggesting a biological link between hypoxia and vernalization.</p></div>
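<div xmlns="http://www.tei-c.org/ns/1.0"><p>As an illustration of how time-course expression profiles can be grouped by pattern, the following minimal Python sketch z-scores each gene's profile and applies Lloyd's k-means algorithm. It is not the pipeline used in this study: the gene names, expression values, the choice of two clusters, and the fixed initial centroids are all hypothetical and were chosen only to keep the example small and deterministic.</p>
<ab><![CDATA[
```python
import statistics


def zscore(profile):
    """Scale one gene's time course to mean 0 and standard deviation 1."""
    mean = statistics.fmean(profile)
    sd = statistics.pstdev(profile)
    if sd == 0:
        return [0.0 for _ in profile]
    return [(x - mean) / sd for x in profile]


def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(u, v))


def kmeans(points, initial_centroids, max_iter=100):
    """Lloyd's algorithm with caller-supplied initial centroids."""
    centroids = [list(c) for c in initial_centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda k: squared_distance(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: each centroid moves to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical values at NV, V1h, V1d, V10d, V20d, V40d, T10 (not real data).
profiles = {
    "up1": [1, 1, 5, 5, 5, 5, 1],
    "up2": [2, 2, 8, 9, 8, 9, 2],
    "down1": [9, 9, 3, 2, 3, 2, 9],
    "down2": [5, 5, 1, 1, 1, 1, 5],
}
points = [zscore(values) for values in profiles.values()]
labels, centroids = kmeans(points, [points[0], points[2]])
for gene, label in zip(profiles, labels):
    print(gene, label)
```
]]></ab>
<p>Z-scoring before clustering groups genes by the shape of their response rather than by absolute abundance. With real data, the number of clusters and the initialization would need to be chosen and justified separately.</p></div>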
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Histone modification changes during vernalization</head><p>Three well-studied histone modifications, H3K27me3, H3K4me3, and H3K36me3, were analyzed by ChIP-seq at NV, V40d, and T10 (Fig. <ref type="figure">3</ref> and Table <ref type="table">S2</ref>). We first analyzed the distribution of histone marks on FLC chromatin at these time points (Fig. <ref type="figure">3A</ref>). An enrichment of H3K27me3 was observed around the FLC transcription start site at V40d compared to NV. The gene body of FLC exhibited a minor increase of H3K27me3 during cold, whereas the major spreading and coverage of repressive marks occurred only after plants were moved back to warm temperature at T10 (Fig. <ref type="figure">3A</ref>). Consistent with the increase of H3K27me3, a decrease of H3K36me3 along the gene body of FLC was observed over time, although the overall enrichment of H3K36me3 was much lower than that of H3K27me3 at all stages. To our surprise, the H3K4me3 marks showed little change during and after vernalization (Fig. <ref type="figure">3A</ref>), which was quite different from previous reports showing that H3K4me3 undergoes vernalization-induced reduction at the FLC locus. Besides the transcription start site, a minor H3K4me3 peak was observed around the 3'-end of FLC. The function of this peak is unclear. We hypothesize that it could be involved in the formation of a chromatin loop or the expression of antisense transcripts <ref type="bibr">[38,</ref><ref type="bibr">39]</ref>.</p><p>At the genome-wide level, H3K4me3 and H3K36me3 were enriched on actively transcribed genes, whereas H3K27me3 was observed over genes expressed at low levels and over silenced genes (Fig. <ref type="figure">3B</ref>). H3K4me3 peaks were confined around transcription start sites with an average span of about 2 kilobases, whereas H3K36me3 and H3K27me3 extended into gene bodies (Fig. <ref type="figure">3C</ref>). 
Most of the H3K4me3 peaks did not change much in terms of location or intensity during vernalization (Fig. <ref type="figure">3D</ref>). H3K36me3 largely followed the pattern of H3K4me3 distribution (Fig. <ref type="figure">3E</ref>, <ref type="figure">3F</ref>) as expected since both are active histone marks. In total, 19,176, 18,804, and 19,176 peaks were called for H3K4me3 in NV, V40d, and T10 samples, respectively, and 13,968, 13,859, and 13,601 peaks were called for H3K36me3 at these time points (Fig. <ref type="figure">3D</ref>). These numbers represent two-thirds of the coding genes in the Arabidopsis genome, which roughly matches the number of actively transcribed genes. Thus, nearly every actively transcribed gene has an H3K4me3 peak located at its transcription start site. The lower numbers of genes marked by H3K36me3 compared to H3K4me3 are probably due to the overall lower enrichment levels for H3K36me3 compared to H3K4me3 detected in our ChIP-seq analysis. As expected due to the synergistic function of these modifications in transcriptional regulation, 98.6% of H3K36me3 peaks overlapped with an H3K4me3 peak (Fig. <ref type="figure">3F</ref>).</p><p>A much smaller number of peaks was called for H3K27me3 than for the active histone marks, with 5,969, 7,463, and 7,236 peaks in NV, V40d, and T10 samples, respectively (Fig. <ref type="figure">3D</ref>). Only 2.0% to 4.7% of peaks, depending on the time point, overlapped between these two marks. Surprisingly, a large portion of H3K27me3 peaks (33.2%) overlapped with H3K4me3 marks (Fig. <ref type="figure">3F</ref>), resulting in the so-called "bivalent" status for the underlying genes <ref type="bibr">[48]</ref><ref type="bibr">[49]</ref><ref type="bibr">[50]</ref><ref type="bibr">[51]</ref>. 
GO analysis indicated that transcription factors were highly enriched in the group of genes with bivalent histone marks, suggesting that the combination of H3K27me3 and H3K4me3 could be required for flexible regulation of transcription factors in Arabidopsis. The transcription factors with bivalent marks are listed in Supplemental Table <ref type="table">S3</ref>.</p></div>
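<div xmlns="http://www.tei-c.org/ns/1.0"><p>The overlap percentages reported in this section are, in essence, the fraction of peaks of one mark that intersect at least one peak of another mark. The short Python sketch below shows one way such a fraction could be computed. It assumes BED-style half-open coordinates and a requirement of at least one shared base, neither of which is stated in the text, and the peak coordinates are invented for illustration.</p>
<ab><![CDATA[
```python
def overlaps(a, b):
    """True if two (chrom, start, end) half-open intervals share a base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def fraction_overlapping(query, subject):
    """Fraction of query peaks that overlap at least one subject peak."""
    if not query:
        return 0.0
    hits = sum(1 for q in query if any(overlaps(q, s) for s in subject))
    return hits / len(query)


# Hypothetical peaks (not real data): (chromosome, start, end).
k27_peaks = [("chr1", 100, 500), ("chr1", 1000, 1500), ("chr2", 200, 400)]
k4_peaks = [("chr1", 450, 700), ("chr2", 400, 600)]

frac = fraction_overlapping(k27_peaks, k4_peaks)
print(f"{frac:.1%} of H3K27me3 peaks overlap an H3K4me3 peak")
```
]]></ab>
<p>The nested scan is quadratic in the number of peaks, which is acceptable for a toy example. For tens of thousands of peaks, sorting the intervals or using an interval tree would be the usual choice.</p></div>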
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Vernalization causes an overall increase of H3K27me3 in the Arabidopsis genome</head><p>Vernalization had minimal effect on H3K36me3 distribution, as peaks from V40d and T10 correlated almost perfectly with NV samples (Fig. <ref type="figure">4A</ref>). A temporary effect of vernalization on H3K4me3 was enhanced diffusion at V40d; patterns of H3K4me3 at T10 were similar to those at NV. In contrast, vernalization-induced H3K27me3 changes observed at V40d were maintained after plants were moved back to warm temperature at T10 (Fig. <ref type="figure">4A</ref>). To quantify the differential peaks among samples, reads within each peak were extracted and converted to digital counts for statistical analysis. Consistent with the correlation analysis, only 9.9% of H3K36me3 peaks were differentially regulated; a slightly higher percentage of H3K4me3 peaks (15.6%) were differentially regulated. In contrast, over one-third of H3K27me3 peaks (36.6%) were differentially regulated by vernalization (Fig. <ref type="figure">4B</ref>). Surprisingly, the direction of change of these differentially regulated peaks was not evenly distributed: cold induced an overall decrease of H3K4me3 and increase of H3K27me3 at V40d (Fig. <ref type="figure">4C</ref>). The absence of down-regulated H3K27me3 peaks indicates a potential unidirectional action of H3K27me3 for switching off genes and suggests that, once added, the H3K27me3 mark is difficult to remove. To confirm the ChIP-seq results, several genes were randomly chosen for validation. Quantitative real-time PCR (qRT-PCR) validated the ChIP-seq results (Fig. <ref type="figure">S1</ref>).</p><p>The group of genes with cold-induced reduction of H3K4me3 were enriched with photosynthesis-related terms (Fig. <ref type="figure">4D</ref>), as was cluster 3 of cold down-regulated genes (Fig. <ref type="figure">2C</ref>). 
Therefore, it is likely that in Arabidopsis the temporary removal of H3K4me3 marks at the transcription start site decreases the expression of photosynthesis genes to prevent photo-oxidative damage during cold and quickly restores their activities in warm temperature to ensure normal growth and development. The factors involved in this temperature-induced H3K4me3 change are currently unknown.</p><p>Interestingly, transcription factors from almost all families were strongly enriched in the group of genes with H3K27me3 peaks up-regulated during vernalization (Fig. <ref type="figure">4E</ref>). Of the 335 transcription factor genes that had strongly up-regulated H3K27me3, 155 were marked also with H3K4me3 (Fig. <ref type="figure">4F</ref>). GO analysis revealed that floral regulator genes were enriched in the group of transcription factors with vernalization-induced H3K27me3 modifications (Table <ref type="table">3</ref>), confirming that vernalization promotes the transition from vegetative growth to reproductive growth through epigenetic switching off of regulatory hub genes in Arabidopsis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Identification of FLC-like and VIN3-like transcripts</head><p>We hypothesized that any gene with a repression pattern similar to that of FLC or an induction pattern similar to that of VIN3 upon cold treatment could have similar functions during vernalization. Cluster 6 and cluster 4 included genes with patterns of expression similar to those of FLC and VIN3, respectively (Fig. <ref type="figure">2D</ref>, <ref type="figure">2F</ref>). A dynamic time warping (DTW) algorithm was used to identify optimal matches within each cluster. DTW was first used in speech recognition for measuring the similarity between soundtracks <ref type="bibr">[52]</ref>. The advantage of DTW over simple pairwise comparison is that it allows the stretching and compression of input sequences. In this work, the time-series transcriptional dynamics of two genes were given as inputs, and a distance score was then calculated (Fig. <ref type="figure">5A</ref>, <ref type="figure">5B</ref>). The lower the distance score, the higher the similarity of the two expression patterns (Fig. <ref type="figure">5B</ref>).</p><p>All genes within cluster 6 were compared to FLC using DTW, and the resulting distance scores were ranked from low to high (Fig. <ref type="figure">5C</ref>). Genes in cluster 4 were ranked for similarity to the VIN3 expression pattern (Fig. <ref type="figure">5D</ref>). To validate the transcriptional profiles of the FLC- and VIN3-like genes identified by the DTW algorithm, transcripts from five genes from each category were quantified in time-course samples. The results of qRT-PCR were consistent with the RNA-seq profiles (Fig. <ref type="figure">5E</ref>, <ref type="figure">5F</ref>). Several of the FLC- and VIN3-like genes are known floral regulators and cold-related genes that could be novel components of the vernalization pathway (Tables <ref type="table">4</ref> and <ref type="table">5</ref>). 
Interestingly, of the top 10 VIN3-like genes, three encode factors involved in meiotic recombination (Table <ref type="table">5</ref>), suggesting that VIN3 may have a role in meiotic recombination or may regulate chromatin contact.</p></div>
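<div xmlns="http://www.tei-c.org/ns/1.0"><p>To make the DTW ranking described above concrete, the following Python sketch implements the textbook dynamic-programming form of DTW with an absolute-difference step cost and uses it to rank candidate profiles against a reference profile. The specific DTW implementation, step cost, and any normalization used in this study are not stated in the text, so these are assumptions. The gene names and expression values are hypothetical.</p>
<ab><![CDATA[
```python
def dtw_distance(a, b):
    """Dynamic time warping distance between two numeric sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            # Allowed moves: advance a only, advance b only, or advance both.
            cost[i][j] = step + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def rank_by_similarity(reference, profiles):
    """Return (gene, distance) pairs sorted from most to least similar."""
    scored = [(gene, dtw_distance(reference, values)) for gene, values in profiles.items()]
    return sorted(scored, key=lambda pair: pair[1])


# Hypothetical values at NV, V1h, V1d, V10d, V20d, V40d, T10 (not real data).
reference = [10.0, 10.0, 9.0, 6.0, 4.0, 2.0, 2.0]
profiles = {
    "geneA": [10.0, 9.0, 9.0, 6.0, 4.0, 2.0, 2.0],   # same decline, shifted earlier
    "geneB": [2.0, 2.0, 3.0, 5.0, 7.0, 9.0, 9.0],    # opposite trend
    "geneC": [10.0, 10.0, 10.0, 9.0, 6.0, 4.0, 2.0], # same decline, shifted later
}
ranking = rank_by_similarity(reference, profiles)
for gene, distance in ranking:
    print(gene, distance)
```
]]></ab>
<p>Because DTW allows time points to be stretched or compressed, the two profiles that show the same decline with a shifted onset score as close to the reference, whereas the profile with the opposite trend ranks last.</p></div>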
<div xmlns="http://www.tei-c.org/ns/1.0"><head>AHL family genes act as floral repressors in the vernalization pathway</head><p>The 10 most FLC-like genes included two AT-hook family genes, AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 21 (AHL21) and AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 22 (AHL22). qRT-PCR confirmed their expression patterns (Fig. <ref type="figure">6B</ref>). As found in our analysis of FLC, stable increases of H3K27me3 were observed on both loci during and after vernalization (Fig. <ref type="figure">6A</ref>). Previous studies have shown that AHL family genes are involved in control of flowering <ref type="bibr">[53]</ref><ref type="bibr">[54]</ref><ref type="bibr">[55]</ref><ref type="bibr">[56]</ref>. AHL family members exist in nearly all plant species sequenced so far, ranging from moss to higher plants. In Arabidopsis, the AHL family contains 29 members with conserved AT-hook motifs known to bind to AT-rich DNA sequences <ref type="bibr">[57,</ref><ref type="bibr">58]</ref>. In addition to roles in regulation of flowering, AHL family members function in diverse aspects of plant growth and development, including hypocotyl elongation, floral development, and light responses <ref type="bibr">[53-56, 59, 60]</ref>.</p><p>AHL genes have evolved into two phylogenetic clades. Clade A comprises intron-less genes with only one AT-hook motif, whereas clade B comprises intron-containing genes with one or two AT-hook motifs (Fig. <ref type="figure">S2A</ref>) <ref type="bibr">[57]</ref>. Besides AHL21 and AHL22, several other AHL family members also showed FLC-like transcriptional dynamics during vernalization as well as up-regulated H3K27me3 marks (Fig. <ref type="figure">S2B</ref>), including AHL19, AHL20, AHL23, AHL24, AHL25, AHL27, and AHL29. 
Interestingly, all of the FLC-like AHLs belong to the intron-less clade A, suggesting that clade A of AHL genes could be an ancient family involved in cold response.</p><p>To further confirm the biological function of AHL genes in vernalization, we obtained knockout and overexpression lines of AHL22 and tested their flowering phenotypes with and without vernalization. The ahl22 mutants were not significantly different from wild-type plants, probably due to the highly redundant functions of AHL family members. However, overexpression of AHL22 in Col-0 caused late flowering without vernalization, similar to Col-0 (FRI) (Fig. <ref type="figure">6C</ref>, top), and flowering was accelerated after 40 days of cold treatment (Fig. <ref type="figure">6C</ref>, bottom). Quantitative measurement indicated that overexpression of AHL22 in Col-0 increased the number of rosette leaves to a level comparable to, but lower than, that of FRI_Col-0 without vernalization (Fig. <ref type="figure">6C</ref>, top). Vernalization partially rescued the late-flowering phenotype of the AHL22 overexpression line but was less effective than in FRI_Col-0 (Fig. <ref type="figure">6D</ref>), suggesting that AHL22 might function in parallel to FLC in regulating downstream floral genes. Altogether, we propose that AHL family genes, especially those belonging to clade A, may be ancient yet novel floral regulators in the vernalization pathway that are switched off by prolonged cold-induced H3K27me3 to assist the acceleration of flowering in Arabidopsis thaliana.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>This work presents the first profile of dynamic transcriptome and epigenome changes during vernalization in Arabidopsis thaliana. RNA-seq data were collected for samples without cold exposure, with 1-hour, 1-day, 10-day, 20-day, and 40-day exposure to cold, and with a 40-day cold exposure followed by 10 days at normal growth temperature. Analyses revealed six major clusters of differentially regulated genes. Plant hormone signaling genes were among those with altered expression immediately after exposure to cold. Throughout the exposure to cold, translation-related genes were up-regulated to enable efficient protein synthesis when enzymatic activities were limited by low temperature. Also throughout the cold exposure, photosynthesis-related genes were down-regulated to prevent photo-oxidative damage caused by excessive energy production. Potential protein-binding motifs within each cluster suggest interesting candidates for further studies.</p><p>Genome-wide profiling of histone modifications, including H3K4me3, H3K36me3, and H3K27me3, showed a temporary reduction of H3K4me3 at photosynthesis-related genes after 40 days of exposure to cold and up-regulation of H3K27me3 after 40 days of cold with and without 10 days at optimal growing temperature. About one-third of the H3K27me3 peaks in the Arabidopsis genome were regulated by vernalization; most of the underlying genes encode transcription factors, and most harbor bivalent marks of both H3K4me3 and H3K27me3. In mammalian systems, bivalent histone modifications play critical roles in embryonic development and cell lineage commitment <ref type="bibr">[49]</ref><ref type="bibr">[50]</ref><ref type="bibr">[51]</ref>. 
Little is known about the functions of bivalent marks in Arabidopsis, but our finding that thousands of genes, including a large portion of transcription factors, harbor both H3K4me3 and H3K27me3 suggests that "bivalency" may allow rapid switching of the transcription status of Arabidopsis genes critical to functions like flowering.</p><p>The time-course patterns of transcriptome and epigenome changes allowed us to identify novel components of the vernalization pathway. A number of FLC-like and VIN3-like genes were discovered through classification and pattern recognition. Among them, one AHL family gene was confirmed to be a repressor of flowering that was epigenetically silenced during vernalization. Additional candidates will be interesting targets for further studies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Plant materials and growth conditions</head><p>Arabidopsis Col-0 carrying a functional FRI allele was used as the wild-type strain. Standard growth conditions were 22 &#176;C with a 16-h light/8-h dark (long day) photoperiodic cycle under white fluorescent light. Seeds were surface sterilized, placed on agar medium, and kept in the dark at 4 &#176;C for 3 days for stratification. For vernalization treatment, seedlings were grown for 7 days at 22 &#176;C and then either harvested as the non-vernalized (NV) sample or transferred to 4 &#176;C under short day (8-h light/16-h dark) for 1 h (V1h), 1 day (V1d), 10 days (V10d), 20 days (V20d), or 40 days (V40d) of treatment. The T10 sample was kept at 4 &#176;C for 40 days followed by 10 days at 22 &#176;C before harvesting.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RNA extraction and qRT-PCR</head><p>Harvested samples were flash-frozen in liquid nitrogen. Total RNA was extracted using the Trizol/chloroform method. Extracted RNA was treated with DNase I to eliminate genomic DNA contamination. Around 2 &#181;g of total RNA was used for cDNA synthesis using M-MLV reverse transcriptase (Promega). qRT-PCR was performed using SYBR green reaction mix (Applied Biosystems) according to the manufacturer's instructions on a Viia7 Real-Time PCR system (Applied Biosystems).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Chromatin Immunoprecipitation (ChIP)</head><p>Seedlings were crosslinked at 4 &#176;C with 1% formaldehyde solution under vacuum for 25 min. The reaction was terminated by addition of 0.125 M glycine. Crosslinked seedlings were rinsed in distilled water and then flash frozen in liquid nitrogen. ChIP was performed following the Abcam</p>
<p>(A) Quantitative measurement of expression levels of FLC family genes over a time course during vernalization, in reads per kilobase of transcript per million mapped reads (RPKM). Error bars were generated based on normalized read counts within each locus from 2 biological replicates. (B) Quantitative measurement of expression levels of VIN3 family genes over a time course during vernalization, in RPKM. Error bars were generated based on normalized read counts within each locus from 2 biological replicates. (C) Bar graph showing total numbers of differentially up-regulated (red) and down-regulated (blue) genes at each time point relative to NV. (D) Venn diagram showing the overlapping and uniquely differentially regulated genes at V10d, V20d, and V40d. (E) Venn diagram showing the overlapping and uniquely differentially regulated genes at V1h, V1d, V10d/20d/40d, and T10.</p>
<p>(A-F) Clusters 1 to 6, respectively, were generated from k-means clustering of transcription profiles obtained over the time course of vernalization. Shown from left to right for genes in the indicated cluster are heatmaps of gene expression at each time point, normalized box plots of gene expression at each time point, enriched GO terms, and motifs enriched within clustered genes if detected.</p>
<p>(A) Genome browser illustration of normalized ChIP-seq and RNA-seq results at the FLC locus. H3K4me3 tracks are shown in red, H3K36me3 in green, and H3K27me3 in blue. RNA-seq results are shown in grey. (B) Heatmaps of H3K4me3 (red), H3K36me3 (green), and H3K27me3 (blue) over all coding genes in the Arabidopsis genome. Each row represents the normalized read density from transcription start site (TSS) to transcription end site (TES) of each gene, ranked by transcription level from the highest (top) to the lowest (bottom). (C) Averaged profiles of H3K4me3 (red), H3K36me3 (green), and H3K27me3 (blue) distributions around TSS regions over all coding genes in the Arabidopsis genome. (D) Bar graph showing total number of peaks called by MACS2 within each sample. (E) Correlation plot of genome-wide H3K4me3 and H3K36me3 densities. (F) Venn diagrams showing overlaps among different histone marks.</p>
<p>vernalization. (A) Correlation plots of densities of H3K4me3 in red, H3K36me3 in green, and H3K27me3 in blue in V40d vs. NV (left) and T10 vs. NV (right) samples. (B) Pie graph showing the percentages of histone modification peaks differentially regulated during vernalization. (C) Bar graph showing the number of vernalization up-regulated (darker hues) and down-regulated (lighter hues) H3K4me3 (red), H3K36me3 (green), and H3K27me3 (blue) peaks. (D) Bar graph showing top GO terms ranked by enrichment score from H3K4me3 temporarily down-regulated loci, with p-value. (E) Bar graph showing top GO terms ranked by enrichment score from H3K27me3 up-regulated loci, with p-value.</p>
<p>dynamic warping algorithm. (A) Illustration of Dynamic Time Warping algorithm. (B) Examples of sequences with low (left), medium (middle), and high (right) distance scores. (C) Bar graph showing the RNA-seq results of the 10 genes (in shades of grey) that most closely resemble the vernalization-mediated repression pattern of FLC (red). (D) Bar graph showing the RNA-seq results of the 10 genes (in shades of grey) that most closely resemble the vernalization-mediated induction pattern of VIN3.</p>
<p>Table 1. Motifs enriched in each cluster of genes differentially regulated during vernalization.</p>
<p>Table 2. Transcription factors (TFs) with binding motifs similar to those identified in genes differentially regulated during vernalization.</p>
<table><head>Table 3. Functional annotations of transcription factors with vernalization-induced H3K27me3 up-regulation.</head>
<row role="label"><cell>Functional annotation</cell><cell>Number of genes</cell><cell>Enrichment score</cell><cell>P-value</cell></row>
<row><cell>cell differentiation</cell><cell>59</cell><cell>40.82</cell><cell>1.30E-13</cell></row>
<row><cell>ethylene signaling pathway</cell><cell>30</cell><cell>17.4</cell><cell>1.00E-19</cell></row>
<row><cell>flower development</cell><cell>25</cell><cell>11.12</cell><cell>2.20E-13</cell></row>
<row><cell>carpel development</cell><cell>10</cell><cell>9.4</cell><cell>1.30E-11</cell></row>
<row><cell>ovule development</cell><cell>11</cell><cell>6.85</cell><cell>5.40E-09</cell></row>
<row><cell>regulation of secondary cell wall biogenesis</cell><cell>8</cell><cell>6.82</cell><cell>5.20E-09</cell></row>
<row><cell>vegetative to reproductive phase transition of meristem</cell><cell>14</cell><cell>6.1</cell><cell>3.30E-08</cell></row>
<row><cell>gibberellic acid signaling pathway</cell><cell>11</cell><cell>4.82</cell><cell>6.50E-07</cell></row>
<row><cell>specification of floral organ identity</cell><cell>6</cell><cell>4.57</cell><cell>1.30E-06</cell></row>
<row><cell>trichome differentiation</cell><cell>7</cell><cell>4.32</cell><cell>2.40E-06</cell></row>
<row><cell>transmitting tissue development</cell><cell>4</cell><cell>3.42</cell><cell>2.20E-05</cell></row>
<row><cell>auxin signaling pathway</cell><cell>14</cell><cell>3.21</cell><cell>3.80E-05</cell></row>
</table>
<table><head>Table 4. Genes with expression patterns similar to FLC during the course of vernalization.</head>
<row role="label"><cell>Locus</cell><cell>Name</cell><cell>Protein domain</cell><cell>Reported function</cell></row>
<row><cell>AT1G51860</cell><cell>-</cell><cell>LRR, protein kinase</cell><cell>-</cell></row>
<row><cell>AT2G45430</cell><cell>AHL22</cell><cell>AT-hook DNA-binding</cell><cell>regulation of flowering</cell></row>
<row><cell>AT2G35270</cell><cell>AHL21</cell><cell>AT-hook DNA-binding</cell><cell>patterning and differentiation of reproductive organs</cell></row>
<row><cell>AT4G37390</cell><cell>AUR3</cell><cell>GH3 auxin-responsive</cell><cell>negative component in auxin signaling</cell></row>
<row><cell>AT5G06800</cell><cell>-</cell><cell>Myb-like DNA-binding</cell><cell>-</cell></row>
<row><cell>AT3G26760</cell><cell>-</cell><cell>glucose dehydrogenase</cell><cell>-</cell></row>
<row><cell>AT1G74770</cell><cell>BTSL1</cell><cell>zinc finger</cell><cell>negative regulator of iron deficiency</cell></row>
<row><cell>AT3G53620</cell><cell>PPA4</cell><cell>pyrophosphatase</cell><cell>regulate pyrophosphate levels</cell></row>
<row><cell>AT5G40780</cell><cell>LHT1</cell><cell>transmembrane</cell><cell>high-affinity transporter for cellular amino acid uptake</cell></row>
<row><cell>AT5G03150</cell><cell>JKD</cell><cell>zinc finger</cell><cell>epidermal patterning in root meristem</cell></row>
</table>
<table><head>Table 5. Genes with expression patterns similar to VIN3 during the course of vernalization.</head>
<row role="label"><cell>Locus</cell><cell>Name</cell><cell>Protein domain</cell><cell>Reported function</cell></row>
<row><cell>AT5G44565</cell><cell>-</cell><cell>transmembrane</cell><cell>-</cell></row>
<row><cell>AT5G55450</cell><cell>LTP4.4</cell><cell>-</cell><cell>lipid transport and pathogen resistance</cell></row>
<row><cell>AT2G01150</cell><cell>RHA2B</cell><cell>zinc finger</cell><cell>ABA signaling and drought response</cell></row>
<row><cell>AT1G63990</cell><cell>SPO11-2</cell><cell>DNA topoisomerase VI</cell><cell>regulate meiotic recombination</cell></row>
<row><cell>AT3G27730</cell><cell>MER3</cell><cell>DEAD-like helicase</cell><cell>required for meiotic crossover formation</cell></row>
<row><cell>AT4G12480</cell><cell>EARLI1</cell><cell>plant lipid transfer</cell><cell>resistance to low temperature and fungal infection</cell></row>
<row><cell>AT4G21940</cell><cell>CPK15</cell><cell>protein kinase</cell><cell>-</cell></row>
<row><cell>AT5G52290</cell><cell>SHOC1</cell><cell>similar to XPF endonucleases</cell><cell>required for class-I meiotic crossover formation</cell></row>
<row><cell>AT5G24860</cell><cell>FPF1</cell><cell>-</cell><cell>regulate the competence to flowering</cell></row>
<row><cell>AT5G46600</cell><cell>-</cell><cell>malate transporter</cell><cell>-</cell></row>
</table></div></body>
		</text>
</TEI>
