<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Universality of superconcentration intheSherrington–Kirkpatrick model</title></titleStmt>
			<publicationStmt>
				<publisher>Wiley</publisher>
				<date>03/01/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10519378</idno>
					<idno type="doi">10.1002/rsa.21183</idno>
					<title level='j'>Random Structures &amp; Algorithms</title>
<idno>1042-9832</idno>
<biblScope unit="volume">64</biblScope>
<biblScope unit="issue">2</biblScope>					

					<author>Wei‐Kuo Chen</author><author>Wai‐Kit Lam</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>Abstract</title> <p>We study the universality of superconcentration for the free energy in the Sherrington–Kirkpatrick model. In [10], Chatterjee showed that when the system consists of spins and Gaussian disorders, the variance of this quantity is superconcentrated by establishing an upper bound of order , in contrast to the bound obtained from the Gaussian–Poincaré inequality. In this paper, we show that superconcentration indeed holds for any choice of centered disorders with finite third moment, where the upper bound is expressed in terms of an auxiliary nondecreasing function that arises in the representation of the disorder as for standard normal. Under an additional regularity assumption on , we further show that the variance is of order at most .</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION AND MAIN RESULTS</head><p>The Sherrington-Kirkpatrick (SK) model is an important mean-field spin glass that was introduced to explain some unusual magnetic behaviors of certain alloys. For a given disorder (random variable) h with finite second moment, a given (inverse) temperature &#120573; &gt; 0, and any integer N &#8805; 1, its Hamiltonian is defined as</p><p>N , Random Struct Alg. 2024;64:267-286. wileyonlinelibrary.com/journal/rsa &#169; 2023 Wiley Periodicals LLC 267</p><p>where (h ij ) 1&#8804;i,j&#8804;N are i.i.d. copies of h. One of the main objectives in the study of the SK model is to understand the limit of the free energy, (&#120590;) , which has attracted a lot of attention in physics as well as in mathematics communities, see, for instance, <ref type="bibr">[17,</ref><ref type="bibr">18,</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref>. This paper is concerned with the order of fluctuation for the free energy. When h is standard Gaussian, it can be checked directly from the Gaussian-Poincar&#233; inequality that Var(F N (&#120573;)) = O(N). It is natural to ask whether one can improve this bound as Var(F N (&#120573;)) = o(N), a phenomenon called superconcentration, introduced in the pioneering work of Chatterjee <ref type="bibr">[10,</ref><ref type="bibr">11]</ref>. In light of this notion, superconcentration was established with the bound that for any &#120573; &gt; 0, there exists a constant C = C(&#120573;) &gt; 0 such that</p><p>When &#120573; &#8804; 1&#8725; &#8730; 2, much sharper bounds were also obtained in the literature. In the case of &#120573; &lt; 1&#8725; &#8730; 2, <ref type="bibr">[2]</ref> showed that F N (&#120573;) satisfies a central limiting theorem, and their result implies that Var(F N (&#120573;)) = &#920;(1) <ref type="foot">1</ref> . At &#120573; = 1&#8725; &#8730; 2, it was predicted by <ref type="bibr">[4,</ref><ref type="bibr">19]</ref> that a sharp phase transition should occur, namely, Var(F N (&#120573;)) = &#920;(log N). Along this conjecture, a partial result Var(F N (&#120573;)) = O((log N) 2 ) was known by the authors, see <ref type="bibr">[14]</ref>. Interestingly, if one now considers the SK model in the presence of an external field, that is, replacing -H N (&#120590;) with -H N (&#120590;) + r &#8721; N i=1 &#120590; i for some r &gt; 0 in the free energy, then it was known in <ref type="bibr">[13]</ref> that the corresponding free energy obeys a central limit theorem and Var(F N (&#120573;)) = &#920;(N), agreeing with the rate obtained from the Gaussian-Poincar&#233; inequality instead of exhibiting superconcentration.</p><p>While the aforementioned results addressed superconcentration assuming that the disorder is Gaussian, we aim to investigate this phenomenon for more general choice of disorders. Note that for an arbitrary h with finite second moment, the Efron-Stein inequality readily implies Var(F N (&#120573;)) = O(N). In contrast to this bound, we say that the free energy is superconcentrated if Var(F N (&#120573;)) = o(N).</p><p>To state our main results, we assume that Eh = 0 and Eh 2 = 1. We express h = f (g) for some nondecreasing function f and a standard Gaussian random variable g. Let g 1 , g 2 be independent copies of g. For 0 &#8804; t &#8804; 1, define</p><p>where</p><p>Note that w(t) &#8594; 1 as t &#8593; 1 if we further assume E|h| 3 &lt; &#8734;. Indeed, since f is nondecreasing, f (g 1 t )f (g 2 t ) converges to f (g) 2 almost surely as t &#8593; 1 and for any M &gt; 0, denoting</p><p>On the other hand, from the H&#246;lder inequality and using the union bound,</p><p>These together yield the desired limit. With this, our first main result shows that superconcentration for the free energy holds for any h with a finite third moment, where the upper bound for the variance is related to the rate of convergence of w(t) at 1.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Theorem 1.</head><p>There exist positive constants c, K &gt; 0 depending only on &#120573; such that whenever h satisfies Eh = 0, Eh 2 = 1, and E|h| 3 &lt; &#8734;, we have</p><p>In the next main result, we let f be an arbitrary absolutely continuous function and take h = f (g). Note that now f is not necessarily nondecreasing. If we make the assumption that Eh = 0, Eh 2 = 1, and E|f &#8242; (g)| 3 &lt; &#8734;, then we can obtain superconcentration for the free energy in the same rate as (1). Theorem 2. There exists a constant K &gt; 0 depending only on &#120573; such that whenever h satisfies that Eh = 0, Eh 2 = 1, and f is absolutely continuous with E|f &#8242; (g)| 3 &lt; &#8734;, we have</p><p>A few remarks are in order.</p><p>Remark 1. We do not expect to obtain the bound N&#8725; log N in Theorem 2 directly from Theorem 1. Nevertheless, if in Theorem 1 we assume additionally that f is differentiable and</p><p>, which follows from the mean value theorem and the fact that w &#8242; is nondecreasing, see <ref type="bibr">(8)</ref> below. As a result,</p><p>and this implies that Var(F N (&#120573;)) = O(N log log N&#8725; log N).</p><p>Remark 2. Under the assumption that the first four moments of h agree with those of g and h has a finite fifth moment, one can manage to match the first and second moments of the free energies associated to h and g asymptotically by using the approximate Gaussian integration by parts, which will lead to Var(F N ) = O(N&#8725; log N) as ( <ref type="formula">1</ref>) and ( <ref type="formula">3</ref>), see, for example, <ref type="bibr">[9]</ref>. Our main results address superconcentration by reducing the moment assumption as much as possible.</p><p>Remark 3. In a more general framework, one can consider the mixed even p-spin model, whose Hamiltonian is defined as</p><p>where h i 1 , &#8230; ,i 2p for all 1 &#8804; i 1 , &#8230; , i 2p &#8804; N and all p &#8805; 1 are i.i.d. copies of h and (&#120573; p ) p&#8805;1 is a real sequence with &#8721; &#8734; p=1 2 p &#120573; 2 p &lt; &#8734;. In <ref type="bibr">[10]</ref>, Chatterjee showed that the corresponding free energy is again superconcentrated as long as h is standard normal. We point out that the same results in Theorems 1 and 2 also hold in this setting by the same proofs.</p><p>We now discuss three applications of Theorems 1 and 2. ) .</p><p>To show this, we use Theorem 1 by expressing h = f (g) for</p><p>where &#934; is the CDF of g. Note that f is nondecreasing. Denote &#120574; = &#934; -1 (p). A direct computation shows that</p><p>) .</p><p>To compute these probabilities, we write</p><p>On the other hand,</p><p>From these and using a 2 p + b 2 (1p) = 1, we arrive at</p><p>Thus, we can find a constant &#119889; &gt; 0 such that if t is sufficiently close to 1,</p><p>Theorem 1 then implies that for some constants C, c &gt; 0 and for N large,</p><p>and our claim follows by noting that</p><p>Proof Sketch. Our proof is based on Chatterjee's interpolation argument (see <ref type="bibr">[10,</ref><ref type="bibr">11]</ref>) in proving superconcentration for the free energy in the SK model with Gaussian disorder h = g. The argument starts by noting that F N (&#120573;) is a function of i.i.d. standard Gaussian g = (g ij ) 1&#8804;i,j&#8804;N and writing Var(F N (&#120573;)) = &#120601;(1) -&#120601;(0), where for independent copies g 1 and g 2 of g,</p><p>and F = F N (&#120573;). The first key step uses Gaussian integration by parts inductively to show that &#120595;(a) = &#120601; &#8242; (e -a ) for a &#8805; 0 is a completely monotone function, and from the Bernstein theorem, this function can be represented as &#120595;(a) = &#8747; [0,&#8734;) e -as &#120583;(&#119889;s) for some positive measure &#120583;. Consequently, from H&#246;lder's inequality, for all 0 &lt; s &#8804; t &lt; 1,</p><p>In the second step, one relates &#120601; &#8242; (t) to the second moment of the cross overlap associated to the interpolated spin system. In particular, by employing the so-called Latala argument (see <ref type="bibr">[11]</ref>, Lemma 10.4), it can be shown that &#120601; &#8242; (s) = O(1) as long as s is small enough, where the fact that g is symmetric was heavily used. On the other hand, the above inequality makes it possible to get &#120601; &#8242; (t) = O(N 1-log t&#8725; log s ) whenever t &#8805; s. With these, one readily obtains the desired bound O(N&#8725; log N) utilizing the relation Var(F N (&#120573;)) = &#8747; 1 0 &#120601; &#8242; (t) &#119889;t. In our argument, we adapt the following interpolation</p><p>Now the terms in &#120601; &#8242; (t) involve f &#8242; (see <ref type="bibr">(12)</ref> below). While (5) remains valid, the main difficulty arises in obtaining an useful bound for &#120601; &#8242; (s) with small s. To this end, for technical purposes, we adapt the convexity argument in <ref type="bibr">[14]</ref> by considering the coupled free energy (15) instead of using the Latala approach. Our control in some sense relies on an approximate Gaussian integration by parts argument throughout.</p><p>Universality of superconcentration in other models. Superconcentration does not only exhibit in mean-field spin glass models, but also in random growth models on the integer lattice such as first-passage percolation <ref type="bibr">[15]</ref>, directed polymers <ref type="bibr">[3]</ref>, frog model <ref type="bibr">[8]</ref>.</p><p>For first-passage percolation, after a series of work <ref type="bibr">[6,</ref><ref type="bibr">7,</ref><ref type="bibr">15]</ref>, it is shown that under a 2 + log moment assumption, the model exhibits superconcentration, and it does not depend on the distribution of the disorder. Similar results hold for many related models. In <ref type="bibr">[12]</ref>, Chatterjee shows that superconcentration holds in a certain type of "surface growth models", which includes directed last-passage percolation and directed polymers, under the assumption that the disorder is a Lipschitz function of a Gaussian random variable.</p><p>The approach to superconcentration for growth models relies on the idea in <ref type="bibr">[7]</ref>, which consists of two components: one is the L 1 -L 2 bound by Talagrand (or its variants), and the other is the translation invariance of the model. Even though the superconcentration results look very similar in both mean-field spin glasses and random growth models (the upper bounds for the variances are also of order N&#8725; log N), this approach does not seem to work in mean-field spin glass models in any obvious way, due to the fact that spin glass models and growth models are very different in nature.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>2</head><p>PROOF OF THEOREM 1</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Some auxiliary lemmas</head><p>In this subsection, we shall gather three elementary lemmas that will be used in our main controls later.</p><p>t and g 2 t be defined as in <ref type="bibr">(4)</ref>. The first lemma controls the derivative of the expectation associated to this interpolation. Although the proof has appeared in <ref type="bibr">[10]</ref>, we still provide a proof for completeness. We say that</p><p>smooth and all of its partial derivatives are of moderate growth. Define</p><p>Then for any 0 &#8804; t &#8804; 1,</p><p>and for any 0</p><p>Proof. By symmetry,</p><p>Using Gaussian integration by part yields</p><p>and this gives <ref type="bibr">(6)</ref>. As for the second assertion, note that each term in &#120601; &#8242; (t) is of the same form as that of &#120601;(t). Hence, we can apply induction to show that</p><p>Here, if we set &#120595;(a) = &#120601; &#8242; (e -a ) for 0 &#8804; a &lt; &#8734;, then &#120595; is completely monotone, that is, (-1) n &#120595; (n) (a) &#8805; 0 for all 0 &lt; a &lt; &#8734; and n &#8805; 0. From this and the Bernstein theorem (see for instance <ref type="bibr">[16]</ref>, Sec. XIII.4), one can express &#120595;(a) = &#8747; e -ax &#120583;(&#119889;x) for all a &gt; 0 by some finite positive measure &#120583; on [0, &#8734;). From the H&#246;lder inequality, for any 0 &lt; a &lt; b,</p><p>and this is equivalent to <ref type="bibr">(7)</ref>. &#9642; Lemma 2. Assume that Y, X 1 , X 2 are random variables with finite second moment and</p><p>From this and EY = 0, putting x = X, multiplying by Y and taking expectation on both sides complete our proof. &#9642; Lemma 3. Let &#120595; be a differentiable function on R and be of moderate growth. Then</p><p>Proof.</p><p>is continuous, &#915; is closed and we can write &#915; c as a disjoint union of open intervals &#8899; l&#8712;I (a l , b l ), where I &#8838; N is some index set. Here, on each (a l , b l ), &#120595; takes a fixed sign, and on {a 1 , b 1 , a 2 , b 2 , &#8230; }, &#120595; = 0. From this, we can rewrite</p><p>for some sequence {w l } l&#8712;I with w l = 1 or -1. Now, we compute directly</p><p>From ( <ref type="formula">9</ref>), the assertion follows. &#9642;</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Main controls</head><p>Throughout this entire subsection, we assume that f satisfies the following:</p><p>(A) &#8758; f is nondecreasing and smooth with Ef (g) = 0, E f (g) 2 = 1, and E|f (g)| 3 &lt; &#8734;, and its derivatives of all orders are of moderate growth.</p><p>Let h = f (g). Let g = (g ij ) 1&#8804;i,j&#8804;N , g 1 = (g 1 ij ) 1&#8804;i,j&#8804;N , and g 2 = (g 2 ij ) 1&#8804;i,j&#8804;N be i.i.d. standard Gaussian. For any 0 &#8804; t &#8804; 1, set</p><p>Define</p><p>where for &#120001; = 1, 2,</p><p>and g &#120001; t,ij is the (i, j)-th entry of g &#120001; t . From <ref type="bibr">(6)</ref>,</p><p>Here, &#10216;&#8901;&#10217; t is the Gibbs expectation associated to the following Gibbs measure</p><p>Recall that w(t) = Ef (g 1 t )f (g 2 t ). From Lemma 1, we also have w</p><p>Under the assumption (A), we have for 0 &lt; t &lt; 1 that</p><p>where</p><p>]</p><p>] ,</p><p>where the inequality used Lemma 3 for</p><p>] and the last equality used Gaussian integration by parts with respect to g 2 . From the H&#246;lder inequality and independence, it follows that</p><p>The same bound is also valid for E|X 2 |U. From these, putting Y = U -EU, and using the fact that f &#8242; &#8805; 0, we have</p><p>Using Gaussian integration by parts and the Cauchy-Schwarz inequality,</p><p>It follows that max</p><p>From these and Lemma 2, for X &#8758;= (X 1 , X 2 ) and twice differentiable L with max &#120001;=1,2 ||&#120597; x &#120001; L|| &#8734; &#8804; &#120574;, we arrive at</p><p>Now for fixed i, j, conditionally on g 1 t and g 2 t except g 1 t,ij and g 2 t,ij , we express &#10216;&#120590; i &#120590; j &#120591; i &#120591; j &#10217; t as L(X) in distribution. A direct computation gives</p><p>where (&#120590; 1 , &#120591; 1 ) and (&#120590; 2 , &#120591; 2 ) are i.i.d. samples from the Gibbs measure associated to &#10216;&#8901;&#10217; t . From these, max &#120001;=1,2 ||&#120597; x &#120001; L|| &#8734; &#8804; 2&#120573;N -1&#8725;2 . Consequently, from ( <ref type="formula">14</ref>) and using conditional expectation,</p><p>Summing these up over all i, j completes our proof. &#9642; Lemma 5. Assume that (A) holds. There exists a constant K depending only on &#120573; such that whenever t &#8712; [0, 1) satisfies</p><p>we have</p><p>2024, 2, Downloaded from <ref type="url">https://onlinelibrary.wiley.com/doi/10.1002/rsa.21183</ref> by UNIVERSITY OF MINNESOTA 170 WILSON LIBRARY, Wiley Online Library on [01/07/2024]. See the Terms and Conditions (<ref type="url">https://onlinelibrary.wiley.com/terms-and-conditions</ref>) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License where R 1</p><p>Proof. For t &#8712; [0, 1] and &#120582; &#8805; 0, consider</p><p>Denote by &#10216;&#8901;&#10217; t,&#120582; the Gibbs average with respect to the i.i.d. samples (&#120590; &#120001; , &#120591; &#120001; ) &#120001;&#8805;1 from the Gibbs measure associated to this free energy Q. A direct differentiation and Gaussian integration by parts yield that</p><p>In the same manner as the proof of Lemma 4, if we let</p><p>Therefore, from ( <ref type="formula">14</ref>), for D(t) &#8758;= 16&#120573; 3 R 0 (t), we have</p><p>From this, whenever 0 &#8804; w(t) &#8804; &#120582;,</p><p>which implies that</p><p>Let &#120582; 0 &gt; 0 such that 2&#120573; 2 &#120582; 0 &lt; 1. For any t &#8805; 0 satisfying 2&#120573; 2 (&#120582; 0 + w(t)) &lt; 1, if we plug &#120582; = &#120582; 0 + w(t) into the above inequality, then</p><p>Finally, since</p><p>where the third inequality used ( <ref type="formula">16</ref>) and the fact that Q(t, 0) = Q(0, 0). Here, Q(t, 0) = Q(0, 0) holds because, recalling the definition Z &#120001; t from <ref type="bibr">(11)</ref>,</p><p>In order to bound the right-hand side of the last inequality, our next step is to show that we can essentially replace f (g 1 ij ) and f</p><p>in Q(0, &#8901;) by i.i.d. standard normal random variables by using approximate Gaussian integration by parts. Let</p><p>1&#8804;i,j&#8804;N and</p><p>) . This is essentially the same as Q(0, &#120582;) with the replacement of</p><p>and</p><p>. For any 0 &#8804; s &#8804; 1, set the interpolated free energy</p><p>) .</p><p>Similar to the Gibbs expectation &#10216;&#8901;&#10217; t,&#120582; , we let &#10216;&#8901;&#10217; &#8242; s be the Gibbs expectation with respect to the i.i.d. (&#120590; &#120001; , &#120591; &#120001; ) &#120001;&#8805;1 sampled from the Gibbs measure associated to the free energy &#120588;(s, &#120582;). It follows that</p><p>Here, the second term can be computed by the usual Gaussian integration by parts,</p><p>As for the first term, note that Eh = 0, Eh 2 = 1, and E|h| 3 &lt; &#8734;, we can use approximate Gaussian integration by parts (see, e.g., <ref type="bibr">[5]</ref>, Lemma 2.2) to obtain</p><p>Consequently, we arrive at</p><p>This together with <ref type="bibr">(17)</ref> implies that</p><p>) .</p><p>In the last step, note that</p><p>Observe that due to the symmetry of (z 1 ij ) and (z 2 ij ), under the expectation E&#10216;&#8901;&#10217; &#8242; 0 , NR(&#120590;, &#120591;)</p><p>we have</p><p>where recalling <ref type="bibr">(13)</ref>, the last inequality used the bound</p><p>Recalling that</p><p>and taking &#120582; 0 = 1&#8725;(4&#120573; 2 ), whenever t satisfies</p><p>we have</p><p>This completes our proof. &#9642;</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Proof of Theorem 1</head><p>Smooth case: First, we show that Theorem 1 holds under the assumption (A). Recall &#120601; from <ref type="bibr">(10)</ref>.</p><p>It suffices to bound &#120601;(1) -&#120601;(0). Denote &#120578;(t) = E&#10216;R(&#120590;, &#120591;) 2 &#10217; t . For 0 &lt; r &lt; 1, write</p><p>By Lemma 4 and integration by parts, the first term is bounded above by</p><p>where we dropped w(0)&#120578;(0) and &#8747; r 0 w(t)&#120578; &#8242; (t) &#119889;t since they are both nonnegative, which follow by noting that the summand of the Gibbs expectation in &#120578;(t) is of the form F(g 1 t )F(g 2 t ) for some bounded smooth function F.</p><p>On the other hand, observe that &#120601; &#8242; (t) &#8804; &#120573; 2 Nw &#8242; (t). It follows that</p><p>10982418, 2024, 2, Downloaded from <ref type="url">https://onlinelibrary.wiley.com/doi/10.1002/rsa.21183</ref> by UNIVERSITY OF MINNESOTA 170 WILSON LIBRARY, Wiley Online Library on [01/07/2024]. See the Terms and Conditions (<ref type="url">https://onlinelibrary.wiley.com/terms-and-conditions</ref>) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License</p><p>Combining these yields that</p><p>) .</p><p>Next, from Lemma 5, we fix 0 &lt; s &lt; 1 such that</p><p>.</p><p>By using <ref type="bibr">(7)</ref>, for any r satisfying s &#8804; r &lt; 1,</p><p>Consequently, there exists some K &#8242; &gt; 0 depending only on &#120573; such that for any r satisfying s &#8804; r &lt; 1,</p><p>To control the right-hand side, let N &#8805; 2 and take r = (log N)</p><p>Note that s &#8804; r &#8804; 1 and that if a = 1r, then 1 -= (log N) 2 log s&#8725; log N = (log N) -2 log(s -1 )&#8725; log N . Using the bound 1cx &#8804; (1x) c for all x &#8712; [0, 1] and c &#8805; 1 implies that</p><p>and thus, there exists some C depending only on s such that</p><p>On the other hand, from our choice of r,</p><p>Putting ( <ref type="formula">21</ref>) and ( <ref type="formula">22</ref>) back into <ref type="bibr">(20)</ref> yields that</p><p>where K &#8242;&#8242; is a constant depending only on &#120573;. This proves Theorem 1 under assumption (A). General case: Assume that h satisfies Eh = 0, Eh 2 = 1 and E|h| 3 &lt; &#8734; and h can be written as h = f (g) for some nondecreasing f , where g is a standard normal random variable. For any integer n &#8805; 1, set f n (x) = max(min(f (x), n), -n). Let h n = f n (g) for</p><p>Note that Eh n = 0 and Eh 2 n = 1. Also, we have |f n (g)| &#8804; |f (g)| for all n, and hence by the dominated convergence theorem, E|f n (g)-f (g)| 3 &#8594; 0 as n &#8594; &#8734;. Thus, E|h n -h| 3 &#8594; 0 and Ef n (g 1 t )f n (g 2 t ) &#8594; w(t) as n &#8594; &#8734;. From these, if we can show that h n enjoys the inequality in Theorem 1, then so does h. To this end, for any fixed since f n is bounded and nondecreasing, we can construct a sequence of smooth and nondecreasing functions (f n,k ) k&#8805;1 of moderate growth (for instance, take</p><p>satisfies the upper bound in Theorem 1 for any k &#8805; 1, we can pass to the limit k &#8594; &#8734; to obtain the same bound for h n , completing our proof.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">PROOF OF THEOREM 2</head><p>Smooth case: Assume that f satisfies the extra assumption that f is smooth and its derivatives of all orders are of moderate growth. Recall &#120601;(t) from <ref type="bibr">(10)</ref>. Note that in the proof of Lemma 4, we can bound</p><p>by the H&#246;lder inequality. Thus, the statement of Lemma 4 is valid with the replacement of R 0 (t) by the constant</p><p>that is,</p><p>where we recall that &#120578;(t) = E&#10216;R(&#120590;, &#120591;) 2 &#10217; t . From this bound, it can also be checked directly that <ref type="bibr">(18)</ref> holds with D(t) being replaced by D 0 = 16&#120573; 3 C 0 . Moreover, as long as 2&#120573; 2</p><p>we have</p><p>Letting &#120582; 0 = 1&#8725;(4&#120573; 2 ), this inequality then implies that whenever</p><p>we have</p><p>Now, by using ( <ref type="formula">23</ref>), for any 0 &lt; s &lt; 1,</p><p>where the last inequality used monotonicity of &#120578;. Here, we can select and fix s satisfying that 4&#120573; 2 log (1s) -1 &lt; 1 so that we can apply (24) to bound &#120578;(s) &#8804; C 1 (s)N -1&#8725;2 . In a similar manner as that of <ref type="bibr">(19)</ref>, we can bound that for any s &#8804; r &#8804; 1,</p><p>where the second inequality used the bound that for any x &gt; 1,</p><p>Putting these together, we arrive at</p><p>for some universal constant K depending only on &#120573;. Note that the following Gaussian-Poincar&#233; inequality holds (see, e.g., <ref type="bibr">[20]</ref>, Eq. (2.5)),</p><p>where C is a universal constant independent of f . We can bound each E|h| 3 in our main control above by E|f &#8242; (g)| 3 . Together with the trivial bound Ef &#8242; (g) 2 &#8804; 1 + E|f &#8242; (g)| 3 , we obtain the desired inequality (3).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>General case:</head><p>We continue to handle the general case in Theorem 2. First of all, we argue that without loss of generality, we can assume that f is uniformly bounded on R. Indeed, consider the absolutely continuous function f M = max(-M, min(M, f )) for M &#8805; 1. We see that |f In addition, because f n is compactly supported, |f n | 3 is integrable on R with respect to the Lebesgue measure. Since a.e.</p><p>we also have</p><p>With these, for any n &#8805; 1, there exists a sequence of smooth functions (&#120601; n,k ) k&#8805;1 with compact support such that &#120601; n,k &#8594; f n and &#120601; &#8242; n,k &#8594; f &#8242; n as k &#8594; &#8734; under the L 3 -norm with respect to the Lebesgue measure on R (see, for instance, <ref type="bibr">[1]</ref>, <ref type="bibr">Corollary 3.23)</ref>. This readily implies that  Here, the first limit readily implies that the variance of the free energy associated to f n,k (g) converges to that associated to h in the limit k &#8594; &#8734; and then n &#8594; &#8734;, while the second limit leads to Since h n,k &#8758;= f n,k (g) satisfies all the assumptions in Theorem 2 and the derivatives of f n,k of all orders are of moderate growth by the compact supportiveness of &#120601; n,k , from the smooth case above, the inequality (3) holds for h n,k , from which sending the limit in the order k &#8594; &#8734; and then n &#8594; &#8734; completes our proof.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>For two nonnegative sequences(a N ) N&#8805;1 and (b N ) N&#8805;1 , denote by a N = &#920;(b N ) if there exist constants c, C &gt; 0 such that cb N &#8804; a N &#8804; Cb N for all N &#8805; 1.10982418,</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1"><p>2024, 2, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/rsa.21183 by UNIVERSITY OF MINNESOTA 170 WILSON LIBRARY, Wiley Online Library on [01/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_2"><p>10982418, 2024, 2, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/rsa.21183 by UNIVERSITY OF MINNESOTA 170 WILSON LIBRARY, Wiley Online Library on [01/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License</p></note>
		</body>
		</text>
</TEI>
