<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Making trotters sprint: A variational imaginary time ansatz for quantum many-body systems</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>09/01/2019</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10147505</idno>
					<idno type="doi">10.1103/PhysRevB.100.094434</idno>
					<title level='j'>Physical Review B</title>
<idno>2469-9950</idno>
<biblScope unit="volume">100</biblScope>
<biblScope unit="issue">9</biblScope>					

					<author>Matthew J. Beach</author><author>Roger G. Melko</author><author>Tarun Grover</author><author>Timothy H. Hsieh</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[We introduce a variational wave function for many-body ground states that involves imaginary-time evolution with two different Hamiltonians in an alternating fashion with variable time intervals. We successfully apply the ansatz on the one-and two-dimensional transverse-field Ising model and systematically study its scaling for the one-dimensional model at criticality. We find the total imaginary time required scales logarithmically with system size, in contrast to the linear scaling in conventional quantum Monte Carlo. We suggest this is due to unique dynamics permitted by alternating imaginary-time evolution, including exponential growth of bipartite entanglement. For generic models, the superior scaling of our ansatz potentially mitigates the negative sign problem at the expense of having to optimize variational parameters.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Imaginary time plays a prominent role in multiple branches of physics, including cosmology, statistical mechanics, and quantum field theory. The seemingly simple replacement of real time, t, with its imaginary counterpart, &#964; = -it, leads to fundamental connections between quantum theory and statistical mechanics <ref type="bibr">[1]</ref>. Such connections enable the efficient simulation of many quantum systems using quantum Monte Carlo techniques <ref type="bibr">[2]</ref><ref type="bibr">[3]</ref><ref type="bibr">[4]</ref>. However, for many physically interesting models, these methods suffer from the prohibitive "negative sign problem" <ref type="bibr">[5,</ref><ref type="bibr">6]</ref>, which requires an exponential amount of computational resources to obtain reasonable accuracy for quantum many-body systems. Many outstanding problems in condensed matter, such as those involving high-temperature superconductors or topologically ordered phases, require an understanding of complex interacting models which are unsolved with present techniques.</p><p>One class of Monte Carlo methods that can avoid the sign problem are so-called variational Monte Carlo (VMC) methods <ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref>. In VMC, one assumes a sufficiently general trial state that depends on adjustable parameters. These parameters are then chosen to minimize the energy with respect to the given Hamiltonian. Finding an effective trial state such as Jastrow <ref type="bibr">[11]</ref>, matrix product states <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref>, or neural network states <ref type="bibr">[15]</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref>, can result in efficient simulation of interacting quantum systems. The key to the success of these techniques is a well-chosen ansatz that reflects the properties of the target phase and the existence of a viable optimization scheme <ref type="bibr">[19,</ref><ref type="bibr">20]</ref>.</p><p>The recent advent of quantum computers and simulators has motivated the development of new variational approaches <ref type="bibr">[21]</ref>. Such variational quantum algorithms involve applying a sequence of unitary operators, parametrized by several variables onto a easy-to-prepare initial state. The variables are chosen to optimize a given cost function involving the resulting wave function. For example, in the quantum approximate optimization algorithm (QAOA) <ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">[24]</ref><ref type="bibr">[25]</ref><ref type="bibr">[26]</ref> the cost function is a classical Hamiltonian encoding a combinatorial optimization problem, and the variational wave function is prepared by alternating between evolving with the Hamiltonian and a transverse field. The evolution times constitute variational parameters that are optimized to minimize the Hamiltonian cost function. This variational approach has been generalized for preparing both strongly correlated and highly entangled states on near-term quantum devices <ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref>.</p><p>Motivated by the success of such variational approaches, in this paper we propose a variational ansatz for ground states of quantum many-body systems which involves sequentially evolving with different Hamiltonians in imaginary time. In contrast to real-time evolution with local Hamiltonians, which is limited by Lieb-Robinson bounds on the growth of correlation functions, imaginary-time evolution does not have this constraint and can exhibit remarkable efficiency in traversing Hilbert space.</p><p>As proof of concept, we demonstrate the efficiency of our ansatz in representing the ground state of the transverse-field Ising model at criticality. Whereas standard projector methods require imaginary-time scaling with system size L to reach the critical ground state, we show numerically that our ansatz requires time scaling logarithmically with L. Furthermore, we analyze how entanglement grows after each imaginary-time operation in our ansatz, and we find an exponential growth that is a unique feature of imaginary-time dynamics. We conclude by demonstrating that the ansatz continues to perform well in the presence of integrability-breaking perturbations, and we mention generalizations of our ansatz to other models, including those with sign problems. We envision the main purpose of this ansatz to be an efficient trial wave function for quantum many-body physics on (classical) computers; however, it is possible that one can also implement such imaginary-time evolution natively on a quantum computer <ref type="bibr">[30,</ref><ref type="bibr">31]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. ANSATZ</head><p>Many Hamiltonians are naturally a linear combination of two components H = H A + gH B , where H A,B are individually tractable to analyze. Examples include transverse-field Ising, Hubbard, and the J 1 -J 2 model. Motivated by the QAOA procedure, we consider the following variational imaginary-time ansatz (VITA) for the ground state of H:</p><p>where P is the number of pairs of variational parameters &#945; = (&#945; 1 , . . . , &#945; P ), &#946; = (&#946; 1 , . . . , &#946; P ), |&#968; 0 is an initial state, and N is a normalization factor. We further define the total imaginary time &#964; = 1 2 P p=1 (&#945; p + &#946; p ). A circuit representation is shown in Fig. <ref type="figure">1</ref>. The bang-bang, or square-pulse, style of the ansatz is optimal for quantum control in real-time QAOA as per Pontryagin's principle <ref type="bibr">[32]</ref>.</p><p>While VITA is applicable to any Hamiltonian, in specific cases there is explicit physical motivation for considering such an ansatz. For example, for the fermionic Hubbard model, the P = 1 ansatz reduces to Otsuka's generalization of the Gutzwiller variational wave function <ref type="bibr">[7,</ref><ref type="bibr">9,</ref><ref type="bibr">10]</ref>, which seeks to balance single occupancy per site with itinerancy. The P 3 case has been considered in Ref. <ref type="bibr">[33]</ref> for the twodimensional Hubbard model, but a systematic analysis of how its performance scales with system size and P was not carried out. A related variational approach for the Hubbard model has also been considered in Ref. <ref type="bibr">[34]</ref>.</p><p>The standard projector method for attaining the ground state of H involves evaluating e -&#964; H |&#968; 0 for &#964; 1/ where is the many-body spectral gap. This can be decomposed via Trotterization into a sequence of the VITA form, with parameters &#945; p = &#946; p = &#964; /2P for large P. This guarantees that VITA can exactly represent the ground state in the P &#8594; &#8734; limit. However, the projector method is especially expensive for critical systems where &#8764; 1/L, and hence &#964; scales polynomially with L. One can consider Eq. ( <ref type="formula">1</ref>) as a nonuniform Trotterization with large (and variable) time steps. We will show that remarkably high fidelities can be attained even with &#964; that is exponentially smaller compared to the aforementioned estimate from the standard projector method.</p><p>We first present some general considerations of why such an ansatz may be efficient. It is useful to first compare with</p><p>1. Circuit representation of the trial state |&#968; P (&#945;, &#946;) for P = 1. Each box denotes imaginary-time evolution with the enclosed Hamiltonian. the real-time analog, which are QAOA-type circuits involving alternating real-time evolution between two Hamiltonians. The Lieb-Robinson bound dictates that real-time evolution with local Hamiltonians can generate correlations only within a light cone, and thus there are lower bounds on the time it takes to prepare highly correlated states starting from unentangled product states. For example, in one dimension, the total time to prepare the Greenberger-Horne-Zeilinger (GHZ; "cat") state 1</p><p>scales at least linearly with system size L <ref type="bibr">[28,</ref><ref type="bibr">35]</ref>. In contrast, by evolving with the GHZ parent Hamiltonian H ZZ = -L i=1 Z i Z i+1 in imaginary time (Z is the Pauli-Z matrix), the GHZ state can be prepared with imaginary-time scaling as log L <ref type="bibr">[36]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. APPLICATION</head><p>We test VITA on the transverse-field Ising model (TFIM)</p><p>with periodic boundary conditions on a system with N spins. Z, X are the Pauli matrices, and h is the transverse-field strength. Our ansatz in this case starts from the paramagnetic ground state of H X , |+ , and alternates between</p><p>The Ising chain can be mapped to free fermions via the Jordan-Wigner transformation <ref type="bibr">[37]</ref> which allows for the efficient evaluation of Eq. ( <ref type="formula">1</ref>). We can thus optimize our ansatz for very large system sizes and several pulses. This allows us to properly characterize how efficient the VITA ansatz is without introducing sampling error. For a fixed P, optimization involves finding the minima of the energy cost function E P (&#945;, &#946;) = &#968; P (&#945;, &#946;)|H |&#968; P (&#945;, &#946;) .</p><p>We first focus on approximating the critical ground state of H at h = 1. Figure <ref type="figure">2</ref>(a) shows the relative error in energy rel = |(E P (&#945;, &#946;) -E exact )/E exact | where E exact is the exact ground state energy at the critical point, for various P. Evidently, increasing P dramatically improves the accuracy in the energy, even for large system sizes. Optimized parameters for various P are provided in the Supplemental Material <ref type="bibr">[38]</ref>. Since the exact ground state for the TFIM is known, we also compare the fidelity, f , of the optimized trial state with the target state. The error in fidelity, 1</p><p>is shown in Fig. <ref type="figure">2</ref>(b) for various P, L. The efficiency is quite remarkable; for example, for L = 64, P = 2 is already sufficient to approximate the critical state to within around 10 -4 in relative energy and 10 -2 in fidelity. Recall that the error in fidelity provides an upper bound for the error in any observable <ref type="bibr">[38]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. ENTANGLEMENT DYNAMICS</head><p>While there is no Lieb-Robinson bound limiting the rate for generating long-range correlations in our ansatz, entanglement considerations provide lower bounds on the iterations P required to prepare the critical state. Ignoring normalization, imaginary-time evolution with a local Hamiltonian can be represented by a (nonunitary) quantum circuit; each iteration of our ansatz corresponds to three layers shown in Fig. <ref type="figure">1</ref>. After P pulses, the bipartite entanglement entropy between the left and right halves of the system, hereafter abbreviated EE, can be attained by bisecting the circuit through P bonds. Hence, EE after P iterations is at most P log D, where D is the Schmidt rank (number of singular values) upon decomposing a single two-qubit imaginary-time operator.</p><p>In order to generate the EE of the critical state, which scales as S &#8733; log L, we need the number of pulses scaling at least as P &#8733; log L. We observe that Fig. <ref type="figure">2(b</ref>) is consistent with this scaling form. For example, for a target fidelity error of 10 -10 , each additional P in the ansatz can represent a system approximately twice as large.</p><p>The entanglement dynamics in imaginary-time evolution can be considerably different from its real-time counterpart. For real-time evolution, EE across a bipartition can increase only by acting with an operator supported on both sides of the partition. If the circuit in Fig. <ref type="figure">1</ref> were unitary, any increase in EE from one layer to the next would be bounded by a constant depending on the two-qubit unitary but not on the state being acted on (see the "small incremental entangling theorem" <ref type="bibr">[39]</ref>). In contrast, even imaginary-time operators acting on one side of the bipartition can generate entanglement across the cut. As a very simple example involving two spins, the action of e -&#946;Z 1 on &#951; + |11 + &#951; -|00 can increase EE as long as |&#951; + | &gt; |&#951; -| &gt; 0. This illustrates that the more entangled the initial state, the more imaginary-time evolution can change the entanglement. The change is not simply bounded by a state-independent constant. This allows in principle an exponential growth of EE, as long as the total EE after P steps lies below P log D.</p><p>Our ansatz exhibits such dynamics. We take the P = 5 ansatz and analyze the EE of the states at intermediate steps, p, of our protocol using the technique of <ref type="bibr">[40,</ref><ref type="bibr">41]</ref>. We find that the EE increases exponentially with imaginary time [Fig. <ref type="figure">3(a)</ref>]. Moreover, for every intermediate state, we plot the mutual information (S A + S B -S AB ) between two spins A, B as a function of their separation [Fig. <ref type="figure">3(b)</ref>]. The power-law decay for every step is in stark contrast to any local realtime evolution and illustrates the ability of imaginary-time evolution to generate long-range correlations <ref type="bibr">[42]</ref>. We find that under imaginary-time evolution, entanglement starts to grow immediately following a local quench, in contrast to real-time evolution <ref type="bibr">[43]</ref> where it takes a time proportional to &#8467; before growing, &#8467; being the distance between the location of the local quench and the entanglement cut.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. SCALING</head><p>To compare directly with the projector method, we also investigate the total imaginary time &#964; &#8801; 1 2 P p=1 (&#945; p + &#946; p ) required to achieve a target fidelity. Motivated by the P &#8733; log L scaling for achieving a target fidelity, we propose a scaling form of 1f = G[&#964; (log L) -&#957; ] for some exponent &#957;. In Fig. <ref type="figure">4</ref>, we perform a scaling collapse for L &#8712; [4, 6, . . . , 1024], and P &#8712; [1, . . . , 7] and find the optimal exponent &#957; = 2.3 &#177; 0.1. This logarithmic scaling is an exponential speedup compared to the linear scaling of the projector method, 1f = F (&#964; L -1 ) [44].</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>VI. MONTE CARLO APPROACH</head><p>While the TFIM model admits a dual representation as free fermions, for a general model sampling methods are crucial for estimating the energy cost function. As a proof of concept, we also use Monte Carlo sampling for stochastically optimizing VITA.</p><p>The quantum-classical correspondence maps quantum observables to dual classical observables of a classical anisotropic Ising model on an L &#215; (2P + 1) lattice. We denote the classical spin configurations by {s} and the spatial and imaginary time by (i, p), respectively 0 0.5 1 1.5 2 h 10 -6 10 -5 10 -4 10 -3 10 -2 rel (a) L = 64 P = 1 P = 2 2.5 3 3.5 4 h (b) N = 10 &#215; 10 FIG. 5. Relative error in energy, rel , for VMC using our ansatz on the TFIM: (a) one-dimensional model with L = 64 spins. Solid lines denote the results from the free fermion approach. (b) Twodimensional model on a 10 &#215; 10 square lattice. Energies are compared with those from zero-temperature stochastic series expansion [47].</p><p>The expectation value of a quantum observable O is</p><p>where &#213; are dual classical observables, and p &#945;,&#946; (s) is the Boltzmann weight corresponding to Ising model with couplings J x (p) = &#945; p , J &#964; (p) = 1 2 ln coth &#946; p between nearest neighbors in space and imaginary time, respectively. The couplings vary with imaginary time, p, but are uniform in space. In this way, the energy of the trial wave function can be sampled efficiently with Monte Carlo.</p><p>We use this scheme with P = 1, 2 to target the ground states for various values of the transverse field h in both the (integrable) one-dimensional and the (nonintegrable) twodimensional TFIM. We use stochastic natural gradient descent (stochastic reconfiguration) to optimize the parameters <ref type="bibr">[20]</ref>. We find rapid convergence for P = 1, while higher P becomes more difficult, especially for with a noisy objective function.</p><p>The relative error in energy achieved is shown in Fig. <ref type="figure">5</ref>, with the free fermion results for comparison. For P = 1 the VMC achieves the same accuracy as the free fermion method. However, for P = 2 away from the critical point h = 1, the VMC performance is limited by sampling error <ref type="bibr">[46]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>VII. DISCUSSION</head><p>We have introduced a variational technique that is motivated by both projector methods and recently developed quantum algorithms. It provides substantial shortcuts to the usual Trotterization of imaginary-time evolution, at the expense of making the procedure variational. Using TFIM as a first testbed, we have demonstrated that this ansatz is viable for sampling methods and highly efficient. In particular, the number of variational parameters required to represent the TFIM critical state scales as &#8764;log L, in contrast to other variational methods such as density matrix renormalization group, which in this critical case requires bond dimension scaling with L and thus the number of parameters scaling with L 2 . One reason for this efficiency is the fact that imaginarytime evolution is not subject to many bounds for real-time evolution; despite being generated by local Hamiltonians, our ansatz exhibits an exponential growth of entanglement entropy and rapid generation of long-range correlations, features unique to imaginary-time evolution.</p><p>Our variational approach is potentially useful in the many situations where imaginary-time Trotterization involves a prohibitively large number of steps. For example, in models with a sign problem, the computational cost scales exponentially with space and imaginary time O(&#964; L d ). Our ansatz provides a variational shortcut that significantly reduces &#964; [from &#964; &#8764; L to &#964; &#8764; (log L) 2.3 in the critical one-dimensional TFIM] which could enable the study of larger systems even with a sign problem. For few variational parameters (P = 1, 2), the optimization may be feasible, and we leave these investigations to future work.</p></div></body>
		</text>
</TEI>
