skip to main content


Title: Maximal Couplings of the Metropolis-Hastings Algorithm
Couplings play a central role in the analysis of Markov chain Monte Carlo algorithms and appear increasingly often in the algorithms themselves, e.g. in convergence diagnostics, parallelization, and variance reduction techniques. Existing couplings of the Metropolis-Hastings algorithm handle the proposal and acceptance steps separately and fall short of the upper bound on one-step meeting probabilities given by the coupling inequality. This paper introduces maximal couplings which achieve this bound while retaining the practical advantages of current methods. We consider the properties of these couplings and examine their behavior on a selection of numerical examples.  more » « less
Award ID(s):
1844695
NSF-PAR ID:
10225158
Author(s) / Creator(s):
; ;
Editor(s):
Banerjee, Arindam; Fukumizu, Kenji
Date Published:
Journal Name:
The 24th International Conference on Artificial Intelligence and Statistics
Volume:
130
Page Range / eLocation ID:
1225-1233
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Markov chain Monte Carlo (MCMC) methods generate samples that are asymptotically distributed from a target distribution of interest as the number of iterations goes to infinity. Various theoretical results provide upper bounds on the distance between the target and marginal distribution after a fixed number of iterations. These upper bounds are on a case by case basis and typically involve intractable quantities, which limits their use for practitioners. We introduce L-lag couplings to generate computable, non-asymptotic upper bound estimates for the total variation or the Wasserstein distance of general Markov chains. We apply L-lag couplings to the tasks of (i) determining MCMC burn-in, (ii) comparing different MCMC algorithms with the same target, and (iii) comparing exact and approximate MCMC. Lastly, we (iv) assess the bias of sequential Monte Carlo and self-normalized importance samplers. 
    more » « less
  2. null (Ed.)
    Abstract Semiconductor quantum-dot spin qubits are a promising platform for quantum computation, because they are scalable and possess long coherence times. In order to realize this full potential, however, high-fidelity information transfer mechanisms are required for quantum error correction and efficient algorithms. Here, we present evidence of adiabatic quantum-state transfer in a chain of semiconductor quantum-dot electron spins. By adiabatically modifying exchange couplings, we transfer single- and two-spin states between distant electrons in less than 127 ns. We also show that this method can be cascaded for spin-state transfer in long spin chains. Based on simulations, we estimate that the probability to correctly transfer single-spin eigenstates and two-spin singlet states can exceed 0.95 for the experimental parameters studied here. In the future, state and process tomography will be required to verify the transfer of arbitrary single qubit states with a fidelity exceeding the classical bound. Adiabatic quantum-state transfer is robust to noise and pulse-timing errors. This method will be useful for initialization, state distribution, and readout in large spin-qubit arrays for gate-based quantum computing. It also opens up the possibility of universal adiabatic quantum computing in semiconductor quantum-dot spin qubits. 
    more » « less
  3. Abstract The formation of clusters at sub-saturation densities, as a result of many-body correlations, constitutes an essential feature for a reliable modelization of the nuclear matter equation of state (EoS). Phenomenological models that make use of energy density functionals (EDFs) offer a convenient approach to account for the presence of these bound states of nucleons when introduced as additional degrees of freedom. However, in these models clusters dissolve, by construction, when the nuclear saturation density is approached from below, revealing inconsistencies with recent findings that evidence the existence of short-range correlations (SRCs) even at larger densities. The idea of this work is to incorporate SRCs in established models for the EoS, in light of the importance of these features for the description of heavy-ion collisions, nuclear structure and in the astrophysical context. Our aim is to describe SRCs at supra-saturation densities by using effective quasi-clusters immersed in dense matter as a surrogate for correlations, in a regime where cluster dissolution is usually predicted in phenomenological models. Within the EDF framework, we explore a novel approach to embed SRCs within a relativistic mean-field model with density dependent couplings through the introduction of suitable in-medium modifications of the cluster properties, in particular their binding energy shifts, which are responsible for describing the cluster dissolution. As a first exploratory step, the example of a quasi-deuteron within the generalized relativistic density functional approach is investigated. The zero temperature case is examined, where the deuteron fraction is given by the density of a boson condensate. For the first time, suitable parameterizations of the cluster mass shift at zero temperature are derived for all baryon densities. They are constrained by experimental results for the effective deuteron fraction in nuclear matter near saturation and by microscopic many-body calculations in the low-density limit. A proper description of well-constrained nuclear matter quantities at saturation is kept through a refit of the nucleon meson coupling strengths. The proposed parameterizations allow to also determine the density dependence of the quasi-deuteron mass fraction at arbitrary isospin asymmetries. The strength of the deuteron-meson couplings is assessed to be of crucial importance. Novel effects on some thermodynamic quantities, such as the matter incompressibility, the symmetry energy and its slope, are finally discerned and discussed. The findings of the present study represent a first step to improve the description of nuclear matter and its EoS at supra-saturation densities in EDFs by considering correlations in an effective way. In a next step, the single-particle momentum distributions in nuclear matter can be explored using proper wave functions of the quasi-deuteron in the medium. The momentum distributions are expected to exhibit a high-momentum tail, as observed in the experimental study of SRCs by nucleon knockout with high-energy electrons. This will be studied in a forthcoming publication with an extensive presentation of the theoretical method and the results. 
    more » « less
  4. We consider the double photoionization of the 2p2 valence electrons of atomic carbon, which provides for distinct final-state symmetries depending on the three possible angular momentum couplings (3P,1D, and 1S) of the initially-bound p2 electrons that are ejected into the continuum by a single photon. Comparison of this process with neon provides an analogous case for the resulting final states within the treatment of the double photoionization proceeding with the ejected electrons influenced by the remaining bound electrons. 
    more » « less
  5. Abstract

    This paper aims to develop distributed algorithms for nonconvex optimization problems with complicated constraints associated with a network. The network can be a physical one, such as an electric power network, where the constraints are nonlinear power flow equations, or an abstract one that represents constraint couplings between decision variables of different agents. Despite the recent development of distributed algorithms for nonconvex programs, highly complicated constraints still pose a significant challenge in theory and practice. We first identify some difficulties with the existing algorithms based on the alternating direction method of multipliers (ADMM) for dealing with such problems. We then propose a reformulation that enables us to design a two-level algorithm, which embeds a specially structured three-block ADMM at the inner level in an augmented Lagrangian method framework. Furthermore, we prove the global and local convergence as well as iteration complexity of this new scheme for general nonconvex constrained programs, and show that our analysis can be extended to handle more complicated multi-block inner-level problems. Finally, we demonstrate with computation that the new scheme provides convergent and parallelizable algorithms for various nonconvex applications, and is able to complement the performance of the state-of-the-art distributed algorithms in practice by achieving either faster convergence in optimality gap or in feasibility or both.

     
    more » « less