skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on March 18, 2026

Title: Knowledge as a Breaking of Ergodicity
Abstract We construct a thermodynamic potential that can guide training of a generative model defined on a set of binary degrees of freedom. We argue that upon reduction in description, so as to make the generative model computationally manageable, the potential develops multiple minima. This is mirrored by the emergence of multiple minima in the free energy proper of the generative model itself. The variety of training samples that employ N binary degrees of freedom is ordinarily much lower than the size 2N of the full phase space. The nonrepresented configurations, we argue, should be thought of as comprising a high-temperature phase separated by an extensive energy gap from the configurations composing the training set. Thus, training amounts to sampling a free energy surface in the form of a library of distinct bound states, each of which breaks ergodicity. The ergodicity breaking prevents escape into the near continuum of states comprising the high-temperature phase; thus, it is necessary for proper functionality. It may, however, have the side effect of limiting access to patterns that were underrepresented in the training set. At the same time, the ergodicity breaking within the library complicates both learning and retrieval. As a remedy, one may concurrently employ multiple generative models—up to one model per free energy minimum.  more » « less
Award ID(s):
1956389
PAR ID:
10579463
Author(s) / Creator(s):
;
Publisher / Repository:
MIT Press
Date Published:
Journal Name:
Neural Computation
Volume:
37
Issue:
4
ISSN:
0899-7667
Page Range / eLocation ID:
742 to 792
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The introduction of transient degrees of freedom into a system can lead to novel material design and training protocols that guide a system into a desired metastable state. In this approach, some degrees of freedom, which were not initially included in the system dynamics, are first introduced and subsequently removed from the energy minimization process once the desired state is reached. Using this conceptual framework, we create stable jammed packings that exist in exceptionally deep energy minima marked by the absence of low-frequency quasilocalized modes; this added stability persists in the thermodynamic limit. The inclusion of particle radii as transient degrees of freedom leads to deeper and much more stable minima than does the inclusion of particle stiffnesses. This is because particle radii couple to the jamming transition, whereas stiffnesses do not. Thus, different choices for the added degrees of freedom can lead to very different training outcomes. 
    more » « less
  2. Ergodicity, the central tenet of statistical mechanics, requires an isolated system to explore all available phase space constrained by energy and symmetry. Mechanisms for violating ergodicity are of interest for probing nonequilibrium matter and protecting quantum coherence in complex systems. Polyatomic molecules have long served as a platform for probing ergodicity breaking in vibrational energy transport. Here, we report the observation of rotational ergodicity breaking in an unprecedentedly large molecule,12C60, determined from its icosahedral rovibrational fine structure. The ergodicity breaking occurs well below the vibrational ergodicity threshold and exhibits multiple transitions between ergodic and nonergodic regimes with increasing angular momentum. These peculiar dynamics result from the molecule’s distinctive combination of symmetry, size, and rigidity, highlighting its relevance to emergent phenomena in mesoscopic quantum systems. 
    more » « less
  3. Abstract Deep generative models have become ubiquitous due to their ability to learn and sample from complex distributions. Despite the proliferation of various frameworks, the relationships among these models remain largely unexplored, a gap that hinders the development of a unified theory of AI learning. In this work, we address two central challenges: clarifying the connections between different deep generative models and deepening our understanding of their learning mechanisms. We focus on Restricted Boltzmann Machines (RBMs), a class of generative models known for their universal approximation capabilities for discrete distributions. By introducing a reciprocal space formulation for RBMs, we reveal a connection between these models, diffusion processes, and systems of coupled bosons. Our analysis shows that at initialization, the RBM operates at a saddle point, where the local curvature is determined by the singular values of the weight matrix, whose distribution follows the Marc̆enko-Pastur law and exhibits rotational symmetry. During training, this rotational symmetry is broken due to hierarchical learning, where different degrees of freedom progressively capture features at multiple levels of abstraction. This leads to a symmetry breaking in the energy landscape, reminiscent of Landau’s theory. This symmetry breaking in the energy landscape is characterized by the singular values and the weight matrix eigenvector matrix. We derive the corresponding free energy in a mean-field approximation. We show that in the limit of infinite size RBM, the reciprocal variables are Gaussian distributed. Our findings indicate that in this regime, there will be some modes for which the diffusion process will not converge to the Boltzmann distribution. To illustrate our results, we trained replicas of RBMs with different hidden layer sizes using the MNIST dataset. Our findings not only bridge the gap between disparate generative frameworks but also shed light on the fundamental processes underpinning learning in deep generative models. 
    more » « less
  4. Metasurfaces have been rapidly advancing our command over the many degrees of freedom of light within compact, lightweight devices. However, so far, they have mostly been limited to manipulating light in free space. Grating couplers provide the opportunity of bridging far-field optical radiation and in-plane guided waves, and thus have become fundamental building blocks in photonic integrated circuits. However, their operation and degree of light control is much more limited than metasurfaces. Metasurfaces integrated on top of guided wave photonic systems have been explored to control the scattering of light off-chip with enhanced functionalities – namely, point-by-point manipulation of amplitude, phase or polarization. However, these efforts have so far been limited to controlling one or two optical degrees of freedom at best, and to device configurations much more complex compared to conventional grating couplers. Here, we introduce leaky-wave metasurfaces, which are based on symmetry-broken photonic crystal slabs that support quasi-bound states in the continuum. This platform has a compact form factor equivalent to the one of conventional grating couplers, but it provides full command over amplitude, phase and polarization (four optical degrees of freedom) across large apertures. We present experimental demonstrations of various functionalities for operation at λ= 1.55 μm based on leaky-wave metasurfaces, including devices for phase and amplitude control at a fixed polarization state, and devices controlling all four optical degrees of freedom. Our results merge the fields of guided and free-space optics under the umbrella of metasurfaces, exploiting the hybrid nature of quasi-bound states in the continuum, for opportunities to advance in disruptive ways imaging, communications, augmented reality, quantum optics, LIDAR, and integrated photonic systems. 
    more » « less
  5. Molecular chirality is a fundamental phenomenon, underlying both life as we know it and industrial pharmaceutical syntheses. Understanding the symmetry breaking phase transitions exhibited by many chiral molecular substances provides basic insights for topics ranging from the origin of life to the rational design of drug manufacturing processes. In this work, we have performed molecular dynamics simulations to investigate the fluid–fluid phase transitions of a flexible three-dimensional four-site chiral molecular model developed by Latinwo et al. [J. Chem. Phys. 145, 154503 (2016)] and Petsev et al. [J. Chem. Phys. 155, 084105 (2021)]. By introducing a bias favoring local homochiral vs heterochiral interactions, the system exhibits a phase transition from a single achiral phase to a single chiral phase that undergoes infrequent interconversion between the two thermodynamically identical chiral states: the L-rich and D-rich phases. According to the phase rule, this reactive binary system has two independent degrees of freedom and exhibits a density-dependent critical locus. Below the liquid–liquid critical locus, there exists a first-order vapor–liquid coexistence region with a single independent degree of freedom. Our results provide basic thermodynamic and kinetic insights for understanding many-body chiral symmetry breaking phenomena. 
    more » « less