Learning maps between data samples is fundamental. Applications range from representation learning, image translation and generative modeling, to the estimation of spatial deformations. Such maps relate feature vectors, or map between feature spaces. Well-behaved maps should be regular, which can be imposed explicitly or may emanate from the data itself. We explore what induces regularity for spatial transformations, e.g., when computing image registrations. Classical optimization-based models compute maps between pairs of samples and rely on an appropriate regularizer for well-posedness. Recent deep learning approaches have attempted to avoid using such regularizers altogether by relying on the sample population instead. We explore if it is possible to obtain spatial regularity using an inverse consistency loss only and elucidate what explains map regularity in such a context. We find that deep networks combined with an inverse consistency loss and randomized off-grid interpolation yield well behaved, approximately diffeomorphic, spatial transformations. Despite the simplicity of this approach, our experiments present compelling evidence, on both synthetic and real data, that regular maps can be obtained without carefully tuned explicit regularizers, while achieving competitive registration performance.
more »
« less
This content will become publicly available on December 1, 2025
Correlational Lagrangian Schrödinger Bridge: Learning Dynamics with Population-Level Regularization
Modeling population dynamics is a fundamental problem with broad scientific applications. Motivated by real-world applications including biosystems with diverse populations, we consider a class of population dynamics modeling with two technical challenges: (i) dynamics to learn for individual particles are heterogeneous and (ii) available data to learn from are not time-series (i.e, each individual’s state trajectory over time) but cross-sectional (i.e, the whole population’s aggregated states without individuals matched over time). To address the challenges, we introduce a novel computational framework dubbed correlational Lagrangian Schrödinger bridge (CLSB) that builds on optimal transport to “bridge" cross-sectional data distributions. In contrast to prior methods regularizing all individuals’ transport “costs” and then applying them to the population homogeneously, CLSB directly regularizes population cost allowing for population heterogeneity and potentially improving model generalizability. Specifically our contributions include (1) a novel population perspective of the transport cost and a new class of population regularizers capturing the temporal variations in multivariate relations, with the tractable formulation derived, (2) three domain-informed instantiations of population regularizers on covariance, and (3) integration of population regularizers into data-driven generative models as constrained optimization and an approximate numerical solution, with further extension to conditional generative models. Empirically, we demonstrate the superiority of CLSB in single-cell sequencing data analyses (including cell differentiation and drug-conditioned cell responses) and opinion depolarization.
more »
« less
- Award ID(s):
- 1943008
- PAR ID:
- 10596092
- Publisher / Repository:
- NeurIPS 2024 Workshop on AI for New Drug Modalities (AIDrugX)
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
apid growth of high-dimensional datasets in fields such as single-cell RNA sequencing and spatial genomics has led to unprecedented opportunities for scientific discovery, but it also presents unique computational and statistical challenges. Traditional methods struggle with geometry-aware data generation, interpolation along meaningful trajectories, and transporting populations via feasible paths. To address these issues, we introduce Geometry-Aware Generative Autoencoder (GAGA), a novel framework that combines extensible manifold learning with generative modeling. GAGA constructs a neural network embedding space that respects the intrinsic geometries discovered by manifold learning and learns a novel warped Riemannian metric on the data space. This warped metric is derived from both the points on the data manifold and negative samples off the manifold, allowing it to characterize a meaningful geometry across the entire latent space. Using this metric, GAGA can uniformly sample points on the manifold, generate points along geodesics, and interpolate between populations across the learned manifold. GAGA shows competitive performance in simulated and real-world datasets, including a 30% improvement over SOTA in single-cell population-level trajectory inference.more » « less
-
Abstract Time-series single-cell RNA sequencing (scRNA-seq) datasets provide unprecedented opportunities to learn dynamic processes of cellular systems. Due to the destructive nature of sequencing, it remains challenging to link the scRNA-seq snapshots sampled at different time points. Here we present TIGON, a dynamic, unbalanced optimal transport algorithm that reconstructs dynamic trajectories and population growth simultaneously as well as the underlying gene regulatory network from multiple snapshots. To tackle the high-dimensional optimal transport problem, we introduce a deep learning method using a dimensionless formulation based on the Wasserstein–Fisher–Rao (WFR) distance. TIGON is evaluated on simulated data and compared with existing methods for its robustness and accuracy in predicting cell state transition and cell population growth. Using three scRNA-seq datasets, we show the importance of growth in the temporal inference, TIGON’s capability in reconstructing gene expression at unmeasured time points and its applications to temporal gene regulatory networks and cell–cell communication inference.more » « less
-
River bathymetry is needed to accurately simulate river hydrodynamics. River bathymetric data are typically collected through boat-mounted single- or multi-beam echosounder surveys. Detailed bathymetric data from multibeam surveys may exceed the requirements of standard river hydraulic models (1D and 2D). Compared to data-intensive but expensive multibeam surveys, single-beam surveys are cost-effective. Single-beam surveys can sufficiently inform river simulations when coupled with specific preprocessing and interpolation techniques. This study contrasts two survey patterns, including the commonly used but under-studied zigzag surveys, against the traditional cross-sectional surveys. Linear and anisotropic Kriging interpolations, two widely used methods, are applied to construct bathymetry mesh from different survey configurations. Results from this study highlight efficient survey configurations for both cross-sectional and zigzag patterns, balancing accuracy and cost. Notably, zigzag surveys approach the efficacy of cross-sectional surveys when spaced below a certain threshold, but Kriging interpolation shows diminished performance with sparse zigzag surveys. The findings from this study bridge gaps in previous research by offering nuanced comparisons between survey configurations and interpolations. This study offers a comparative analysis to guide more effective planning and utilization of single-beam surveys, without advocating for specific survey patterns or interpolation techniques.more » « less
-
We introduce adversarial learning methods for data-driven generative modeling of dynamics of n-th-order stochastic systems. Our approach builds on Generative Adversarial Networks (GANs) with generative model classes based on stable m-step stochastic numerical integrators. From observations of trajectory samples, we introduce methods for learning long-time predictors and stable representations of the dynamics. Our approaches use discriminators based on Maximum Mean Discrepancy (MMD), training protocols using both conditional and marginal distributions, and methods for learning dynamic responses over different time-scales. We show how our approaches can be used for modeling physical systems to learn force-laws, damping coefficients, and noise-related parameters. Our adversarial learning approaches provide methods for obtaining stable generative models for dynamic tasks including long-time prediction and developing simulations for stochastic systems.more » « less
An official website of the United States government
