NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

Shrestha, Sagar; Fu, Xiao (April 2025, ICLR)

Free, publicly-accessible full text available April 23, 2026
Identifiable Shared Component Analysis of Unpaired Multimodal Mixtures

Timilsina, Subash; Shrestha, Sagar; Fu, Xiao (December 2024, NeurIPS 2024)

Full Text Available
Translation Identifiability-Guided Unsupervised Cross-Platform Super-Resolution for OCT Images

https://doi.org/10.1109/SAM60225.2024.10636686

Song, Jiahui; Shrestha, Sagar; Li, Xueshen; Gan, Yu; Fu, Xiao (July 2024, IEEE)

Full Text Available
TOWARDS IDENTIFIABLE UNSUPERVISED DOMAIN TRANSLATION: A DIVERSIFIED DISTRIBUTION MATCHING APPROACH

Shrestha, Sagar; Fu, Xiao (May 2024, ICLR)

Full Text Available
Probabilistic Simplex Component Analysis via Variational Auto-Encoding

https://doi.org/10.1109/ICASSP48485.2024.10448368

Li, Yuening; Fu, Xiao; Ma, Wing-Kin (April 2024, IEEE)

Full Text Available
Provable Subspace Identification Under Post-Nonlinear Mixtures

Lyu, Qi; Fu, Xiao (December 2022, Advances in neural information processing systems)

Unsupervised mixture learning (UML) aims at identifying linearly or nonlinearly mixed latent components in a blind manner. UML is known to be challenging: Even learning linear mixtures requires highly nontrivial analytical tools, e.g., independent component analysis or nonnegative matrix factorization. In this work, the post-nonlinear (PNL) mixture model---where {\it unknown} element-wise nonlinear functions are imposed onto a linear mixture---is revisited. The PNL model is widely employed in different fields ranging from brain signal classification, speech separation, remote sensing, to causal discovery. To identify and remove the unknown nonlinear functions, existing works often assume different properties on the latent components (e.g., statistical independence or probability-simplex structures). This work shows that under a carefully designed UML criterion, the existence of a nontrivial {\it null space} associated with the underlying mixing system suffices to guarantee identification/removal of the unknown nonlinearity. Compared to prior works, our finding largely relaxes the conditions of attaining PNL identifiability, and thus may benefit applications where no strong structural information on the latent components is known. A finite-sample analysis is offered to characterize the performance of the proposed approach under realistic settings. To implement the proposed learning criterion, a block coordinate descent algorithm is proposed. A series of numerical experiments corroborate our theoretical claims.
more » « less
Full Text Available
On Finite-Sample Identifiability of Contrastive Learning-Based Nonlinear Independent Component Analysis

Lyu, Qi; Fu, Xiao (July 2022, Proceedings of the 39th International Conference on Machine Learning)

Nonlinear independent component analysis (nICA) aims at recovering statistically independent latent components that are mixed by unknown nonlinear functions. Central to nICA is the identifiability of the latent components, which had been elusive until very recently. Specifically, Hyvärinen et al. have shown that the nonlinearly mixed latent components are identifiable (up to often inconsequential ambiguities) under a generalized contrastive learning (GCL) formulation, given that the latent components are independent conditioned on a certain auxiliary variable. The GCL-based identifiability of nICA is elegant, and establishes interesting connections between nICA and popular unsupervised/self-supervised learning paradigms in representation learning, causal learning, and factor disentanglement. However, existing identifiability analyses of nICA all build upon an unlimited sample assumption and the use of ideal universal function learners—which creates a non-negligible gap between theory and practice. Closing the gap is a nontrivial challenge, as there is a lack of established “textbook” routine for finite sample analysis of such unsupervised problems. This work puts forth a finite-sample identifiability analysis of GCL-based nICA. Our analytical framework judiciously combines the properties of the GCL loss function, statistical generalization analysis, and numerical differentiation. Our framework also takes the learning function’s approximation error into consideration, and reveals an intuitive trade-off between the complexity and expressiveness of the employed function learner. Numerical experiments are used to validate the theorems.
more » « less
Full Text Available

Search for: All records