NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

Lin, Yingyu; Huang, Yuxing; Liu, Wenqin; Deng, Haoran; Ng, Ignavier; Zhang, Kun; Gong, Mingming; Ma, Yian; Huang, Biwei (April 2025, OpenReview.net)

Real-world data often violates the equal-variance assumption (homoscedasticity), making it essential to account for heteroscedastic noise in causal discovery. In this work, we explore heteroscedastic symmetric noise models (HSNMs), where the effect Y is modeled as Y = f(X) + σ(X)N, with X as the cause and N as independent noise following a symmetric distribution. We introduce a novel criterion for identifying HSNMs based on the skewness of the score (i.e., the gradient of the log density) of the data distribution. This criterion establishes a computationally tractable measurement that is zero in the causal direction but nonzero in the anticausal direction, enabling the causal direction discovery. We extend this skewness-based criterion to the multivariate setting and propose SkewScore, an algorithm that handles heteroscedastic noise without requiring the extraction of exogenous noise. We also conduct a case study on the robustness of SkewScore in a bivariate model with a latent confounder, providing theoretical insights into its performance. Empirical studies further validate the effectiveness of the proposed method.
more » « less
Free, publicly-accessible full text available April 24, 2026
Hierarchical Amortized GAN for 3D High Resolution Medical Image Synthesis

https://doi.org/10.1109/JBHI.2022.3172976

Sun, Li; Chen, Junxiang; Xu, Yanwu; Gong, Mingming; Yu, Ke; Batmanghelich, Kayhan (August 2022, IEEE Journal of Biomedical and Health Informatics)

Full Text Available
Fair Classification with Instance-dependent Label Noise

Wu, Songhua; Gong, Mingming; Han, Bo; Liu, Yang; Liu, Tongliang (January 2022, First Conference on Causal Learning and Reasoning)

With the widespread use of machine learning systems in our daily lives, it is important to consider fairness as a basic requirement when designing these systems, especially when the systems make life-changing decisions, e.g., \textit{COMPAS} algorithm helps judges decide whether to release an offender. For another thing, due to the cheap but imperfect data collection methods, such as crowdsourcing and web crawling, label noise is ubiquitous, which unfortunately makes fairness-aware algorithms even more prejudiced than fairness-unaware ones, and thereby harmful. To tackle these problems, we provide general frameworks for learning fair classifiers with \textit{instance-dependent label noise}. For statistical fairness notions, we rewrite the classification risk and the fairness metric in terms of noisy data and thereby build robust classifiers. For the causality-based fairness notion, we exploit the internal causal structure of data to model the label noise and \textit{counterfactual fairness} simultaneously. Experimental results demonstrate the effectiveness of the proposed methods on real-world datasets with controllable synthetic label noise.
more » « less
Full Text Available
Domain Adaptation with Invariant Representation Learning: What Transformations to Learn?

Stojanov, Petar; Li, Zijian; Gong, Mingming; Cai, Ruichu; Carbonell, Jaime; Zhang, Kun (January 2021, Advances in Neural Information Processing Systems)
Ranzato, M.; Beygelzimer, A; Dauphin, Y.; Liang, P.S.; Vaughan, J. Wortman (Ed.)
Full Text Available
Instance-dependent Label-noise Learning under a Structural Causal Model

Yao, Yu; Liu, Tongliang; Gong, Mingming; Han, Bo; Niu, Gang; Zhang, Kun (January 2021, Advances in Neural Information Processing Systems)
Ranzato, M.; Beygelzimer, A.; Dauphin, Y.; Liang, P.S.; Vaughan, J. Wortman (Ed.)
Full Text Available
3D-BoxSup: Positive-Unlabeled Learning of Brain Tumor Segmentation Networks From 3D Bounding Boxes

https://doi.org/10.3389/fnins.2020.00350

Xu, Yanwu; Gong, Mingming; Chen, Junxiang; Chen, Ziye; Batmanghelich, Kayhan (April 2020, Frontiers in Neuroscience)

Full Text Available
Generative-Discriminative Complementary Learning

https://doi.org/10.1609/aaai.v34i04.6126

Xu, Yanwu; Gong, Mingming; Chen, Junxiang; Liu, Tongliang; Zhang, Kun; Batmanghelich, Kayhan (June 2020, Proceedings of the AAAI Conference on Artificial Intelligence)

The majority of state-of-the-art deep learning methods are discriminative approaches, which model the conditional distribution of labels given inputs features. The success of such approaches heavily depends on high-quality labeled instances, which are not easy to obtain, especially as the number of candidate classes increases. In this paper, we study the complementary learning problem. Unlike ordinary labels, complementary labels are easy to obtain because an annotator only needs to provide a yes/no answer to a randomly chosen candidate class for each instance. We propose a generative-discriminative complementary learning method that estimates the ordinary labels by modeling both the conditional (discriminative) and instance (generative) distributions. Our method, we call Complementary Conditional GAN (CCGAN), improves the accuracy of predicting ordinary labels and is able to generate high-quality instances in spite of weak supervision. In addition to the extensive empirical studies, we also theoretically show that our model can retrieve the true conditional distribution from the complementarily-labeled data.
more » « less
Full Text Available
Unpaired data empowers association tests

https://doi.org/10.1093/bioinformatics/btaa886

Gong, Mingming; Liu, Peng; Sciurba, Frank C; Stojanov, Petar; Tao, Dacheng; Tseng, George C; Zhang, Kun; Batmanghelich, Kayhan (October 2020, Bioinformatics)
Alfonso, Valencia (Ed.)
Abstract Motivation There is growing interest in the biomedical research community to incorporate retrospective data, available in healthcare systems, to shed light on associations between different biomarkers. Understanding the association between various types of biomedical data, such as genetic, blood biomarkers, imaging, etc. can provide a holistic understanding of human diseases. To formally test a hypothesized association between two types of data in Electronic Health Records (EHRs), one requires a substantial sample size with both data modalities to achieve a reasonable power. Current association test methods only allow using data from individuals who have both data modalities. Hence, researchers cannot take advantage of much larger EHR samples that includes individuals with at least one of the data types, which limits the power of the association test. Results We present a new method called the Semi-paired Association Test (SAT) that makes use of both paired and unpaired data. In contrast to classical approaches, incorporating unpaired data allows SAT to produce better control of false discovery and to improve the power of the association test. We study the properties of the new test theoretically and empirically, through a series of simulations and by applying our method on real studies in the context of Chronic Obstructive Pulmonary Disease. We are able to identify an association between the high-dimensional characterization of Computed Tomography chest images and several blood biomarkers as well as the expression of dozens of genes involved in the immune system. Availability and implementation Code is available on https://github.com/batmanlab/Semi-paired-Association-Test. Supplementary information Supplementary data are available at Bioinformatics online.
more » « less
Full Text Available
Unusual Deformation and Fracture in Gallium Telluride Multilayers

https://doi.org/10.1021/acs.jpclett.2c00411

Zhou, Yan; Zhou, Shi; Ying, Penghua; Zhao, Qinghua; Xie, Yong; Gong, Mingming; Jiang, Pisu; Cai, Hui; Chen, Bin; Tongay, Sefaattin; et al (April 2022, The Journal of Physical Chemistry Letters)

Search for: All records