NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Generate, Then Retrieve: Addressing Missing Modalities in Multimodal Learning via Generative AI and MoE

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Ding, Ying; Long, Qi; Chen, Tianlong (March 2025, aUpA5gulZ4)

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks such as in medical contexts where data may be incomplete. Although some attempts have been made to retrieve embeddings for missing modalities, two main bottlenecks remain: (1) the need to consider both intra- and inter-modal context, and (2) the cost of embedding selection, where embeddings often lack modality-specific knowledge. To address this, the authors propose MoE-Retriever, a novel framework inspired by Sparse Mixture of Experts (SMoE). MoE-Retriever defines a supporting group for intra-modal inputs—samples that commonly lack the target modality—by selecting samples with complementary modality combinations for the target modality. This group is integrated with inter-modal inputs from different modalities of the same sample, establishing both intra- and inter-modal contexts. These inputs are processed by Multi-Head Attention to generate context-aware embeddings, which serve as inputs to the SMoE Router that automatically selects the most relevant experts (embedding candidates). Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
more » « less
Free, publicly-accessible full text available March 7, 2026
Sparse MoE as a New Retriever: Addressing Missing Modality Problem in Incomplete Multimodal Data

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Long, Qi; Chen, Tianlong (February 2025, ICLR 2025 https://openreview.net/forum?id=j9DbobO0mY)

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks such as in medical contexts where data may be incomplete. Although some attempts have been made to effectively retrieve embeddings for missing modalities, two main bottlenecks remain: the consideration of both intra- and inter-modal context, and the cost of embedding selection, where embeddings often lack modality-specific knowledge. In response, we propose MoE-Retriever, a novel framework inspired by the design principles of Sparse Mixture of Experts (SMoE). First, MoE-Retriever samples the relevant data from modality combinations, using a so-called supporting group to construct intra-modal inputs while incorporating inter-modal inputs. These inputs are then processed by Multi-Head Attention, after which the SMoE Router automatically selects the most relevant expert, i.e., the embedding candidate to be retrieved. Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
more » « less
Free, publicly-accessible full text available February 5, 2026
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

https://doi.org/10.18653/v1/2025.naacl-long.623

Ouyang, Yang; Gu, Hengrui; Lin, Shuhang; Hua, Wenyue; Peng, Jie; Kailkhura, Bhavya; Gao, Meijun; Chen, Tianlong; Zhou, Kaixiong (January 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available January 1, 2026
Testing General Linear Hypotheses Under a High-Dimensional Multivariate Regression Model with Spiked Noise Covariance

https://doi.org/10.1080/01621459.2023.2278825

Li, Haoran; Aue, Alexander; Paul, Debashis; Peng, Jie (March 2024, Journal of the American Statistical Association)

Full Text Available
Estimating fiber orientation distribution with application to study brain lateralization using HCP D-MRI data

https://doi.org/10.1214/23-AOAS1781

Hwang, Seungyong; Lee, Thomas_C M; Paul, Debashis; Peng, Jie (March 2024, The Annals of Applied Statistics)

Full Text Available
DAGBagM: learning directed acyclic graphs of mixed variables with an application to identify protein biomarkers for treatment response in ovarian cancer

https://doi.org/10.1186/s12859-022-04864-y

Chowdhury, Shrabanti; Wang, Ru; Yu, Qing; Huntoon, Catherine J.; Karnitz, Larry M.; Kaufmann, Scott H.; Gygi, Steven P.; Birrer, Michael J.; Paulovich, Amanda G.; Peng, Jie; et al (December 2022, BMC Bioinformatics)

Abstract Background Applying directed acyclic graph (DAG) models to proteogenomic data has been shown effective for detecting causal biomarkers of complex diseases. However, there remain unsolved challenges in DAG learning to jointly model binary clinical outcome variables and continuous biomarker measurements. Results In this paper, we propose a new tool, DAGBagM, to learn DAGs with both continuous and binary nodes. By using appropriate models, DAGBagM allows for either continuous or binary nodes to be parent or child nodes. It employs a bootstrap aggregating strategy to reduce false positives in edge inference. At the same time, the aggregation procedure provides a flexible framework to robustly incorporate prior information on edges. Conclusions Through extensive simulation experiments, we demonstrate that DAGBagM has superior performance compared to alternative strategies for modeling mixed types of nodes. In addition, DAGBagM is computationally more efficient than two competing methods. When applying DAGBagM to proteogenomic datasets from ovarian cancer studies, we identify potential protein biomarkers for platinum refractory/resistant response in ovarian cancer. DAGBagM is made available as a github repository at https://github.com/jie108/dagbagM .
more » « less
Full Text Available
An adaptable generalization of Hotelling’s $$T^{2}$$ test in high dimension

https://doi.org/10.1214/19-AOS1869

Li, Haoran; Aue, Alexander; Paul, Debashis; Peng, Jie; Wang, Pei (June 2020, Annals of Statistics)

Full Text Available

Search for: All records