This content will become publicly available on March 7, 2026

Title: Generate, Then Retrieve: Addressing Missing Modalities in Multimodal Learning via Generative AI and MoE
In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks, such as those in medical contexts where data may be incomplete. Although some attempts have been made to retrieve embeddings for missing modalities, two main bottlenecks remain: (1) the need to consider both intra- and inter-modal context, and (2) the cost of embedding selection, where candidate embeddings often lack modality-specific knowledge. To address this, the authors propose MoE-Retriever, a novel framework inspired by Sparse Mixture of Experts (SMoE). MoE-Retriever defines a supporting group for intra-modal inputs, i.e., samples that commonly lack the target modality, by selecting samples whose modality combinations complement the target modality. This group is combined with inter-modal inputs drawn from the other modalities of the same sample, establishing both intra- and inter-modal context. These inputs are processed by Multi-Head Attention to generate context-aware embeddings, which serve as inputs to the SMoE Router that automatically selects the most relevant experts (embedding candidates). Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
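The pipeline the abstract describes (supporting-group context, Multi-Head Attention, sparse routing over embedding candidates) can be sketched roughly as follows. This is a minimal PyTorch illustration, not the authors' implementation; the module name, the mean-pooled query, and the learnable expert table are assumptions made for clarity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseEmbeddingRetriever(nn.Module):
    """Illustrative sketch: context-aware retrieval of a missing-modality
    embedding from a pool of expert candidates (names are assumptions)."""

    def __init__(self, dim: int, num_experts: int, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.router = nn.Linear(dim, num_experts)                    # SMoE-style gate
        self.experts = nn.Parameter(torch.randn(num_experts, dim))   # embedding candidates

    def forward(self, intra_ctx, inter_ctx):
        # intra_ctx: (B, S_intra, D) embeddings from the supporting group
        # inter_ctx: (B, S_inter, D) embeddings of the sample's observed modalities
        ctx = torch.cat([intra_ctx, inter_ctx], dim=1)     # (B, S, D)
        fused, _ = self.attn(ctx, ctx, ctx)                # context-aware tokens
        query = fused.mean(dim=1)                          # (B, D) pooled context
        logits = self.router(query)                        # (B, E)
        top1 = logits.argmax(dim=-1)                       # pick the most relevant expert
        gate = F.softmax(logits, dim=-1).gather(1, top1.unsqueeze(1))  # (B, 1) gate value
        retrieved = self.experts[top1] * gate              # (B, D) retrieved embedding
        return retrieved, top1

# toy usage
B, D = 2, 64
model = SparseEmbeddingRetriever(dim=D, num_experts=8)
intra = torch.randn(B, 5, D)   # 5 supporting-group samples
inter = torch.randn(B, 3, D)   # 3 observed modalities of the same sample
emb, expert_idx = model(intra, inter)
print(emb.shape, expert_idx.shape)   # torch.Size([2, 64]) torch.Size([2])
```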
Award ID(s):
2505865
PAR ID:
10631826
Sponsoring Org:
National Science Foundation
More Like this
  1. In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks, such as those in medical contexts where data may be incomplete. Although some attempts have been made to retrieve embeddings for missing modalities, two main bottlenecks remain: the need to consider both intra- and inter-modal context, and the cost of embedding selection, where embeddings often lack modality-specific knowledge. In response, we propose MoE-Retriever, a novel framework inspired by the design principles of Sparse Mixture of Experts (SMoE). First, MoE-Retriever samples relevant data from modality combinations, using a so-called supporting group to construct intra-modal inputs while incorporating inter-modal inputs. These inputs are then processed by Multi-Head Attention, after which the SMoE Router automatically selects the most relevant expert, i.e., the embedding candidate to be retrieved. Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
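How a supporting group might be selected from modality-availability masks is sketched below; the ranking rule (prefer samples that also miss the target modality but observe many complementary ones) is an assumption for illustration, not the paper's exact sampling procedure.

```python
import torch

def build_supporting_group(avail: torch.Tensor, target: int, k: int = 5):
    """Assumed selection logic: given a boolean availability mask `avail` of
    shape (N, M) over N samples and M modalities, pick up to k samples that
    also miss the target modality, ranked by how many other modalities they
    provide (i.e., how complementary they are)."""
    missing_target = ~avail[:, target]        # samples that lack the target modality
    coverage = avail.sum(dim=1)               # how many modalities each sample observes
    score = coverage * missing_target         # zero out samples that already have the target
    idx = torch.argsort(score, descending=True)[:k]
    return idx[score[idx] > 0]                # keep only genuinely complementary samples

# toy usage: 6 samples, 3 modalities; modality 2 is the target
avail = torch.tensor([
    [1, 1, 0],
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 1],
    [1, 0, 1],
    [0, 1, 0],
], dtype=torch.bool)
print(build_supporting_group(avail, target=2, k=3))  # indices of complementary samples, most coverage first
```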
  2. Supervised multi-modal learning involves mapping multiple modalities to a target label. Previous studies in this field have concentrated on capturing, in isolation, either the inter-modality dependencies (the relationships between different modalities and the label) or the intra-modality dependencies (the relationships within a single modality and the label). We argue that these conventional approaches, which rely solely on either inter- or intra-modality dependencies, may not be optimal in general. We view the multi-modal learning problem through the lens of generative models, treating the target as a source of the multiple modalities and of the interaction between them. To that end, we propose the inter- & intra-modality modeling (I2M2) framework, which captures and integrates both the inter- and intra-modality dependencies, leading to more accurate predictions. We evaluate our approach on real-world healthcare and vision-and-language datasets with state-of-the-art models, demonstrating superior performance over traditional methods that focus on only one type of modality dependency.
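One simple way to combine intra- and inter-modality dependencies in the spirit of I2M2 is to ensemble per-modality classifiers with a joint classifier over the fused features. The sketch below is an assumed reading of that idea, not the authors' code.

```python
import torch
import torch.nn as nn

class I2M2Sketch(nn.Module):
    """Minimal sketch: one classifier per modality (intra-modality
    dependencies) plus a joint classifier over concatenated features
    (inter-modality dependencies), with their logits combined."""

    def __init__(self, dims, num_classes):
        super().__init__()
        self.intra_heads = nn.ModuleList(nn.Linear(d, num_classes) for d in dims)
        self.inter_head = nn.Linear(sum(dims), num_classes)

    def forward(self, feats):
        # feats: list of per-modality feature tensors, feats[i] has shape (B, dims[i])
        intra_logits = [head(x) for head, x in zip(self.intra_heads, feats)]
        inter_logits = self.inter_head(torch.cat(feats, dim=-1))
        return inter_logits + torch.stack(intra_logits).sum(dim=0)

# toy usage with two modalities
model = I2M2Sketch(dims=[32, 48], num_classes=3)
logits = model([torch.randn(4, 32), torch.randn(4, 48)])
print(logits.shape)  # torch.Size([4, 3])
```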
  3. Modality fusion is a cornerstone of multimodal learning, enabling information integration from diverse data sources. However, vanilla fusion methods are limited by (1) their inability to account for heterogeneous interactions between modalities and (2) their lack of interpretability in uncovering the multimodal interactions inherent in the data. To this end, we propose I2MoE (Interpretable Multimodal Interaction-aware Mixture of Experts), an end-to-end MoE framework designed to enhance modality fusion by explicitly modeling diverse multimodal interactions while providing interpretation at both the local and global levels. First, I2MoE utilizes different interaction experts with weakly supervised interaction losses to learn multimodal interactions in a data-driven way. Second, I2MoE deploys a reweighting model that assigns importance scores to the output of each interaction expert, offering sample-level and dataset-level interpretation. Extensive evaluation on medical and general multimodal datasets shows that I2MoE is flexible enough to be combined with different fusion techniques, consistently improves task performance, and provides interpretation across various real-world scenarios.
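The interaction-expert plus reweighting structure can be illustrated with a small sketch: several expert heads produce predictions from the fused features, and a reweighting network assigns per-sample importance scores that both combine the experts and serve as an interpretation signal. Module names and sizes below are assumptions, not the released I2MoE architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class InteractionMoESketch(nn.Module):
    """Minimal sketch of interaction-aware fusion with a reweighting model:
    each 'interaction expert' maps fused features to a prediction, and a
    small reweighting network scores the experts per sample."""

    def __init__(self, fused_dim, num_experts, num_classes):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(fused_dim, 64), nn.ReLU(), nn.Linear(64, num_classes))
            for _ in range(num_experts)
        )
        self.reweight = nn.Linear(fused_dim, num_experts)

    def forward(self, fused):
        expert_out = torch.stack([e(fused) for e in self.experts], dim=1)  # (B, E, C)
        scores = F.softmax(self.reweight(fused), dim=-1)                   # (B, E) importance scores
        logits = (scores.unsqueeze(-1) * expert_out).sum(dim=1)            # weighted combination
        return logits, scores   # scores give the sample-level interpretation

model = InteractionMoESketch(fused_dim=128, num_experts=4, num_classes=2)
logits, scores = model(torch.randn(8, 128))
print(logits.shape, scores.shape)  # torch.Size([8, 2]) torch.Size([8, 4])
```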
  4. Learning multimodal representations is a fundamentally complex research problem due to the presence of multiple heterogeneous sources of information. Although the presence of multiple modalities provides additional valuable information, there are two key challenges to address when learning from multimodal data: 1) models must learn the complex intra-modal and cross-modal interactions for prediction and 2) models must be robust to unexpected missing or noisy modalities during testing. In this paper, we propose to optimize for a joint generative-discriminative objective across multimodal data and labels. We introduce a model that factorizes representations into two sets of independent factors: multimodal discriminative and modality-specific generative factors. Multimodal discriminative factors are shared across all modalities and contain joint multimodal features required for discriminative tasks such as sentiment prediction. Modality-specific generative factors are unique for each modality and contain the information required for generating data. Experimental results show that our model is able to learn meaningful multimodal representations that achieve state-of-the-art or competitive performance on six multimodal datasets. Our model demonstrates flexible generative capabilities by conditioning on independent factors and can reconstruct missing modalities without significantly impacting performance. Lastly, we interpret our factorized representations to understand the interactions that influence multimodal learning. 
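A compact way to picture the factorization is an encoder pair per modality: one feeding a shared discriminative factor used by the classifier, and one producing a modality-specific generative factor used for reconstruction. The sketch below is illustrative only and does not reproduce the paper's training objective.

```python
import torch
import torch.nn as nn

class FactorizedMultimodalSketch(nn.Module):
    """Minimal sketch of factorizing representations into a shared
    discriminative factor and modality-specific generative factors
    (assumed structure for illustration, not the paper's exact model)."""

    def __init__(self, dims, shared_dim, private_dim, num_classes):
        super().__init__()
        self.shared_encs = nn.ModuleList(nn.Linear(d, shared_dim) for d in dims)
        self.private_encs = nn.ModuleList(nn.Linear(d, private_dim) for d in dims)
        self.decoders = nn.ModuleList(nn.Linear(shared_dim + private_dim, d) for d in dims)
        self.classifier = nn.Linear(shared_dim, num_classes)

    def forward(self, feats):
        # shared discriminative factor: pooled across modalities
        shared = torch.stack([e(x) for e, x in zip(self.shared_encs, feats)]).mean(dim=0)
        # modality-specific generative factors, used to reconstruct each modality
        privates = [e(x) for e, x in zip(self.private_encs, feats)]
        recons = [dec(torch.cat([shared, p], dim=-1)) for dec, p in zip(self.decoders, privates)]
        return self.classifier(shared), recons   # discriminative + generative outputs

model = FactorizedMultimodalSketch(dims=[32, 48], shared_dim=16, private_dim=8, num_classes=3)
logits, recons = model([torch.randn(4, 32), torch.randn(4, 48)])
print(logits.shape, [r.shape for r in recons])
```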
  5. Compound-protein pairs dominate FDA-approved drug-target pairs, and predicting compound-protein affinity and contact (CPAC) could help accelerate drug discovery. In this study we consider proteins as multi-modal data, including 1D amino-acid sequences and (sequence-predicted) 2D residue-pair contact maps. We empirically evaluate the embeddings of the two single modalities in their accuracy and generalizability for CPAC prediction (i.e., structure-free, interpretable compound-protein affinity prediction), and we rationalize their performance with respect to the twin challenges of embedding individual modalities and learning a generalizable embedding-label relationship. We further propose two models involving cross-modality protein embedding and establish that the one with cross interaction (thus capturing correlations among modalities) outperforms state-of-the-art methods and our single-modality models in affinity, contact, and binding-site predictions for proteins never seen in the training set.
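Cross-modality protein embedding with cross interaction can be sketched as a sequence encoder and a contact-map encoder coupled through cross-attention, followed by an affinity head. Encoder choices, dimensions, and the row-pooling of the contact map below are assumptions for illustration, not the paper's architecture.

```python
import torch
import torch.nn as nn

class CrossModalProteinSketch(nn.Module):
    """Minimal sketch of cross interaction between a protein's 1D sequence
    embedding and its 2D contact-map embedding (illustrative only)."""

    def __init__(self, vocab=25, dim=64):
        super().__init__()
        self.seq_emb = nn.Embedding(vocab, dim)           # 1D amino-acid sequence tokens
        self.map_enc = nn.Linear(1, dim)                  # per-entry contact-map features
        self.cross_attn = nn.MultiheadAttention(dim, 4, batch_first=True)
        self.affinity_head = nn.Linear(dim, 1)

    def forward(self, seq_tokens, contact_map):
        # seq_tokens: (B, L) integer residues; contact_map: (B, L, L) values in [0, 1]
        seq = self.seq_emb(seq_tokens)                               # (B, L, D)
        cmap = self.map_enc(contact_map.unsqueeze(-1)).mean(dim=2)   # (B, L, D) row-pooled map
        fused, _ = self.cross_attn(seq, cmap, cmap)                  # sequence attends to structure
        return self.affinity_head(fused.mean(dim=1)).squeeze(-1)     # (B,) predicted affinity

model = CrossModalProteinSketch()
seq = torch.randint(0, 25, (2, 50))
cmap = torch.rand(2, 50, 50)
print(model(seq, cmap).shape)   # torch.Size([2])
```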