Sparse MoE as a New Retriever: Addressing Missing Modality Problem in Incomplete Multimodal Data

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Long, Qi; Chen, Tianlong

This content will become publicly available on February 5, 2026

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks, such as those in medical contexts where data may be incomplete. Although some attempts have been made to effectively retrieve embeddings for missing modalities, two main bottlenecks remain: the consideration of both intra- and inter-modal context, and the cost of embedding selection, where embeddings often lack modality-specific knowledge. In response, we propose MoE-Retriever, a novel framework inspired by the design principles of Sparse Mixture of Experts (SMoE). First, MoE-Retriever samples the relevant data from modality combinations, using a so-called supporting group to construct intra-modal inputs while incorporating inter-modal inputs. These inputs are then processed by Multi-Head Attention, after which the SMoE Router automatically selects the most relevant expert, i.e., the embedding candidate to be retrieved. Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
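
The pipeline the abstract describes (intra-modal inputs from a supporting group, combined with an inter-modal input, passed through attention, then routed to a single expert) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single attention head instead of Multi-Head Attention, a linear router with top-1 selection, and experts that are plain linear maps. All names (`attention_pool`, `route_top1`, `retrieve_embedding`) and the toy values are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attention_pool(query, keys, values):
    """Single-head scaled dot-product attention for one query (a simplification)."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k in keys])
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]


def route_top1(hidden, router_weights):
    """Top-1 sparse routing: score every expert, keep only the best one."""
    probs = softmax([dot(hidden, w) for w in router_weights])
    best = max(range(len(probs)), key=probs.__getitem__)
    return best, probs


def retrieve_embedding(inter_modal, supporting_group, router_weights, experts):
    """Return (expert index, retrieved embedding) for a missing modality.

    Assumption: the inter-modal input acts as the attention query, and the
    supporting group (intra-modal inputs) plus that input form the keys/values.
    """
    tokens = supporting_group + [inter_modal]
    hidden = attention_pool(inter_modal, tokens, tokens)
    idx, probs = route_top1(hidden, router_weights)
    # Assumption: each expert is a linear map stored as a list of rows.
    embedding = [probs[idx] * dot(row, hidden) for row in experts[idx]]
    return idx, embedding


# Toy usage with 2-dimensional embeddings and three candidate experts.
group = [[1.0, 0.0], [0.9, 0.1]]          # intra-modal inputs (supporting group)
inter = [1.0, 0.0]                         # inter-modal input from an observed modality
router = [[5.0, 0.0], [-5.0, 0.0], [0.0, 0.0]]
experts = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[2.0, 0.0], [0.0, 2.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
chosen, embedding = retrieve_embedding(inter, group, router, experts)
```

Only the selected expert is evaluated, which is the property that makes the routing sparse; the gate probability scales that expert's output, as is common in SMoE layers.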

Award ID(s): 2505865

PAR ID: 10631052

Author(s) / Creator(s): Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Long, Qi; Chen, Tianlong

Publisher / Repository: ICLR 2025 https://openreview.net/forum?id=j9DbobO0mY

Date Published: 2025-02-05

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Conference Paper: The DOI is not currently available.