
Title: CMRM: A Cross-Modal Reasoning Model to Enable Zero-Shot Imitation Learning for Robotic RFID Inventory in Unstructured Environments
The rapid development of Deep Learning (DL) has made it a promising technique for various autonomous robotic systems. Recently, researchers have explored deploying DL models, such as Reinforcement Learning and Imitation Learning, to enable robots to perform Radio-Frequency Identification (RFID) based inventory tasks. However, existing methods either focus on a single field or require tremendous amounts of data and time to train. To address these problems, this paper presents a Cross-Modal Reasoning Model (CMRM), designed to extract high-dimensional information from multiple sensors and to reason over spatial and historical features for latent cross-modal relations. Furthermore, CMRM aligns the learned task policy with high-level features to offer zero-shot generalization to unseen environments. We conduct extensive experiments in several virtual environments as well as in indoor settings with robots performing RFID inventory. The experimental results demonstrate that the proposed CMRM improves learning efficiency by around 20 times. It also demonstrates robust zero-shot generalization, successfully performing RFID inventory tasks when a learned policy is deployed in unseen environments.
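This record includes no implementation details; the sketch below only illustrates one plausible form of the cross-modal reasoning the abstract describes. The module names, sensor dimensions, and attention-plus-recurrence design are assumptions, not the paper's architecture.

```python
# Minimal sketch of a cross-modal fusion policy of the kind the abstract
# describes. All module names, dimensions, and the attention-plus-recurrence
# design are illustrative assumptions, not the paper's architecture.
import torch
import torch.nn as nn

class CrossModalFusion(nn.Module):
    """Fuses per-sensor features and reasons over spatial and historical context."""
    def __init__(self, rfid_dim=64, lidar_dim=256, hidden=128, n_actions=6):
        super().__init__()
        # Per-modality encoders map raw sensor features into a shared space.
        self.rfid_enc = nn.Linear(rfid_dim, hidden)
        self.lidar_enc = nn.Linear(lidar_dim, hidden)
        # Cross-modal attention lets one modality query the other for
        # latent cross-modal relations.
        self.attn = nn.MultiheadAttention(hidden, num_heads=4, batch_first=True)
        # A GRU accumulates historical features across time steps.
        self.history = nn.GRU(hidden, hidden, batch_first=True)
        self.policy_head = nn.Linear(hidden, n_actions)

    def forward(self, rfid_seq, lidar_seq):
        # rfid_seq: (B, T, rfid_dim); lidar_seq: (B, T, lidar_dim)
        q = self.rfid_enc(rfid_seq)
        kv = self.lidar_enc(lidar_seq)
        fused, _ = self.attn(q, kv, kv)       # reason across modalities
        out, _ = self.history(fused)          # reason across time
        return self.policy_head(out[:, -1])   # action logits at the last step
```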
Award ID(s):
2245607 2245608 2300955 1923717
PAR ID:
10519879
Publisher / Repository:
IEEE
ISSN:
2576-6813
ISBN:
979-8-3503-1090-0
Page Range / eLocation ID:
5354 to 5359
Subject(s) / Keyword(s):
Imitation Learning, RFID inventory, Long-horizon tasks, Cross-modal reasoning, multiple sensing spaces
Format(s):
Medium: X
Location:
Kuala Lumpur, Malaysia
Sponsoring Org:
National Science Foundation
More Like this
  1. Long-horizon tasks in unstructured environments are notoriously challenging for robots because they require predicting extensive action plans with thousands of steps while adapting to ever-changing conditions by reasoning across multimodal sensing spaces. Humans efficiently tackle such compound problems by breaking them down into easily reachable abstract sub-goals, significantly reducing complexity. Inspired by this ability, we explore how to enable robots to acquire sub-goal formulation skills for long-horizon tasks and generalize them to novel situations and environments. To address these challenges, we propose the Zero-shot Abstract Sub-goal Framework (ZAS-F), which empowers robots to decompose overarching action plans into transferable abstract sub-goals, thereby providing zero-shot capability in new task conditions. ZAS-F is an imitation-learning-based method that efficiently learns a task policy from a few demonstrations. The learned policy extracts abstract features from multimodal and extensive temporal observations and subsequently uses these features to predict task-agnostic sub-goals by reasoning about their latent relations. We evaluated ZAS-F on radio frequency identification (RFID) inventory tasks across various dynamic environments, a typical long-horizon task requiring robots to handle unpredictable conditions, including unseen objects and structural layouts. Our experiments demonstrated that ZAS-F achieves a learning efficiency 30 times higher than previous methods, requiring only 8k demonstrations. Compared to prior approaches, ZAS-F achieves a 98.3% scanning accuracy while significantly reducing the training data requirement. Further, ZAS-F demonstrated strong generalization, maintaining a scan success rate of 99.4% in real-world deployment without additional fine-tuning. In long-term operations spanning 100 rooms, ZAS-F maintained performance consistent with short-term tasks, highlighting its robustness against compounding errors. These results establish ZAS-F as an efficient and adaptable solution for long-horizon robotic tasks in unstructured environments.
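For intuition only, here is a toy two-level policy of the kind the sub-goal decomposition suggests: a high-level head predicts an abstract sub-goal, and a low-level head conditions actions on it. Every name, shape, and the specific heads are assumptions for illustration, not the paper's implementation.

```python
# Toy two-level policy in the spirit of the sub-goal decomposition above.
# All names and shapes are illustrative assumptions.
import torch
import torch.nn as nn

class SubGoalPolicy(nn.Module):
    """Predicts a task-agnostic abstract sub-goal, then an action conditioned on it."""
    def __init__(self, obs_dim=512, subgoal_dim=32, n_actions=6):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, 256), nn.ReLU())
        # High-level head: abstract sub-goal in a task-agnostic latent space.
        self.subgoal_head = nn.Linear(256, subgoal_dim)
        # Low-level head: action conditioned on observation and sub-goal.
        self.action_head = nn.Linear(256 + subgoal_dim, n_actions)

    def forward(self, obs):
        h = self.encoder(obs)
        subgoal = self.subgoal_head(h)
        logits = self.action_head(torch.cat([h, subgoal], dim=-1))
        return subgoal, logits
```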
  2. This project introduces a framework that enables robots to recognize human hand signals, a reliable and practical device-free means of communication in noisy environments such as construction sites and airport ramps, to facilitate efficient human-robot collaboration. Various hand signal systems are adopted by small groups for specific purposes, such as marshalling on airport ramps and crane operation on construction sites. Robots must be robust to unpredictable conditions, including varied backgrounds and human appearances, an extreme challenge imposed by open environments. To address these challenges, we propose Instant Hand Signal Recognition (IHSR), a learning-based framework with world knowledge of human gestures embedded, which allows robots to learn novel hand signals from a few samples. It also offers robust zero-shot generalization, recognizing learned signals in novel scenarios. Extensive experiments show that IHSR can learn a novel hand signal from only 50 samples, which is 30+ times more efficient than the state-of-the-art method. It also demonstrates robust zero-shot generalization when deploying a learned model in unseen environments to recognize hand signals from unseen human users.
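A plausible few-shot recipe consistent with this description is to freeze a pretrained visual backbone (the embedded "world knowledge") and fit only a small classification head on the ~50 labeled samples. The sketch below assumes a ResNet-18 backbone and 10 signal classes, neither of which is specified by the source.

```python
# Freeze a pretrained backbone and train only a small head on ~50 samples.
# Backbone choice and class count are assumptions, not IHSR's actual design.
import torch
import torch.nn as nn
from torchvision import models

backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
backbone.fc = nn.Identity()            # expose 512-d features
backbone.eval()                        # deterministic frozen features
for p in backbone.parameters():
    p.requires_grad = False

head = nn.Linear(512, 10)              # e.g., 10 hand signals (illustrative)
opt = torch.optim.Adam(head.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

def train_step(images, labels):
    with torch.no_grad():
        feats = backbone(images)       # (B, 512) frozen features
    loss = loss_fn(head(feats), labels)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```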
  3. Mobile robot navigation is a critical aspect of robotics, with applications spanning from service robots to industrial automation. However, navigating complex and dynamic environments poses many challenges, such as avoiding obstacles, making decisions in real time, and adapting to new situations. Reinforcement Learning (RL) has emerged as a promising approach for enabling robots to learn navigation policies from their interactions with the environment. However, applying RL methods to real-world tasks such as mobile robot navigation, and evaluating their performance under various training-testing settings, has not been sufficiently researched. In this paper, we design an evaluation framework that investigates an RL algorithm's generalization capability to unseen scenarios, in terms of learning convergence and success rates, by transferring policies learned in simulation to physical environments. To achieve this, we designed a simulated environment in Gazebo for training the robot over a large number of episodes. The training environment closely mimics the typical indoor scenarios that a mobile robot can encounter, replicating real-world challenges. For evaluation, we designed physical environments with and without unforeseen indoor scenarios. This evaluation framework outputs statistical metrics, which we then use to conduct an extensive study of a deep RL method, namely proximal policy optimization (PPO). The results provide valuable insights into the strengths and limitations of the method for mobile robot navigation. Our experiments demonstrate that the model trained in simulation can be deployed to the previously unseen physical world with a success rate of over 88%. The insights gained from our study can assist practitioners and researchers in selecting suitable RL approaches and training-testing settings for their specific robotic navigation tasks.
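A minimal end-to-end sketch of the train-then-evaluate loop such a study implies, using stable-baselines3 PPO; the environment id is hypothetical and stands in for the paper's Gazebo setup.

```python
# Train PPO in simulation, then roll out the policy, e.g., to estimate the
# success rate before sim-to-real transfer. "IndoorNav-v0" is a hypothetical
# environment id; the paper's Gazebo world is not distributed with this record.
import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("IndoorNav-v0")              # placeholder navigation env
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=1_000_000)      # long simulated training run
model.save("ppo_indoor_nav")

obs, _ = env.reset()
for _ in range(1000):
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, _ = env.step(action)
    if terminated or truncated:
        obs, _ = env.reset()
```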
  4.
    The problem of learning to generalize to classes unseen during training, also known as few-shot classification, has attracted considerable attention. Initialization-based methods, such as gradient-based model-agnostic meta-learning (MAML) [1], tackle the few-shot learning problem by "learning to fine-tune". The goal of these approaches is to learn a model initialization such that classifiers for new classes can be learned from a few labeled examples with a small number of gradient update steps. Few-shot meta-learning is well known for its fast adaptation and accurate generalization to unseen tasks [2]. Learning fairly, with unbiased outcomes, is another significant hallmark of human intelligence, yet it is rarely addressed in few-shot meta-learning. In this work, we propose a novel Primal-Dual Fair Meta-learning framework, PDFM, which learns to train fair machine learning models from only a few examples, based on data from related tasks. The key idea is to learn a good initialization of a fair model's primal and dual parameters so that it can adapt to a new fair learning task via a few gradient update steps. Instead of manually tuning the dual parameters as hyperparameters via a grid search, PDFM jointly optimizes the initialization of the primal and dual parameters for fair meta-learning via a subgradient primal-dual approach. We further instantiate an example of bias control using decision boundary covariance (DBC) [3] as the fairness constraint for each task, and demonstrate the versatility of our proposed approach by applying it to classification on three real-world datasets. Our experiments show substantial improvements over the best prior work for this setting.
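The subgradient primal-dual idea can be made concrete with a single update step on the task Lagrangian; the sketch below is a generic textbook form with illustrative step sizes and callables, not PDFM's code.

```python
# One subgradient primal-dual step on the Lagrangian L(theta) + lam * g(theta),
# where g(theta) <= 0 encodes a fairness constraint such as a bound on
# decision boundary covariance (DBC). Step sizes and the callables are
# illustrative assumptions, not PDFM's implementation.
def primal_dual_step(theta, lam, grad_loss, g, grad_g, eta_p=0.01, eta_d=0.01):
    # Primal descent on the Lagrangian with the current dual variable.
    theta = theta - eta_p * (grad_loss(theta) + lam * grad_g(theta))
    # Dual subgradient ascent, projected onto lam >= 0.
    lam = max(0.0, lam + eta_d * g(theta))
    return theta, lam
```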
  5. Humans have the remarkable ability to recognize and acquire novel visual concepts in a zero-shot manner. Given a high-level, symbolic description of a novel concept in terms of previously learned visual concepts and their relations, humans can recognize the novel concept without seeing any examples. Moreover, they can acquire new concepts by parsing and communicating symbolic structures built from learned visual concepts and relations. Endowing machines with these capabilities is pivotal to improving their generalization at inference time. We introduced Zero-shot Concept Recognition and Acquisition (ZeroC), a neuro-symbolic architecture that can recognize and acquire novel concepts in a zero-shot way. ZeroC represents concepts as graphs of constituent concept models (nodes) and their relations (edges). To allow inference-time composition, we employed energy-based models (EBMs) to model concepts and relations. We designed the ZeroC architecture to allow a one-to-one mapping between the symbolic graph structure of a concept and its corresponding EBM, which, for the first time, allows acquiring a new concept, communicating its graph structure, and applying it to classification and detection tasks (even across domains) at inference time. We introduced algorithms for learning and inference with ZeroC. We evaluated ZeroC on a challenging grid-world dataset designed to probe zero-shot concept recognition and acquisition, and demonstrated its capability.
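The graph-structured energy composition can be summarized in a few lines: the energy of a composed concept is the sum of its constituent concept (node) and relation (edge) energies. The interfaces below are assumptions; the paper's actual EBM signatures are not given here.

```python
# Composite energy of a concept graph: sum of node (concept) and edge
# (relation) energies. Low total energy means the input realizes the concept.
# The EBM callables and per-node binding of input regions are assumptions.
def composite_energy(parts, nodes, edges, concept_ebms, relation_ebms):
    """parts[i]: input region bound to node i; nodes: {node index: concept name};
    edges: iterable of (i, j, relation name) triples."""
    energy = sum(concept_ebms[c](parts[i]) for i, c in nodes.items())
    energy += sum(relation_ebms[r](parts[i], parts[j]) for i, j, r in edges)
    return energy
```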