A Walk to Remember: Mllm Memory-Driven Visual Navigation*

Vitharana, Sandun S; Mallikarachchi, Sanjaya; Wijayagrahi, HG Chamika; Abizov, Nuralem; Bektemessov, Amanzhol; Ibrayev, Aidos; Godage, Isuru S

doi:10.1109/UR65550.2025.11078086

Citation Details

This content will become publicly available on June 30, 2026

A Walk to Remember: Mllm Memory-Driven Visual Navigation*

This paper presents a novel framework for memory-based navigation for terrestrial robots, utilizing a customized multimodal large language model (MLLM) to interpret visual inputs and generate navigation commands. The system employs a Unitree GO1 robot equipped with a camera to capture environmental images, which are processed by the customized MLLM for navigation. By leveraging a memory-based approach, the robot efficiently reuses previously traversed paths, reducing the need for re-exploration and enhancing navigation efficiency. The hybrid controller in this work features a deliberation unit and a reactive controller for high-level commands and robot alignment. Experimental validation in a hallway-like environment demonstrates that memory-driven navigation improves path retracing and overall performance. more »

Award ID(s):: 2326536 2327702

PAR ID:: 10633672

Author(s) / Creator(s):: Vitharana, Sandun S; Mallikarachchi, Sanjaya; Wijayagrahi, HG Chamika; Abizov, Nuralem; Bektemessov, Amanzhol; Ibrayev, Aidos; Godage, Isuru S

Publisher / Repository:: IEEE

Date Published:: 2025-06-30

ISBN:: 979-8-3315-2441-8

Page Range / eLocation ID:: 235 to 240

Format(s):: Medium: X

Location:: College Station, TX, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 30, 2026
Conference Paper:
https://doi.org/10.1109/UR65550.2025.11078086

More Like this