Speech-Guided Sequential Planning for Autonomous Navigation Using Large Language Model Meta AI 3 (Llama3)

Srivastava, Alkesh K; Dames, Philip

doi:10.1007/978-981-96-3519-1_15

Citation Details

This content will become publicly available on March 25, 2026

Speech-Guided Sequential Planning for Autonomous Navigation Using Large Language Model Meta AI 3 (Llama3)

In social robotics, a pivotal focus is enabling robots to engage with humans in a more natural and seamless manner. The emergence of advanced large language models (LLMs) has driven significant advancements in integrating natural language understanding capabilities into social robots. This paper presents a system for speech-guided sequential planning in pick and place tasks, which are found across a range of application areas. The proposed system uses Large Language Model Meta AI (Llama3) to interpret voice commands by extracting essential details through parsing and decoding the commands into sequential actions. These actions are sent to DRL-VO, a learning-based control policy built on the Robot Operating System (ROS) that allows a robot to autonomously navigate through social spaces with static infrastructure and crowds of people. We demonstrate the effectiveness of the system in simulation experiment using Turtlebot 2 in ROS1 and Turtlebot 3 in ROS2. We conduct hardware trials using a Clearpath Robotics Jackal UGV, highlighting its potential for real-world deployment in scenarios requiring flexible and interactive robotic behaviors. more »

Award ID(s):: 2143312

PAR ID:: 10590513

Author(s) / Creator(s):: Srivastava, Alkesh K; Dames, Philip

Publisher / Repository:: Springer Nature Singapore

Date Published:: 2025-03-25

ISBN:: 978-981-96-3519-1

Page Range / eLocation ID:: 158 to 168

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on March 25, 2026
Book Chapter:
https://doi.org/10.1007/978-981-96-3519-1_15

More Like this