skip to main content


This content will become publicly available on September 24, 2024

Title: Inverse Reinforcement Learning and Gaussian Process Regression-based Real-Time Framework for Personalized Adaptive Cruise Control
Adaptive Cruise Control (ACC) has become increasingly popular in modern vehicles, providing enhanced driving safety, comfort, and fuel efficiency. However, predefined ACC settings may not always align with a driver's preferences, leading to discomfort and possible safety hazards. To address this issue, Personalized ACC (P-ACC) has been studied by scholars. However, existing research mostly relies on historical driving data to imitate driver styles, which ignores real-time feedback from the driver. To overcome this limitation, we propose a cloud-vehicle collaborative P-ACC framework, which integrates real-time driver feedback adaptation. This framework consists of offline and online modules. The offline module records the driver's naturalistic car-following trajectory and uses inverse reinforcement learning (IRL) to train the model on the cloud. The online module utilizes the driver's real-time feedback to update the driving gap preference in real-time using Gaussian process regression (GPR). By retraining the model on the cloud with the driver's takeover trajectories, our approach achieves incremental learning to better match the driver's preference. In human-in-the-loop (HuiL) simulation experiments, the proposed framework results in a significant reduction of driver intervention in automatic control systems, up to 70.9%.  more » « less
Award ID(s):
2152258
NSF-PAR ID:
10510973
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
IEEE
Date Published:
ISBN:
979-8-3503-9946-2
Page Range / eLocation ID:
4428 to 4435
Format(s):
Medium: X
Location:
Bilbao, Spain
Sponsoring Org:
National Science Foundation
More Like this
  1. Advanced Driver Assistance Systems (ADAS) are increasingly important in improving driving safety and comfort, with Adaptive Cruise Control (ACC) being one of the most widely used. However, pre-defined ACC settings may not always align with driver's preferences and habits, leading to discomfort and potential safety issues. Personalized ACC (P-ACC) has been proposed to address this problem, but most existing research uses historical driving data to imitate behaviors that conform to driver preferences, neglecting real-time driver feedback. To bridge this gap, we propose a cloud-vehicle collaborative P-ACC framework that incorporates driver feedback adaptation in real time. The framework is divided into offline and online parts. The offline component records the driver's naturalistic car-following trajectory and uses inverse reinforcement learning (IRL) to train the model on the cloud. In the online component, driver feedback is used to update the driving gap preference in real time. The model is then retrained on the cloud with driver's takeover trajectories, achieving incremental learning to better match driver's preference. Human-in-the-loop (HuiL) simulation experiments demonstrate that our proposed method significantly reduces driver intervention in automatic control systems by up to 62.8%. By incorporating real-time driver feedback, our approach enhances the comfort and safety of P-ACC, providing a personalized and adaptable driving experience. 
    more » « less
  2. Abstract

    Vehicle‐to‐Everything (V2X) communication has been proposed as a potential solution to improve the robustness and safety of autonomous vehicles by improving coordination and removing the barrier of non‐line‐of‐sight sensing. Cooperative Vehicle Safety (CVS) applications are tightly dependent on the reliability of the underneath data system, which can suffer from loss of information due to the inherent issues of their different components, such as sensors' failures or the poor performance of V2X technologies under dense communication channel load. Particularly, information loss affects the target classification module and, subsequently, the safety application performance. To enable reliable and robust CVS systems that mitigate the effect of information loss, a Context‐Aware Target Classification (CA‐TC) module coupled with a hybrid learning‐based predictive modeling technique for CVS systems is proposed. The CA‐TC consists of two modules: a Context‐Aware Map (CAM), and a Hybrid Gaussian Process (HGP) prediction system. Consequently, the vehicle safety applications use the information from the CA‐TC, making them more robust and reliable. The CAM leverages vehicles' path history, road geometry, tracking, and prediction; and the HGP is utilized to provide accurate vehicles' trajectory predictions to compensate for data loss (due to communication congestion) or sensor measurements' inaccuracies. Based on offline real‐world data, a finite bank of driver models that represent the joint dynamics of the vehicle and the drivers' behavior is learned. Offline training and online model updates are combined with on‐the‐fly forecasting to account for new possible driver behaviors. Finally, the framework is validated using simulation and realistic driving scenarios to confirm its potential in enhancing the robustness and reliability of CVS systems.

     
    more » « less
  3. Driver assist features such as adaptive cruise control (ACC) and highway assistants are becoming increasingly prevalent on commercially available vehicles. These systems are typically designed for safety and rider comfort. However, these systems are often not designed with the quality of the overall traffic flow in mind. For such a system to be beneficial to the traffic flow, it must be string stable and minimize the inter-vehicle spacing to maximize throughput, while still being safe. We propose a methodology to select autonomous driving system parameters that are both safe and string stable using the existing control framework already implemented on commercially available ACC vehicles. Optimal parameter values are selected via model-based optimization for an example highway assistant controller with path planning. 
    more » « less
  4. The objective of this research is to enable safety‐critical systems to simultaneously learn and execute optimal control policies in a safe manner to achieve complex autonomy. Learning optimal policies via trial and error, that is, traditional reinforcement learning, is difficult to implement in safety‐critical systems, particularly when task restarts are unavailable. Safe model‐based reinforcement learning techniques based on a barrier transformation have recently been developed to address this problem. However, these methods rely on full‐state feedback, limiting their usability in a real‐world environment. In this work, an output‐feedback safe model‐based reinforcement learning technique based on a novel barrier‐aware dynamic state estimator has been designed to address this issue. The developed approach facilitates simultaneous learning and execution of safe control policies for safety‐critical linear systems. Simulation results indicate that barrier transformation is an effective approach to achieve online reinforcement learning in safety‐critical systems using output feedback. 
    more » « less
  5. Driver-assistance systems are becoming more commonplace; however, the realized safety benefits of these technologies depend on whether a person accepts and adopts automated driving aids. One challenge to adoption could be a preference-performance dissociation (PPD), which is a mismatch between a self-perceived desire and an objective need for assistance. Research has reported PPD in driving but has not extensively leveraged driving performance data to confirm its existence. Thus, the goal of this study was to compare drivers’ self-reported need for vehicle assistance to their objective driving performance. Twenty-one participants drove on a simulated road and traversed challenging, real-world roadway obstacles. Afterwards, they were asked about their preference for automated vehicle assistance (e.g., steering and braking) during their drive. Overall, some participants exhibited PPD that included both over- and underestimating their need for a particular type of automated assistance. Findings can be used to develop shared control and adaptive automation strategies tailored to particular users and contexts across various safety-critical environments.

     
    more » « less