skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Full Vehicle Trajectory Planning Model for Urban Traffic Control Based on Imitation Learning
Connected and automated vehicles (CAVs) extend urban traffic control from temporal to spatiotemporal by enabling the control of CAV trajectories. Most of the existing studies on CAV trajectory planning only consider longitudinal behaviors (i.e., in-lane driving), or assume that the lane changing can be done instantaneously. The resultant CAV trajectories are not realistic and cannot be executed at the vehicle level. The aim of this paper is to propose a full trajectory planning model that considers both in-lane driving and lane changing maneuvers. The trajectory generation problem is modeled as an optimization problem and the cost function considers multiple driving features including safety, efficiency, and comfort. Ten features are selected in the cost function to capture both in-lane driving and lane changing behaviors. One major challenge in generating a trajectory that reflects certain driving policies is to balance the weights of different features in the cost function. To address this challenge, it is proposed to optimize the weights of the cost function by imitation learning. Maximum entropy inverse reinforcement learning is applied to obtain the optimal weight for each feature and then CAV trajectories are generated with the learned weights. Experiments using the Next Generation Simulation (NGSIM) dataset show that the generated trajectory is very close to the original trajectory with regard to the Euclidean distance displacement, with a mean average error of less than 1 m. Meanwhile, the generated trajectories can maintain safety gaps with surrounding vehicles and have comparable fuel consumption.  more » « less
Award ID(s):
2038215
PAR ID:
10379496
Author(s) / Creator(s):
 ;  
Publisher / Repository:
SAGE Publications
Date Published:
Journal Name:
Transportation Research Record: Journal of the Transportation Research Board
Volume:
2676
Issue:
7
ISSN:
0361-1981
Format(s):
Medium: X Size: p. 186-198
Size(s):
p. 186-198
Sponsoring Org:
National Science Foundation
More Like this
  1. Preceding vehicles typically dominate the movement of following vehicles in traffic systems, thereby significantly influencing the efficacy of eco-driving control that concentrates on vehicle speed optimization. To potentially mitigate the negative effect of preceding vehicles on eco-driving control at the signalized intersection, this study proposes an overtaking-enabled eco-approach control (OEAC) strategy. It combines driving lane planning and speed optimization for connected and automated vehicles to relax the first-in-first-out queuing policy at the signalized intersection, minimizing the host vehicle’s energy consumption and travel delay. The OEAC adopts a two-stage receding horizon control framework to derive optimal driving trajectories for adapting to dynamic traffic conditions. In the first stage, the driving lane optimization problem is formulated as a Markov decision process and solved using dynamic programming, which takes into account the uncertain disturbance from preceding vehicles. In the second stage, the vehicle’s speed trajectory with the minimal driving cost is optimized rapidly using Pontryagin’s minimum principle to obtain the closed-form analytical optimal solution. Extensive simulations are conducted to evaluate the effectiveness of the OEAC. The results show that the OEAC is excellent in driving cost reduction over constant speed and regular eco-approach and departure strategies in various traffic scenarios, with an average improvement of 20.91% and 5.62%, respectively. 
    more » « less
  2. Stop-and-go traffic poses significant challenges to the efficiency and safety of traffic operations, and its impacts and working mechanism have attracted much attention. Recent studies have shown that Connected and Automated Vehicles (CAVs) with carefully designed longitudinal control have the potential to dampen the stop-and-go wave based on simulated vehicle trajectories. In this study, Deep Reinforcement Learning (DRL) is adopted to control the longitudinal behavior of CAVs and real-world vehicle trajectory data is utilized to train the DRL controller. It considers a Human-Driven (HD) vehicle tailed by a CAV, which are then followed by a platoon of HD vehicles. Such an experimental design is to test how the CAV can help to dampen the stop-and-go wave generated by the lead HD vehicle and contribute to smoothing the following HD vehicles’ speed profiles. The DRL control is trained using real-world vehicle trajectories, and eventually evaluated using SUMO simulation. The results show that the DRL control decreases the speed oscillation of the CAV by 54% and 8%-28% for those following HD vehicles. Significant fuel consumption savings are also observed. Additionally, the results suggest that CAVs may act as a traffic stabilizer if they choose to behave slightly altruistically. 
    more » « less
  3. null (Ed.)
    Stop-and-go traffic poses significant challenges to the efficiency and safety of traffic operations, and its impacts and working mechanism have attracted much attention. Recent studies have shown that Connected and Automated Vehicles (CAVs) with carefully designed longitudinal control have the potential to dampen the stop-and-go wave based on simulated vehicle trajectories. In this study, Deep Reinforcement Learning (DRL) is adopted to control the longitudinal behavior of CAVs and real-world vehicle trajectory data is utilized to train the DRL controller. It considers a Human-Driven (HD) vehicle tailed by a CAV, which are then followed by a platoon of HD vehicles. Such an experimental design is to test how the CAV can help to dampen the stop-and-go wave generated by the lead HD vehicle and contribute to smoothing the following HD vehicles’ speed profiles. The DRL control is trained using realworld vehicle trajectories, and eventually evaluated using SUMO simulation. The results show that the DRL control decreases the speed oscillation of the CAV by 54% and 8%-28% for those following HD vehicles. Significant fuel consumption savings are also observed. Additionally, the results suggest that CAVs may act as a traffic stabilizer if they choose to behave slightly altruistically. 
    more » « less
  4. Abstract This article focuses on the development of distributed robust model predictive control (MPC) methods for multiple connected and automated vehicles (CAVs) to ensure their safe operation in the presence of uncertainty. The proposed layered control framework includes reference trajectory generation, distributionally robust obstacle occupancy set computation, distributed state constraint set evaluation, data-driven linear model representation, and robust tube-based MPC design. To enable distributed operation among the CAVs, we present a method, which exploits sampling-based reference trajectory generation and distributed constraint set evaluation methods, that decouples the coupled collision avoidance constraint among the CAVs. This is followed by data-driven linear model representation of the nonlinear system to evaluate the convex equivalent of the nonlinear control problem. Finally, to ensure safe operation in the presence of uncertainty, this article employs a robust tube-based MPC method. For a multiple CAV lane change problem, simulation results show the efficacy of the proposed controller in terms of computational efficiency and the ability to generate safe and smooth CAV trajectories in a distributed fashion. 
    more » « less
  5. Exclusive bus lane strategy is widely adopted in many cities to improve bus operation effciency and reliability. With the development of connected vehicle technologies, the dynamic bus lane (DBL) strategy was proposed, with allowing general vehicles to share use of the bus lane to improve traffc effciency in general purpose lanes (GPLs). Previous studies have rarely considered the eco-driving strategy of connected and automated vehicles/buses (CAVs/CABs) in GPLs under the mixed traffc conditions, and how to ensure bus priority with DBL control. In this study, a novel DBL control strategy was developed under the partially connected vehicle environment. A trajectory planning method while considering the joint effects of bus stop and signal phase for CAB was adopted, an eco-driving strategy for CAVs in GPL was proposed using a trigonometry trajectory planning method. And a novel DBL control method was established by integrated trajectory planning for both the CAVs and CABs to ensure bus operation priority. Numerical experiments were conducted to evaluate performance of the proposed novel DBL control in terms of travel time and energy consumption of general vehicles at the different levels of CAV market penetration rates (MPRs). Results indicated that about 16%-42% energy savings can be achieved with MPR varying from 20% to 100%, and the travel time can be improved by about 4%-10%. Meanwhile, sensitivity analysis was conducted to quantify the impacts of key parameters, including vehicle target speeds, heterogeneous traffc fow, random arrival interval of cars, position of bus stop, traffc volume in GPL 
    more » « less