skip to main content

Attention:

The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 11:00 PM ET on Thursday, October 10 until 2:00 AM ET on Friday, October 11 due to maintenance. We apologize for the inconvenience.


Title: Optimal Weight Adaptation for Model Predictive Control of Connected and Automated Vehicles in Mixed Traffic with Bayesian Optimization
In this paper, we develop an optimal weight adap- tation strategy of model predictive control (MPC) for connected and automated vehicles (CAVs) in mixed traffic. We model the interaction between a CAV and a human-driven vehicle (HDV) as a simultaneous game and formulate a game-theoretic MPC problem to find a Nash equilibrium of the game. In the MPC problem, the weights in the HDV’s objective function can be learned online using moving horizon inverse reinforcement learning. Using Bayesian optimization, we propose a strategy to optimally adapt the weights in the CAV’s objective function so that the expected true cost when using MPC in simulations can be minimized. We validate the effectiveness of the optimal strategy by numerical simulations of a vehicle crossing example at an unsignalized intersection.  more » « less
Award ID(s):
2149520 2219761
NSF-PAR ID:
10421260
Author(s) / Creator(s):
;
Date Published:
Journal Name:
2023 American Control Conference
Page Range / eLocation ID:
1183-1188
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Speed planning in a vehicle-following scenario can reduce vehicle fuel consumption even under limited traffic preview and in moderate penetration of connected autonomous vehicles (CAVs), but could also lead to colder exhaust temperature, and consequently, less efficient aftertreatment conversion. To investigate this potential trade-off, this paper presents a model predictive controller (MPC) to optimally plan in an energy-conscious way the optimal speed trajectory for a diesel car following a hypothetical lead vehicle that drives through the velocity trace of a federal test procedure. Using this energy-conscious optimal speed plan we investigate different horizons for three objective functions, including minimum acceleration, minimum fuel consumption and minimum power. Then, MPC results are compared to the trajectories obtained by dynamic programming with full knowledge of the drive cycle. As expected, longer previews lead to smoother velocity trajectories that reduce the fuel consumption by 11% when power is the objective function, if the preview is accurate. When the minimum fuel is set as the objective in the MPC, the controller coordinates to operate the engine at more efficient conditions, which increases the fuel saving to 25%. However, the extra fuel saving is shown to be achieved at the expense of high vehicle NOx emissions, since the engine operates at low speeds and high loads, where the output NOx emissions are high, when the aftertreatment catalyst is not hot enough. Finally, it is shown that the minimum power formulation leads to a better trade-off, where fuel economy can be increased without a large penalty on NOx emissions. 
    more » « less
  2. From the perspective of practical application, a novel task allocation problem for multi-vehicle systems is proposed. The goal is to allocate an optimal route for each vehicle to execute tasks. The planning result is a comprehensive decision considering the influence of time windows, collaborative tasks, and recharging. This problem is represented as a new extension of the classical vehicle routing problem and a multi-objective integer programming mathematical model is established. The objective functions are the total completion time and total penalty costs. A solution strategy hybridizing non-dominated sorting genetic algorithm-II and variable neighborhood search is proposed, and a feasibility recovery strategy and the concept of the immigrant population are introduced. Finally, the simulation results show that the proposed algorithm can solve the problem effectively and is robust to different complexity scenarios. To illustrate concretely the optimization process, an instance is given in the last.

     
    more » « less
  3. As vehicular communication networks embrace metaverse beyond 5G/6G systems, the rich content update via the least interfered subchannel of the optimal frequency band in a hybrid band vehicle to everything (V2X) setting emerges as a challenging optimization problem. We model this problem as a tradeoff between multi-band VR/AR devices attempting to perform metaverse scenes and environmental updates to metaverse roadside units (MRSUs) while minimizing energy consumption. Due to the computational hardness of this optimization, we formulate an opportunistic band selection problem using a multi-armed bandit (MAB) that provides a good quality solution in real-time without computationally burdening the already stretched augmented/virtual reality (AR/VR) units acting as transmitting nodes. The opportunistic use of scheduling rich content updates at traffic signals and stand-still scenarios maps well with the formulated bandit problem. We propose a Dual-Objective Minimax Optimal Stochastic Strategy (DOMOSS) as a natural solution to this problem. Through extensive computer-based simulations, we demonstrate the effectiveness of our proposal in contrast to baselines and comparable solutions. We also verify the quality of our solution and the convergence of the proposed strategy. 
    more » « less
  4. Abstract

    We consider 3XOR games with perfect commuting operator strategies. Given any 3XOR game, we show existence of a perfect commuting operator strategy for the game can be decided in polynomial time. Previously this problem was not known to be decidable. Our proof leads to a construction, showing a 3XOR game has a perfect commuting operator strategy iff it has a perfect tensor product strategy using a 3 qubit (8 dimensional) GHZ state. This shows that for perfect 3XOR games the advantage of a quantum strategy over a classical strategy (defined by the quantum-classical bias ratio) is bounded. This is in contrast to the general 3XOR case where the optimal quantum strategies can require high dimensional states and there is no bound on the quantum advantage. To prove these results, we first show equivalence between deciding the value of an XOR game and solving an instance of the subgroup membership problem on a class of right angled Coxeter groups. We then show, in a proof that consumes most of this paper, that the instances of this problem corresponding to 3XOR games can be solved in polynomial time.

     
    more » « less
  5. The main objective of this paper is to establish a framework to study the co-adaptation between humans and automation systems in a haptic shared control framework. We specifically used this framework to design control transfer strategies between humans and automation systems to resolve a conflict when co-steering a semi-automated ground vehicle. The proposed framework contains three main parts. First, we defined a modular structure to separate partner-specific strategies from task-dependent representations and use this structure to learn different co-adaption strategies. In this structure, we assume the human and automation steering commands can be determined by optimizing cost functions. For each agent, the costs are defined as a combination of a set of hand-coded features and vectors of weights. The hand-coded features can be selected to describe task-dependent representations. On the other hand, the weight distributions over these features can be used as a proxy to determine the partner-specific conventions. Second, to leverage the learned co-adaptation strategies, we developed a map connecting different strategies to the outputs of human-automation interactions by employing a collaborative-competitive game concept. Finally, using the map, we designed an adaptable automation system capable of co-adapting to human driver’s strategies. Specifically, we designed an episode-based policy search using the deep deterministic policy gradients technique to determine the optimal weights vector distribution of automation’s cost function. The simulation results demonstrate that the handover strategies designed based on co-adaption between human and automation systems can successfully resolve a conflict and improve the performance of the human automation teaming. 
    more » « less