<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Trust-based route planning for automated vehicles</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>05/19/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10216656</idno>
					<idno type="doi">10.1145/3450267.3450529</idno>
					<title level='j'>ICCPS '21: Proceedings of the ACM/IEEE 12th International Conference on Cyber-Physical Systems</title>
<idno></idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Shili Sheng</author><author>Erfan Pakdamanian</author><author>Kyungtae Han</author><author>Ziran Wang</author><author>John Lenneman</author><author>Lu Feng</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Several recent works consider the personalized route planning based on user profiles, none of which accounts for human trust. We argue that human trust is an important factor to consider when planning routes for automated vehicles. This paper presents the first trust-based route planning approach for automated vehicles. We formalize the human-vehicle interaction as a partially observable Markov decision process (POMDP) and model trust as a partially observable state variable of the POMDP, representing human's hidden mental state. We designed and conducted an online user study with 100 participants on the Amazon Mechanical Turk platform to collect data of users' trust in automated vehicles. We build datadriven models of trust dynamics and takeover decisions, which are incorporated in the POMDP framework. We compute optimal routes for automated vehicles by solving optimal policies in the POMDP planning. We evaluated the resulting routes via human subject experiments with 22 participants on a driving simulator. The experimental results show that participants taking the trust-based route generally resulted in higher cumulative POMDP rewards and reported more positive responses in the after-driving survey than those taking the baseline trust-free route.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>Recent years have witnessed significant advances in the development of automated vehicle, which have already been tested over millions of miles on public roads <ref type="bibr">[4]</ref>. However, fully autonomous vehicles that do not require human intervention are still decades away due to technology, infrastructure, and regulation limitations <ref type="bibr">[18]</ref>. The majority of automated vehicles available to the general public nowadays are Level 2 and Level 3 of automation <ref type="bibr">[12]</ref>, which allow the driver to turn attention away from the primary task of driving; but the driver must still be prepared to take over control of the vehicle when necessary. Human's decision on whether or not to rely on the automation is guided by trust. Prior studies have found that distrust is a main barrier to adoption of automated vehicles <ref type="bibr">[27]</ref>; in addition, users with lower trust levels take over control of the vehicle more frequently <ref type="bibr">[28]</ref>. On the other hand, overtrust in automation can lead to catastrophic outcomes (e.g., fatal Tesla autopilot crashes <ref type="bibr">[3]</ref>). Therefore, in order to improve safety and user experience, there is a need for taking into account human trust in the system design of automated vehicles.</p><p>In this paper, we consider the design of route planning system for the navigation of automated vehicles. Existing route planning methods (e.g., <ref type="bibr">[7,</ref><ref type="bibr">19,</ref><ref type="bibr">26]</ref>) mostly focus on computing routes that optimize metrics such as distance, travel time, and fuel consumption. Several recent works (e.g., <ref type="bibr">[9,</ref><ref type="bibr">13,</ref><ref type="bibr">41]</ref>) consider the personalized route recommendation based on user profiles (e.g., mobility options, frequently visited places). However, none of the existing route planning methods explicitly account for human trust. We argue that human trust is an important factor to consider when planning routes for automated vehicles. For example, if the driver has lower trust in the automated vehicle's capability for safely navigating urban streets with pedestrians constantly crossing as opposed to freeways, the driver may prefer a freeway despite longer distance.</p><p>To the best of our knowledge, this paper presents the first work of trust-based route planning for automated vehicles. There are several challenges in developing this work. First, how to measure and model human trust in automation, which is a hidden mental state influenced by many factors and changes over time <ref type="bibr">[38]</ref>. Second, how to incorporate the trust model into the route planning while accounting for the human-vehicle interaction (e.g., takeover decisions). Finally, how to evaluate the proposed trust-based route planning approach. In the following, we provide an overview of how we address these challenges in this work.</p><p>We follow the notion of trust in automation defined in <ref type="bibr">[34]</ref>, which views human trust as delegation of responsibility for actions to the automation and willingness to accept risk (possible harm), while the decision to delegate is based on a subjective evaluation of the automation's capability for a particular task. To concretize the problem, we consider a motivating example where the automated vehicle may encounter three types of typical road incidents (i.e., pedestrian, obstacle, and oncoming truck). Trust is therefore affected by human's takeover decision and the vehicle's capability of handling an incident. We adopt the commonly used method of measuring the subjective belief of trust via user questionnaires. Specifically, we designed and conducted an online user study with 100 participants on the Amazon Mechanical Turk platform. We asked users to watch various driving videos recorded in the driver's view and answer questions about their trust in the automated vehicle's capability of safely handling the incident shown in the video in a 7-point Likert scale, as well as whether they would like to take over control of the vehicle imaging that they were the driver sitting inside the automated vehicle. We model the evolution of trust dynamics (i.e., how trust changes over time) as a linear Gaussian system using the data collected from the online user study. We also build data-driven models to predict human's takeover decisions.</p><p>We formalize the human-vehicle interaction as a partially observable Markov decision process (POMDP), which is a general modeling framework for planning under uncertainty <ref type="bibr">[24]</ref>. We model trust as a partially observable state variable of the POMDP, representing human's hidden mental state. In addition, there are three observable state variables representing the vehicle position, incident type, and the success/failure of the vehicle handling an incident. The estimated trust dynamics model informs the probabilistic transition function of the trust variable in the POMDP. There are two actions: human's takeover decision and the vehicle's route choice. Since the vehicle does not know about human's actual takeover decision in advance, it assumes that human follows the data-driven takeover decision models estimated using the online user study data. The goal of POMDP planning is to compute an optimal policy that makes route choices to maximize the expectation of the cumulative reward, with a reward function designed to promote better safety and user experience of automated vehicles.</p><p>We applied the proposed trust-based route planning approach to the motivating example and obtained two routes: a trust-based route where human makes takeover decisions based on trust dynamics and incidents, and a trust-free route (as a baseline for comparison) where human's takeover decisions only depend on incidents. We evaluated and compared the performance of these two routes via human subject experiments on a driving simulator. We conducted experiments with 22 participants, who were randomly assigned to two equal-sized groups for the between-subject study (each group has 11 participants, who took one of the two routes). The experimental results show that participants taking the trust-based route generally resulted in higher cumulative POMDP rewards and reported more positive responses in the after-driving survey than those taking the trust-free route.</p><p>Contributions. We summarize the major contributions of this work as follows.</p><p>&#8226; We developed the first trust-based route planning approach for automated vehicles, which is based on a POMDP framework and uses data-driven models of trust dynamics and takeover decisions. &#8226; We designed and conducted an online user study with 100 participants on the Amazon Mechanical Turk platform to collect data about users' trust in automated driving. &#8226; We designed and conducted human subject experiments with 22 participants on a driving simulator to evaluate the proposed approach, which showed encouraging results.</p><p>Paper organization. The rest of the paper is organized as follows.</p><p>We discuss the related work in Section 2, describe the motivating example in Section 3, present the trust-based route planning approach in Section 4, describe the driving simulator experiments in Section 5, and draw conclusions in Section 6.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">RELATED WORK</head><p>In this section, we survey the related work in two topics: (1) route planning for vehicles, and (2) trust in automation. For each topic, we identify gaps in the state-of-the-art and discuss the connection with this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Route Planning for Vehicles</head><p>The goal of route planning is to compute the optimal routes for vehicles. The most commonly used metrics include distance, travel time, and fuel consumption. Graph search algorithms such as Dijkstra's algorithm <ref type="bibr">[14]</ref> and &#119860;* algorithm <ref type="bibr">[21]</ref> can be applied to find the shortest distance path between any two locations. Computing the fastest route (i.e., with the least travel time) is more challenging than finding the shortest distance route. Kanoulas et al. <ref type="bibr">[26]</ref> extended &#119860;* algorithm by considering the speed change at different time of the day to compute the fastest route. Gonzalez et al. <ref type="bibr">[19]</ref> developed an adaptive fastest route planning method based on information learned from the historical traffic data, accounting for various factors (e.g., road quality, weather condition, area crime rate) that may influence vehicle speed patterns. Andersen et al. <ref type="bibr">[7]</ref> proposed to find the most eco-friendly route by assigning eco-weights based on GPS and fuel consumption data. There are several recent studies considering personalized route recommendation for various users. Campigotto et al. <ref type="bibr">[9]</ref> developed a method for the personalized route planning by using Bayesian learning to update users' profile such as home location, work place, and mobility options. Dai et al. <ref type="bibr">[13]</ref> recommended a personalized optimal route considering user preferences encoded as a ratio between different metrics such as distance, travel time, and fuel consumption. Zhu et al. <ref type="bibr">[41]</ref> proposed a personalized and time-sensitive route planning method, in which they inferred users' preferences with locations and visiting time through historical data.</p><p>None of the aforementioned route planning methods considers human trust. In this paper, we aim to fill this gap by developing a trust-based route planning approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Trust in Automation</head><p>Trust in the context of human-technology relationships can be roughly classified into three categories: (1) credentials-based, which is used mainly in security and determines if a user can be trusted based on a set of credentials <ref type="bibr">[25]</ref>; (2) experience-based, which includes reputation-based trust in peer-to-peer and e-commerce applications, determines an agent's trust value based on its own experience in predicting the probability of the execution of a certain action by another agent <ref type="bibr">[30]</ref>; and (3) cognitive trust, which explicitly account for not only the human experience, but also subjective judgment about preferences and mental states <ref type="bibr">[17]</ref>. In this paper, we are interested in human's trust in automated vehicles, and therefore consider cognitive trust that captures the human notion of trust. Specifically, we follow the notion of trust in automation proposed in <ref type="bibr">[34]</ref>, which indicates human's willingness to rely on automation.</p><p>Studies have found that human trust changes over time during the interaction with automation, affected by various factors such as the automation's reliability, predictability, and transparency <ref type="bibr">[20,</ref><ref type="bibr">38]</ref>. Studies have also shown that trust can influence human's reliance on automation and the system is likely to be under-utilized if human mistrust the automation <ref type="bibr">[16]</ref>. For example, a recent study found that users with lower trust tended to take over control from automated vehicles more frequently <ref type="bibr">[28]</ref>. Inspired by insights drawing from these prior studies, we develop a data-driven trust dynamics model to represent the evolution of human trust in automated vehicles, and a takeover decision model to associate the likelihood of human's takeover decision with trust.</p><p>Different method to measure trust have been proposed. User questionnaires are commonly used to evaluate the subjective belief of trust <ref type="bibr">[36,</ref><ref type="bibr">40]</ref>. For example, the study in <ref type="bibr">[11]</ref> asked questions about users' trust in automated vehicles in a 7-point Likert scale. In addition, various sensing technologies have been used for the continuous measurement of human trust in real-time, including gaze tracking <ref type="bibr">[22]</ref>, gestures (e.g., face touching and arms crossed) <ref type="bibr">[35]</ref>, and biometrics (e.g., electroencephalogram and galvanic skin response) <ref type="bibr">[23]</ref>. We measure human trust in a 7-point Likert scale via questionnaires in the online user study, and via continuous user control input (i.e., pressing buttons mounted on the steering wheel) in the driving simulator study.</p><p>Existing works about trust in automated vehicles include investigating factors that influence users' adoption of automated vehicles <ref type="bibr">[32,</ref><ref type="bibr">33,</ref><ref type="bibr">39]</ref>, studying the effect of alarm timing on driver's trust <ref type="bibr">[5]</ref>, designing forward collision warning system <ref type="bibr">[29]</ref> and cruise control system <ref type="bibr">[8]</ref> to improve users' trust. By contrast, this paper develops a route planning approach that accounts for trust to improve user experience of automated vehicles.</p><p>Several recent works have explored the idea of modeling trust with POMDPs. For example, a POMDP model for trust-workload dynamics in Level 2 driving automation was developed in <ref type="bibr">[6]</ref>, and a POMDP-based method for human-robot collaboration in table cleaning tasks was proposed in <ref type="bibr">[10]</ref>. Our work is inspired by these methods. We focus on trust-based route planning for automated vehicles, which requires different POMDP modeling. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">MOTIVATING EXAMPLE</head><p>We describe a motivating example of route planning for automated vehicles. Figure <ref type="figure">1</ref> shows an example map, where three types of typical incidents that may occur on the road are considered: (1) a pedestrian crossing the road, (2) an obstacle ahead of the lane, and (3) an oncoming truck in the neighboring lane. We can easily generalize to more complex examples with a richer set of incidents. For simplicity, we assume that each road segment may have up to one incident at a time. We also assume that the vehicle has information about the potential incident that it may encounter in the next road segment. Such information can be easily obtained, for example, via sensing and crowd sourcing traffic monitoring apps.</p><p>Figure <ref type="figure">2</ref> shows a schematic view of the automated vehicle traveling from one location to another. Suppose that the vehicle is approaching an incident in the autopilot mode. Due to safety concerns, the driver may decide to take over control of the vehicle and switch to manual driving. Such takeover decisions can be influenced by the driver's trust in the automated vehicle's capability of handling different types of incidents: the driver with lower trust is more likely to take over. In addition, the driver's trust evolves over time depending on the takeover decision and the vehicle's capability of handling an incident.</p><p>The goal of this work is to develop a trust-based route planning approach that computes an optimal route for the automated vehicle (e.g., navigating from A to K in the example map) while taking into account human trust dynamics and takeover decisions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">TRUST-BASED ROUTE PLANNING</head><p>We present a trust-based route planning approach for automated vehicles. The key idea is to model the human-vehicle interaction as a POMDP and compute the optimal vehicle route by solving the optimal policy of POMDP planning.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">The Proposed POMDP Framework</head><p>Formally, a POMDP is denoted as a tuple (&#119878;, &#119860;, T , &#119877;, &#119874;, &#120575;, &#120574;), where &#119878; is a finite of state, &#119860; is a set of actions, T is the transition function representing conditional transition probabilities between states, &#119877; : &#119878; &#215; &#119860; &#8594; R is the real-valued reward function, &#119874; is a set of observations, &#120575; is the observation function representing the conditional probabilities of observations given states and actions, Figure <ref type="figure">2</ref>: A schematic view of an automated vehicle navigating from one location to another. When approaching an incident, the driver may decide to take over and switch to manual driving. The takeover decision can be influenced by the driver's trust in the automated vehicle, which evolves over time. As discussed in Section 2, trust in automation can be influenced by many factors. Here, we model the evolution of trust dynamics with a probabilistic transition function T (&#119906; &#119905; +1 |&#119906; &#119905; , &#119910; &#119905; , &#119894; &#119905; , &#8462; &#119905; ), based on a simplified assumption that trust evolves depending on the takeover decision and the vehicle's capability of handling an incident. The intuition is that trust may increase when the human chooses to not take over and witnesses the automated vehicle successfully handling an incident, and the trust may decrease if the automated vehicle fails to handle an incident.</p><p>The vehicle agent does not know about human's actual takeover action in advance, and it computes the optimal POMDP policy &#120587; * of route choices &#119888; &#119905; based on a model that predicts human's takeover decision &#8462; &#119905; . We consider two different takeover decision models for comparison: (1) trust-free model, denoted by &#120587; &#8462; (&#8462; &#119905; |&#119894; &#119905; , &#119910; &#119905; ), where human decides whether to takeover depending on the incident and a fixed belief on the automated vehicle's capability to handle certain types of incidents; and (2) trust-based model, denoted by &#120587; &#8462; (&#8462; &#119905; |&#119894; &#119905; , &#119910; &#119905; , &#119906; &#119905; ), where human make takeover decisions based on the incident and trust, indicating that human's belief on the automated vehicle's capability changes over time depending on the trust dynamics.</p><p>Considering the motivating example described in Section 3. The vehicle position &#119907; &#119905; is one of the locations {&#119860;, &#119861;, . . . , &#119870; } shown in the map (Figure <ref type="figure">1</ref>). The incident &#119894; &#119905; can take one of the four values: null, pedestrian, obstacle, and truck. The vehicle's capability &#119910; &#119905; of handling incidents has binary outcomes: success, and failure. Since human's trust is a partially observable variable &#119906; &#119905; representing the hidden mental state, we use an observation variable &#251;&#119905; to represent the subjective trust in a 7-point Likert scale (1 and 7 indicate the lowest and highest levels of trust, respectively) measured via user questionnaires. The available route choices &#119888; &#119905; are given by the map. For example, in location &#119860;, the vehicle may choose one of the three routes colored in yellow, red and green to navigate to &#119861;, &#119862; and &#119863;, respectively. The human takeover decision &#8462; &#119905; is a binary choice of whether or not to take over control of the vehicle and resume manual driving. We can define the transition functions T (&#119907; &#119905; +1 |&#119907; &#119905; , &#119888; &#119905; ) and T (&#119894; &#119905; |&#119907; &#119905; ) based on the map. We can estimate T (&#119910; &#119905; |&#119894; &#119905; ) based on the historical testing logs of the automated vehicle safely handling incidents. For the motivating example, we assume that the automated vehicle can always safely handle incidents (but the human driver has no prior knowledge about this assumption).</p><p>We design a reward function shown in Table <ref type="table">1</ref> for the motivating example. Intuitively, we want to reward for better safety and user experience of automated vehicles. If the automated vehicle handles an incident successfully, we assign positive rewards based on the difficulty of driving tasks. When approaching a pedestrian incident, the automated vehicle needs to stop before the crosswalk and wait  until the pedestrian crossing the road. When approaching an obstacle incident, the automated vehicle needs to perform lane changing in order to avoid collision with the obstacle. When there is an oncoming truck in the neighboring lane, the automated vehicle needs to keep driving in the same lane. Thus, we rank the pedestrian incident as the most difficult task and assign the highest reward value of 3, followed by the obstacle incident with a reward value of 2 and the truck incident with a reward value of 1. On the other hand, if the automated vehicle fails to handle an incident safely, we assign rewards based on the severity of incidents (e.g., striking a pedestrian can cause more serious damages than colliding with an obstacle). We assign zero reward to manual driving, because we want to promote better user experience and let the driver to enjoy non-driving tasks (e.g., reading or using mobile devices) in the automated vehicle. In addition, we assign a reward value of 5 to empty road (i.e., no incident thus no failure or takeover) to indicate this as the most favorable choice.</p><p>For the rest of this section, we describe the design of an online user study for data collection in Section 4.2; we present a datadriven method to estimate trust dynamics T (&#119906; &#119905; +1 |&#119906; &#119905; , &#119910; &#119905; , &#119894; &#119905; , &#8462; &#119905; ) and the observation function &#120575; ( &#251;&#119905; |&#119906; &#119905; ) in Section 4.3; we describe the data-driven modeling of trust-free takeover decision &#120587; &#8462; (&#8462; &#119905; |&#119894; &#119905; , &#119910; &#119905; ) and the trust-based takeover decision &#120587; &#8462; (&#8462; &#119905; |&#119894; &#119905; , &#119910; &#119905; , &#119906; &#119905; ) in Section 4.4; and finally, we apply the proposed approach to the motivating example and present the computed optimal routes in Section 4.5.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Online User Study for Data Collection</head><p>We designed and conducted an online user study<ref type="foot">foot_0</ref> with 100 anonymous participants on the Amazon Mechanical Turk platform. The objective of this study is to collect data about human's trust on automated vehicles. In particular, we investigated how trust evolves with respect to different incidents on the road and how human's takeover decisions are affected by incidents and trust. We created a set of driving videos using the PreScan driving simulation software [1]. Figure <ref type="figure">4</ref> shows the screenshots of example videos covering three types of incidents (i.e., pedestrian, obstacle, and oncoming truck) used in the motivating example.</p><p>During the online user study, we first established the baseline by asking participants about their trust in automated vehicles in a 7-point Likert scale (i.e., trust ranges from 1 to 7). Then, we showed a video of the automated vehicle approaching an incident on the road from the driver's view, and asked participants if they would like to takeover control of the vehicle and switch to manual driving, imaging that they were the driver siting inside the automated vehicle. Depending on the participant's response of takeover or not, we showed the next video of either the vehicle is driven autonomously or manually to handle the incident. After that, we asked participants to fill in a questionnaire which estimates their updated trust in automated vehicle. We adapted the Muir's questionnaire <ref type="bibr">[37]</ref> and asked participants to answer the following questions in 7-point Likert scale:</p><p>(1) To what extent can you predict the automated vehicle's behavior from moment to moment? (2) To what extent can you count on the automated vehicle to do its job? (3) What degree of faith do you have that the automated vehicle will be able to cope with similar incidents in the future? (4) Overall how much do you trust the automated vehicle?</p><p>We averaged a participant's responses to these four questions into a single rating between 1 and 7 to represent the participant's updated trust. We repeated the above process nine times (three times per incident type) with a randomized order of incidents.</p><p>We did not include any vehicle crash video in this study, because we assume that the automated vehicle is capable of handling all incidents safely. For example, the vehicle would automatically stop and wait for the pedestrian to cross the lane, or change the lane to avoid the obstacle. However, participants are not aware of such information in advance. They make takeover decisions based on their trust beliefs about the automated vehicle's capability to safely handle certain incident, and the trust levels may change based on their experience of watching prior incident videos.</p><p>The data we collected from each participant has the following format: D = { &#251;0 , &#119894; 0 , &#8462; 0 , &#251;1 , . . . , &#119894; 8 , &#8462; 8 , &#251;9 }, where &#251;&#119905; is the measured user trust, &#119894; &#119905; is the incident type, &#8462; &#119905; is the user decision of takeover or not, at each time step &#119905;. In order to guarantee the data quality, our study recruitment criteria required that participants must be able to read English fluently and have an above 95% approval rate on the Amazon Mechanical Turk platform. We also inserted questions for attention checks during the user study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Data-Driven Trust Dynamics Model</head><p>As described in Section 4.1, the proposed POMDP framework for trust-based route planning represents human trust as a partially observable variable &#119906; &#119905; at time step &#119905;, which evolves to &#119906; &#119905; +1 over time depending on human's takeover decision &#8462; &#119905; and the automated vehicle's capability &#119910; &#119905; to handle incident &#119894; &#119905; . Using the data collected from the online user study described in Section 4.2, we model the trust dynamics and the POMDP observation function as a linear Gaussian system: where N (&#120583;, &#120590; 2 ) represents the Gaussian distribution with the mean &#120583; and the variance &#120590;; &#120572; &#119905; and &#120573; &#119905; are linear coefficients of trust dynamics given &#119910; &#119905; , &#119894; &#119905; and &#8462; &#119905; ; and &#251;&#119905; represents the observations of trust measured via subjective questionnaires in the online user study. We estimate these parameter values using full Bayesian inference with Hamiltonian Monte Carlo sampling algorithm <ref type="bibr">[15]</ref>. Figure <ref type="figure">5</ref> illustrates a visualization of the learned trust dynamics model. There are six probabilistic transition matrices, corresponding to all combinations of three road incidents and binary human takeover decisions. Each transition matrix indicates the probability of changing from &#119906; &#119905; (trust before value) to &#119906; &#119905; +1 (trust after value). We observe that trust values are more likely to increase when human decides to not take over (top row of Figure <ref type="figure">5</ref>), while trust values tend to be constant or decrease when there is a takeover decision (bottom row of Figure <ref type="figure">5</ref>). These observations are consistent with the insight from the prior studies (see Section 2) that takeover decisions are often correlated to trust.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4">Data-Driven Takeover Decision Models</head><p>In the POMDP framework, we use the variable &#8462; &#119905; to denote human's takeover decisions (i.e., whether or not to take over control of the vehicle) when approaching an incident &#119894; &#119905; at time step &#119905;. Such takeover decisions may also be influenced by human trust &#119906; &#119905; . In the following, we present two takeover decision models based on whether or not to consider trust as an influencing factor. Trust-free takeover decision model. Let &#119887; &#119894; denote human's belief on the automated vehicle's capability of safely handling an incident &#119894;, which remains constant in the trust-free model. Let &#119901; &#119905; denote the probability of human deciding to not take over at time step &#119905;. We define &#119901; &#119905; = S(&#119887; &#119894; &#119903; s,&#119894; + (1 -&#119887; &#119894; )&#119903; f,&#119894; ), where S(&#119909;) = 1 1+&#119890; -&#119909; is the sigmoid function, &#119903; s,&#119894; and &#119903; f,&#119894; are rewards of the automated vehicle handling the incident &#119894; with success and failure (see Table <ref type="table">1</ref>), respectively. We model the takeover decision with a Bernoulli distribution, denoted by &#8462; &#119905; &#8764; B (&#119901; &#119905; ). Trust-based takeover decision model. Let &#119887; &#119894; &#119905; denote human's belief on the automated vehicle's capability of safely handling an incident &#119894; at time step &#119905;, which evolves over time depending on the human trust &#119906; &#119905; . Thus, we model the belief as a sigmoid function &#119887; &#119894; &#119905; = S(&#120581; &#119894; &#119906; &#119905; + &#120582; &#119894; ), where &#120581; &#119894; and &#120582; &#119894; are linear coefficients associated with the incident &#119894;. We assume that the human trust &#119906; &#119905; follows a Gaussian distribution, denoted by &#251;&#119905; &#8764; N (&#119906; &#119905; , &#120590;<ref type="foot">foot_1</ref> &#119906; ) where &#251;&#119905; are the measured trust values from the online user study. We define the probability of human deciding to not takeover as &#119901; &#119905; = S(&#119887; &#119894; &#119905; &#119903; s,&#119894; + (1 -&#119887; &#119894; &#119905; )&#119903; f,&#119894; ), which is defined similarly to the trust-free model, but using the dynamic belief &#119887; &#119894; &#119905; instead of the constant &#119887; &#119894; . Finally, the takeover decision is given by the Bernoulli distribution &#8462; &#119905; &#8764; B (&#119901; &#119905; ). Data-driven modeling results. We applied the full Bayesian inference with Hamiltonian Monte Carlo sampling algorithm <ref type="bibr">[15]</ref> to estimate parameters in both the trust-free and trust-based models, using the data collected from the online user study. The results of log-likelihood show that the trust-based model (-359.37) fits better to the collected data as opposed to trust-free model (-446.83). The difference in log-likelihood results shows that accounting for trust in the takeover decision model can achieve better prediction performance, which supports our assumption that human takeover decisions is influenced by trust. Figure <ref type="figure">6</ref> shows model predictions of takeover probability with respect to trust and incidents. With the trust-free model, since the takeover decision does not depend on human trust, we observe three straight lines for three incidents. With the trust-based model, we observe the general trends of decreasing takeover likelihood with increasing trust, which are consistent with findings in the prior studies (see Section 2). Furthermore, we observe from the results of both models that it is more likely for human to decide to take over with riskier incidents: pedestrian with the highest takeover probability, followed by obstacle and truck.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.5">Planning for the Motivating Example</head><p>We applied the Approximate POMDP Planning (APPL) Toolkit [2], which is an implementation of the point-based SARSOP algorithm for efficient POMDP planning <ref type="bibr">[31]</ref>, to compute the optimal policies of the proposed POMDP framework. For the motivating example, depending on the use of trust-based and trust-free takeover decision models, we obtained two optimal routes:</p><p>Note that the main difference between these two routes is the order of road incidents. In the trust-based route, the ordered incidents occurring in each road segment are oncoming truck (A-D), null (D-G), obstacle (G-J), and pedestrian (J-K). In the trust-free route, the incidents follows the order of pedestrian (A-C), null (C-E), obstacle (E-H), and oncoming truck (H-K). We evaluate and compare the performance of these two routes via human subject experiments 2 on a driving simulator, as described in the next section.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">DRIVING SIMULATOR EXPERIMENTS</head><p>We describe the design, procedure, and results of our driving simulator experiments as follows.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Experiment Design</head><p>Apparatus. Figure <ref type="figure">7</ref> shows the driving simulator setup used for the experiments. The hardware platform is based on the Force Dynamics 401CR driving simulator, which is a four-axis motion platform that tilts and rotates to simulate the experience of being in a vehicle. The platform includes the seat, interlocked seat belt, interlocked doors, display screen, steering wheel, brake, paddle shifters, and throttle. There are two buttons mounted on the steering wheel (bottom zoomed-in view in Figure <ref type="figure">7</ref>). We programmed the simulator's control input such that the driver can switch between automated and manual driving by pressing the two buttons simultaneously. In addition, we used the same set of buttons to measure participants' trust in automated vehicles during the experiments. The driver can press the left (resp. right) button to decrease (resp. increase) the trust value ranging from 1 to 7. Driving scenario. We created a driving scenario based on the motivating example described in Section 3, using the PreScan driving simulation software <ref type="bibr">[1]</ref>. We also programmed an autopilot controller for the simulated automated vehicle, which has the capability of leveraging the integrated sensors (e.g., radar, Lidar, and GPS) in PreScan for various driving tasks such as lane keeping, detecting and handling incidents. Manipulated factor. We manipulate a single factor: the route that the autopilot controller follows. As stated in Section 4.5, the two conditions are: trust-based route and trust-free route. Dependent measures. We are interested in studying the route which brings more cumulative reward. We recorded the participants' takeover decisions and calculated the cumulative POMDP reward using the reward function defined in Table <ref type="table">1</ref>. Hypothesis. We hypothesize that participants taking the trustbased route can obtain higher cumulative POMDP rewards than those taking the trust-free route. Subject allocation. We recruited 22 participants (average age: 23.7 years, SD=4.3 years, 31.8% female) from the university community. Each participant was compensated with a $20 gift card for completing the experiment. The recruitment criteria required all participants to have a valid driver license, at least one year of driving experience, and normal or corrected-to-normal vision. To avoid participants' bias, we adopted a between-subject study design: we randomly allocated 11 participants to take the trust-based route and the other 11 participants to experience the trust-free route.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Experiment Procedure</head><p>Upon arrival, a participant was instructed to read and sign a consent form approved by the Institutional Review Board. We conducted a five-minute training to help the participant get familiar with the driving simulator setup. Then, the participant was instructed to drive through the trust-based or trust-free route with the simulated automated vehicle, depending on the assigned study group. The journey started in the autopilot mode. When the vehicle approached an incident (i.e., pedestrian, obstacle, or truck), it alerted the participant by issuing an auditory alarm and displaying textual information about the incident type in the GUI. If the participant decided to not takeover, the vehicle would continue in the autopilot mode to handle the incident. The participant can take over control of the vehicle and switch to manual driving at any point during the experiment. If the participant did takeover, he was required to switch back to the autopilot mode after the vehicle passing that incident. We asked the participant to periodically record their trust in the automated vehicle using the buttons on the steering wheel (see bottom left in Figure <ref type="figure">7</ref>). After the driving session, we asked the participant to answer the following survey questions in 7-point Likert scale (1 means strongly disagree, 4 is neural, 7 means strongly agree). Q1 I believe that the automated vehicle can get me to the destination safely. Q2 I find the route easy to drive. Q3 I find it easy to take over control of the automated vehicle.  Q4 I have concern about using the automated vehicle to drive through this route. Q5 I believe that the selected route is not dangerous. Q6 I think the selected route fits well with the way I would like to drive. Q7 I can depend on the reliability of the automated vehicle.</p><p>It took about 40 minutes for each participant to complete the entire experiment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3">Results</head><p>We calculated the cumulative POMDP rewards (using the reward function defined in Table <ref type="table">1</ref>) for each participant, based on their takeover decisions when approaching incidents along the route. Figure <ref type="figure">8</ref> shows the box plot of cumulative rewards of all participants. We observe that participants taking the trust-based route tend to achieve higher cumulative rewards than participants taking the trust-free route, which is consistent with our study hypothesis. We also performed one-way analysis of variance (ANOVA) to evaluate this hypothesis, i.e, comparing the observed &#119865; -test statistics with &#119865; (&#119889; 1 , &#119889; 2 ) (&#119865; -distribution with between-group degree of freedom &#119889; 1 and within-group degree of freedom &#119889; 2 ). The observed statistics &#119865; (1, 20) = 9.14 is greater than the critical value at significance level 0.01. Thus, our study hypothesis is supported by ANOVA results statistically.</p><p>Figure <ref type="figure">9</ref> shows the average takeover likelihood of all participants, for different incidents along the two routes. It is not surprising to find that participants are more likely to take over in the trust-free route than the trust-based route. With both routes, participants have higher probabilities to take over when approaching a pedestrian than an obstacle, while none of them choose to take over the control when there was an oncoming truck in the neighboring lane. A possible explanation is that participants are more likely to take over when approaching incidents that are more challenging to handle or can cause more severe damages. These trends are consistent with the takeover predictions computed using the online user study data (see Figure <ref type="figure">6</ref>).</p><p>Figure <ref type="figure">10</ref> shows how participants' average trust in the automated vehicle evolves as they were driving through different locations along the two routes. For the trust-based route, we observe that the average trust increases in the route segment A-D, this may result from the automated vehicle successfully handling the incident of oncoming truck in this segment. The trust continues to increase in the segment D-G, which is an empty road without any incident. However, the trust decreases in the next segment G-J where the vehicle needs to change lane to avoid an obstacle, and the trust further decreases in the last segment J-K where the vehicle needs to stop and wait for a pedestrian to cross the road. The decreasing of average trust may be explained by the occurring of more challenging and riskier incidents. For the trust-free route, we observe that the average trust drops sharply in the first route segment A-C with an pedestrian incident. However, the trust continues to increase slowly for the rest of the route. The average trust of participants taking the trust-based route is generally higher than taking the trust-free route.</p><p>Figure <ref type="figure">11</ref> summarizes the participants' responses to the afterdriving survey questions. The results of Q1 indicate that participants experienced the trust-based route had higher belief in the automated vehicle's capability of driving safely than participants experienced the trust-free route. The results of Q2 show that participants found the trust-based route easier to drive than the trust-free route. The results of Q3 illustrate that participants driving through the trust-based route found it easier to take over control of the vehicle than those driving through the trust-free route. The results of Q4 show that participants experienced the trust-based route had less concern about the automated vehicle than those experienced the trust-free route. The results of Q5 indicate that participants tended to have a neutral opinion about how dangerous the routes are. The results of Q6 show that participants thought the trust-based route fits to the way they would like to drive better than the trust-free route in general. The results of Q7 find that participants driving through the trust-based route perceived higher reliability of the automated vehicle than those experienced the trust-free route. In summary, our human subject experimental results show that</p><p>&#8226; Participants taking the trust-based route generally resulted in higher cumulative POMDP rewards (where the reward function was designed to promote better safety and user experience of automated vehicles) than those taking the trust-free route. &#8226; Participants were more likely to take over in the trust-free route than in the trust-based route; and riskier incidents led to higher takeover likelihood. &#8226; Participants' trust in the automated vehicle evolved over time during the driving experience and was influenced by different types of incidents. &#8226; Participants experienced the trust-based route had more positive responses in the after-driving survey than those driving through the trust-free route.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">CONCLUSION</head><p>In this paper, we present a trust-based route planning approach for automated vehicles. We model the human-vehicle interaction as a POMDP and compute optimal routes for the vehicle by solving the POMDP planning. In order to incorporate trust into the route planning, we build data-driven models of trust dynamics and takeover decisions using data collected from an online user study with 100 participants on the Amazon Mechanical Turk platform. We applied the proposed trust-based route planning approach to a motivating example and obtained a trust-based route and a trust-free route (as a baseline for comparison). We evaluated these two routes via human subject experiments with 22 participants on a driving simulator.</p><p>The results show that participants taking the trust-based route generally resulted in higher cumulative POMDP rewards (where the reward function was designed to promote better safety and user experience of automated vehicles), were less likely to take over control of the vehicle, and reported more positive responses in the after-driving survey than those taking the trust-free route.</p><p>In addition, we observed that participants' trust changed over time during the study and was influenced by different road incidents. These observations are consistent with the findings of prior studies. This work makes the first step towards incorporating human trust into route planning for automated vehicles. There are a few directions for future work. First, we would like to evaluate the scalability of the proposed approach. We believe that the proposed POMDP-based approach can be applied to larger route planning problems (e.g., larger maps, more locations, and more route choices). However, the bottleneck lies in the evaluation. We will need to design and conduct new human subject experiments to evaluate the resulting routes of each problem, which can be costly and time consuming. Second, we would like to consider a richer set of incident types to reflect the complex road conditions that automated vehicles may encounter in the real-world. We will need to design and conduct new online user studies to collect data about trust in the automated vehicle's capability of safely handling these new incident types and build new data-driven trust dynamics model. Furthermore, we would like to explore the POMDP modeling of other factors that may influence human's trust in automated vehicles, such as the system transparency and predictability.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>This study was approved by the Institutional Review Board at the University of Virginia.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1"><p>This human subject study was approved by the Institutional Review Board at the University of Virginia.</p></note>
		</body>
		</text>
</TEI>
