skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Identifying Important Risk Factors Associated with Vehicle Injuries using Driving Behavior Data and Predictive Analytics
Road injuries are rated among the top 10 causes of death by the World Health Organization, and the only one that is not a disease. The total economic cost of motor vehicle crashes in the United States was estimated to be $242 billion a year. This study examines multiple factors of accidents simultaneously with a goal of generating an interpretable model that can predict the occurrence of an accident given road conditions and driver behavior. The study compared 4 machine learning and deep learning modeling techniques on a dataset of 7707 trips collected by the Second Strategic Highway Research Program. A gradient boosted model was found to be most accurate and interpretable in accident prediction. This modeling technique also allows us to rank the feature importance of the factors in the model. The study finds that driver behavior, pre-incident maneuvers and secondary task duration are the most important variables in the predictive model. Using these conclusions will allow us to perform more work to infer these accident causes directly from vehicle sensor data in the future.  more » « less
Award ID(s):
1741306
PAR ID:
10125496
Author(s) / Creator(s):
Date Published:
Journal Name:
IEEE ICHI
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ObjectiveThis study investigated the use of human performance modeling (HPM) approach for prediction of driver behavior and interactions with in-vehicle technology. BackgroundHPM has been applied in numerous human factors domains such as surface transportation as it can quantify and predict human performance; however, there has been no integrated literature review for predicting driver behavior and interactions with in-vehicle technology in terms of the characteristics of methods used and variables explored. MethodA systematic literature review was conducted using Compendex, Web of Science, and Google Scholar. As a result, 100 studies met the inclusion criteria and were reviewed by the authors. Model characteristics and variables were summarized to identify the research gaps and to provide a lookup table to select an appropriate method. ResultsThe findings provided information on how to select an appropriate HPM based on a combination of independent and dependent variables. The review also summarized the characteristics, limitations, applications, modeling tools, and theoretical bases of the major HPMs. ConclusionThe study provided a summary of state-of-the-art on the use of HPM to model driver behavior and use of in-vehicle technology. We provided a table that can assist researchers to find an appropriate modeling approach based on the study independent and dependent variables. ApplicationThe findings of this study can facilitate the use of HPM in surface transportation and reduce the learning time for researchers especially those with limited modeling background. 
    more » « less
  2. ABSTRACT Traffic accidents have become a major concern for governments, organizations and individuals worldwide due to the material and moral losses they cause. It is possible to reduce this concern by taking into account the research conducted by relevant institutions and organizations in this field. The main objective of this study is to categorize traffic accidents according to driver violation types and analyse them using machine learning algorithms and feature sensitivity to identify the most influential variables in each category. For this purpose, traffic accident reports that occurred in Erzurum province in the last 1 year were used to categorize and classify driver violation behaviour types. Five different machine learning algorithms, namely k‐nearest neighbour, support vector machines, naive Bayes, multilayer perception and random forest, were used to examine the success performance of the classification. Among these, 91% successful classification was obtained with the random forest algorithm. Based on the classification obtained from this algorithm, sensitivity analysis was used to reveal the variables that most affect each violation category. The results of the analysis revealed that driver age and vehicle type were the most influential variables for many types of violations. Thanks to this study, the problems were clearly identified by going into the details of driver violation behaviours. At the end of the study, measures to reduce driver violation behaviours were proposed. If the recommendations that can reduce driver behaviour are taken into consideration by transportation authorities and policy makers, traffic accidents can be significantly reduced. 
    more » « less
  3. ObjectiveOur objectives were to assess the efficacy of active inference models for capturing driver takeovers from automated vehicles and to evaluate the links between model parameters and self-reported cognitive fatigue, trust, and situation awareness. BackgroundControl transitions between human drivers and automation pose a substantial safety and performance risk. Models of driver behavior that predict these transitions from data are a critical tool for designing safer, human-centered, systems but current models do not sufficiently account for human factors. Active inference theory is a promising approach to integrate human factors because of its grounding in cognition and translation to a quantitative modeling framework. MethodWe used data from a driving simulation to develop an active inference model of takeover performance. After validating the model’s predictions, we used Bayesian regression with a spike and slab prior to assess substantial correlations between model parameters and self-reported trust, situation awareness, fatigue, and demographic factors. ResultsThe model accurately captured driving takeover times. The regression results showed that increases in cognitive fatigue were associated with increased uncertainty about the need to takeover, attributable to mapping observations to environmental states. Higher situation awareness was correlated with a more precise understanding of the environment and state transitions. Higher trust was associated with increased variance in environmental conditions associated with environmental states. ConclusionThe results align with prior theory on trust and active inference and provide a critical connection between complex driver states and interpretable model parameters. ApplicationThe active inference framework can be used in the testing and validation of automated vehicle technology to calibrate design parameters to ensure safety. 
    more » « less
  4. As of early 2023, only a limited number of Society of Automotive Engineers (SAE) Level 3 (L3) automated driving systems are available on the market, and they are primarily offered by luxury vehicle brands. SAE L3 automated driving systems are classified as conditional automation (CA), meaning that the vehicle can undertake some well-defined driving tasks under specific conditions, but the driver must be ready to assume control of the vehicle when prompted by the system. It is anticipated that an increasing number of L3 CA systems will be introduced on public roads in the next few years. However, L3 systems pose unique Human Factors (HF) challenges that require thoughtful consideration to ensure that production systems are feasible without compromising driver or road safety. This panel discussion brings together HF researchers and practitioners with expertise in human behavior and usability design for automotive applications to discuss and delineate key issues specifically related to L3 systems, as well as potential approaches to tackle these issues. 
    more » « less
  5. Advances in data collection and storage infrastructure offer an unprecedented opportunity to integrate both data and emergency resources in a city into a dynamic learning system that can anticipate and rapidly respond to heterogeneous incidents. In this paper, we describe integration methods for spatio-temporal incident forecasting using previously collected vehicular accident data provided to us by the Nashville Fire Department. The literature provides several techniques that focus on analyzing features and predicting accidents for specific situations (specific intersections in a city, or certain segments of a freeway, for example), but these models break down when applied to a large, general area consisting of many road and intersection types and other factors like weather conditions. We use Similarity Based Agglomerative Clustering (SBAC) analysis to categorize incidents to account for these variables. Thereafter, we use survival analysis to learn the likelihood of incidents per cluster. The mapping of the clusters to the spatial locations is achieved using a Bayesian network. The prediction methods we have developed lay the foundation for future work on an optimal emergency vehicle allocation and dispatch system in Nashville. 
    more » « less