TY - JOUR
T1 - Using contextual data to predict risky driving events
T2 - A novel methodology from explainable artificial intelligence
AU - Masello, Leandro
AU - Castignani, German
AU - Sheehan, Barry
AU - Guillen, Montserrat
AU - Murphy, Finbarr
N1 - Publisher Copyright:
© 2023 The Author(s)
PY - 2023/5
Y1 - 2023/5
N2 - Usage-based insurance has allowed insurers to dynamically tailor insurance premiums by understanding when and how safe policyholders drive. However, telematics information can also be used to understand the driving contexts experienced by the driver within each trip (e.g., road types, weather, traffic). Since different combinations of these conditions affect exposure to accidents, this understanding introduces predictive opportunities in driving risk assessment. This paper investigates the relationships between driving context combinations and risk using a naturalistic driving dataset of 77,859 km. In particular, XGBoost and Random Forests are used to determine the predictive significance of driving contexts for near-misses, speeding and distraction events. Moreover, the most important contextual factors in predicting these risky events are identified and ranked through Shapley Additive Explanations. The results show that the driving context has significant power in predicting driving risk. Speed limit, weather temperature, wind speed, traffic conditions and road slope appear in the top ten most relevant features for most risky events. Analysing contextual feature variations and their influence on risky events showed that low-speed limits increase the predicted frequency of speeding and phone unlocking events, whereas high-speed limits decrease harsh accelerations. Low temperatures decrease the expected frequency of harsh manoeuvres, and precipitations increase harsh acceleration, harsh braking, and distraction events. Furthermore, road slope, intersections and pavement quality are the most critical factors among road layout attributes. The methodology presented in this study aims to support road safety stakeholders and insurers by providing insights to study the contextual risk factors that influence road accident frequency and driving risk.
AB - Usage-based insurance has allowed insurers to dynamically tailor insurance premiums by understanding when and how safe policyholders drive. However, telematics information can also be used to understand the driving contexts experienced by the driver within each trip (e.g., road types, weather, traffic). Since different combinations of these conditions affect exposure to accidents, this understanding introduces predictive opportunities in driving risk assessment. This paper investigates the relationships between driving context combinations and risk using a naturalistic driving dataset of 77,859 km. In particular, XGBoost and Random Forests are used to determine the predictive significance of driving contexts for near-misses, speeding and distraction events. Moreover, the most important contextual factors in predicting these risky events are identified and ranked through Shapley Additive Explanations. The results show that the driving context has significant power in predicting driving risk. Speed limit, weather temperature, wind speed, traffic conditions and road slope appear in the top ten most relevant features for most risky events. Analysing contextual feature variations and their influence on risky events showed that low-speed limits increase the predicted frequency of speeding and phone unlocking events, whereas high-speed limits decrease harsh accelerations. Low temperatures decrease the expected frequency of harsh manoeuvres, and precipitations increase harsh acceleration, harsh braking, and distraction events. Furthermore, road slope, intersections and pavement quality are the most critical factors among road layout attributes. The methodology presented in this study aims to support road safety stakeholders and insurers by providing insights to study the contextual risk factors that influence road accident frequency and driving risk.
KW - Driving context
KW - Explainable AI
KW - Machine learning
KW - Risk assessment
KW - Usage-based insurance
UR - http://www.scopus.com/inward/record.url?scp=85148693711&partnerID=8YFLogxK
U2 - 10.1016/j.aap.2023.106997
DO - 10.1016/j.aap.2023.106997
M3 - Article
C2 - 36854225
AN - SCOPUS:85148693711
SN - 0001-4575
VL - 184
SP - -
JO - Accident Analysis and Prevention
JF - Accident Analysis and Prevention
M1 - 106997
ER -