Document Type : Applied Article

Authors

1 Computer Engineering Department, Imam Khomeini International University, Qazvin, Iran

2 Civil Engineering Department, Imam Khomeini International University, Qazvin, Iran

Abstract

This paper predicts the severity of crashes based on the analysis of multiple variables and using machine learning methods. For this purpose, data related to the years 2012 to 2024 of Tempe city in the state of Arizona USA was used. Features were selected using the metaheuristic method. Then, by using decision tree and artificial neural network, the classification of the severity of crashes was carried out. Based on the metrics, decision tree with an overall accuracy of 54% was the optimal. Finally, using the permutation feature importance method, the optimal model was interpreted. The results show that the characteristics of the year with 0.22 and the spatial characteristics with 0.11 and the collision manner with 0.1 have a higher importance in predicting the severity of crashes on urban roads.

Keywords

Main Subjects

[1] WHO (World Health Organization). "Road Traffic Injuries, Retrieved from WHO, Fact Sheets."
 
[2] G. Singh, S. Sachdeva, and M. Pal, "Comparison of three parametric and machine learning approaches for modeling accident severity on non-urban sections of Indian highways," Advances in transportation studies, vol. 45, 2018.
 
[3] W. D. Fan, L. Gong, E. M. Washing, M. Yu, and E. Haile, "Identifying and quantifying factors affecting vehicle crash severity at highway-rail grade crossings: Models and their comparison," 2016.
 
[4] J. Zhang, Z. Li, Z. Pu, and C. Xu, "Comparing prediction performance for crash injury severity among various machine learning and statistical methods," IEEE Access, vol. 6, pp. 60079-60087, 2018.
 
[5] K. Santos, J. P. Dias, and C. Amado, "A literature review of machine learning algorithms for crash injury severity prediction," Journal of safety research, vol. 80, pp. 254-269, 2022.
 
[6] W.-H. Chen and P. P. Jovanis, "Method for identifying factors contributing to driver-injury severity in traffic crashes," Transportation Research Record, vol. 1717, no. 1, pp. 1-9, 2000.
 
[7] M. T. Kashifi, M. Al-Turki, and A. W. Sharify, "Deep hybrid learning framework for spatiotemporal crash prediction using big traffic data," International journal of transportation science and technology, vol. 12, no. 3, pp. 793-808, 2023.
 
[8] W. Shunshun, Y. Changshun, and S. Yong, "A review of road traffic accident prediction methods," American Journal of Management Science and Engineering, vol. 8, no. 3, pp. 73-77, 2023.
 
[9] X. Wen, Y. Xie, L. Wu, and L. Jiang, "Quantifying and comparing the effects of key risk factors on various types of roadway segment crashes with LightGBM and SHAP," Accident Analysis & Prevention, vol. 159, p. 106261, 2021.
 
[10]         I. Ahmad, M. Basheri, M. J. Iqbal, and A. Rahim, "Performance comparison of support vector machine, random forest, and extreme learning machine for intrusion detection," IEEE access, vol. 6, pp. 33789-33795, 2018.
 
[11]         H. Nassiri, S. I. Mohammadpour, and M. Dahaghin, "Forecasting time trends of fatal motor vehicle crashes in Iran using an ensemble learning algorithm," Traffic injury prevention, vol. 24, no. 1, pp. 44-49, 2023.
 
[12]         M.-M. Chen and M.-C. Chen, "Modeling road accident severity with comparisons of logistic regression, decision tree and random forest," Information, vol. 11, no. 5, p. 270, 2020.
 
[13]         Q. Cai, M. Abdel-Aty, Y. Sun, J. Lee, and J. Yuan, "Applying a deep learning approach for transportation safety planning by using high-resolution transportation and land use data," Transportation research part A: policy and practice, vol. 127, pp. 71-85, 2019. [Online]. Available: https://doi.org/10.1016/j.tra.2019.07.010.
 
[14]         X. Shi, Y. D. Wong, M. Z.-F. Li, C. Palanisamy, and C. Chai, "A feature learning approach based on XGBoost for driving assessment and risk prediction," Accident Analysis & Prevention, vol. 129, pp. 170-179, 2019. [Online]. Available:
 
[15]         J. Bao, P. Liu, and S. V. Ukkusuri, "A spatiotemporal deep learning approach for citywide short-term crash risk prediction with multi-source data," Accident Analysis & Prevention, vol. 122, pp. 239-254, 2019. [Online]. Available:
 
[16]         H. A. Abd Rahman and B. W. Yap, "Imbalance effects on classification using binary logistic regression," in Soft Computing in Data Science: Second International Conference, SCDS 2016, Kuala Lumpur, Malaysia, September 21-22, 2016, Proceedings 2, 2016: Springer, pp. 136-147.
 
[17]         M. Yahaya, W. Fan, C. Fu, X. Li, Y. Su, and X. Jiang, "A machine-learning method for improving crash injury severity analysis: a case study of work zone crashes in Cairo, Egypt," International journal of injury control and safety promotion, vol. 27, no. 3, pp. 266-275, 2020.
 
[18] D. Liu, P. Yan, Z. Pu, Y. Wang, and E. I. Kaisar, "Hybrid artificial immune algorithm for optimizing a Van-Robot E-grocery delivery system," Transportation Research Part E: Logistics and Transportation Review, vol. 154, p. 102466, 2021.
 
[19] K. Yang, X. Wang, and R. Yu, "A Bayesian dynamic updating approach for urban expressway real-time crash risk evaluation," Transportation research part C: emerging technologies, vol. 96, pp. 192-207, 2018.
 
[20]         X. Wang and S. H. Kim, "Prediction and factor identification for crash severity: comparison of discrete choice and tree-based models," Transportation research record, vol. 2673, no. 9, pp. 640-653, 2019.
 
[21]         Z. E. Abou Elassad, H. Mousannif, and H. Al Moatassime, "A real-time crash prediction fusion framework: An imbalance-aware strategy for collision avoidance systems," Transportation research part C: emerging technologies, vol. 118, p. 102708, 2020.
 
[22]         Y. Peng, C. Li, K. Wang, Z. Gao, and R. Yu, "Examining imbalanced classification algorithms in predicting real-time traffic crash risk," Accident Analysis & Prevention, vol. 144, p. 105610, 2020.
 
[23]         X. Shi, Y. D. Wong, C. Chai, and M. Z.-F. Li, "An automated machine learning (AutoML) method of risk prediction for decision-making of autonomous vehicles," IEEE Transactions on Intelligent Transportation Systems, vol. 22, no. 11, pp. 7145-7154, 2020. [Online]. Available: https://doi.org/10.1109/TITS.2020.3002419.
 
[24]         M. Becerra-Rozas et al., "Continuous metaheuristics for binary optimization problems: An updated systematic literature review," Mathematics, vol. 11, no. 1, p. 129, 2022.
 
[25]         I. Guyon and A. Elisseeff, "An introduction to variable and feature selection," Journal of machine learning research, vol. 3, no. Mar, pp. 1157-1182, 2003. [Online]. Available:
 
[26]         N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," Journal of artificial intelligence research, vol. 16, pp. 321-357, 2002.
 
[27]         T. Beshah and S. Hill, "Mining road traffic accident data to improve safety: role of road-related factors on accident severity in Ethiopia," in 2010 AAAI Spring symposium series, 2010.
 
[28]         L. Wahab and H. Jiang, "A comparative study on machine learning based algorithms for prediction of motorcycle crash severity," PLoS one, vol. 14, no. 4, p. e0214966, 2019.
 
[29]         S. Malik, H. El Sayed, M. A. Khan, and M. J. Khan, "Road accident severity prediction—a comparative analysis of machine learning algorithms," in 2021 IEEE Global Conference on Artificial Intelligence and Internet of Things (GCAIoT), 2021: IEEE, pp. 69-74.
 
[30]         M. U. Abdulazeez, W. Khan, and K. A. Abdullah, "Predicting child occupant crash injury severity in the United Arab Emirates using machine learning models for imbalanced dataset," IATSS research, vol. 47, no. 2, pp. 134-159, 2023.
 
[31]         A. Fisher, C. Rudin, and F. Dominici, "All models are wrong, but many are useful: Learning a variable's importance by studying an entire class of prediction models simultaneously," Journal of Machine Learning Research, vol. 20, no. 177, pp. 1-81, 2019.
 
[32]         H. T. Abdelwahab and M. A. Abdel-Aty, "Development of artificial neural network models to predict driver injury severity in traffic accidents at signalized intersections," Transportation research record, vol. 1746, no. 1, pp. 6-13, 2001.
 
[33]         X. Wen, Y. Xie, L. Jiang, Z. Pu, and T. Ge, "Applications of machine learning methods in traffic crash severity modelling: current status and future directions," Transport reviews, vol. 41, no. 6, pp. 855-879, 2021.
 
[34]               S. Ahmed, M. A. Hossain, S. K. Ray, M. M. I. Bhuiyan, and S. R. Sabuj, "A study on road accident prediction and contributing factors using explainable machine learning models: analysis and performance," Transportation research interdisciplinary perspectives, vol. 19, p. 100814, 2023.
 
[35]         S. Sun, B. Zhou, and S. Zhang, "Analysis of factors affecting injury severity in motorcycle involved crashes," in CICTP 2020, 2020, pp. 4207-4219.