Dark Data in Accident Prediction: Using AdaBoost and Random Forest for Improved Accuracy

Authors

  • Masroor Shah Department of Computer Science, Iqra National University Peshawar, Khyber Pakhtunkhwa (KPK), Pakistan.
  • Fazal Malik Department of Computer Science, Iqra National University Peshawar, Khyber Pakhtunkhwa (KPK), Pakistan.
  • Muhammad Suliman Department of Computer Science, Iqra National University Peshawar, Khyber Pakhtunkhwa (KPK), Pakistan.
  • Noor Rahman Fahad Bin Sultan University, Saudi Arabia.
  • Irfan Ullah Department of Computer Science, Iqra National University Peshawar, Khyber Pakhtunkhwa (KPK), Pakistan.
  • Sana Ullah Department of Computer Science, Iqra National University Peshawar, Khyber Pakhtunkhwa (KPK), Pakistan.
  • Romaan Khan City University of Science and Information Technology Peshawar, KPK, Pakistan.
  • Salman Alam COMSATS University Islamabad (CUI), Pakistan.

Keywords:

Big Data, Data Quality, Dark Data, Complexity of Dark Data, Accident Prediction

Abstract

Dark data, or unused information included into routine activities, poses significant hurdles in the era of data-driven decision-making because of its volume and complexity. The goal of this publication is to increase the accuracy of accident prediction by proposing an efficient procedure for dark data extraction and analysis. Data extraction, classifier implementation, and performance evaluation are all done in a methodical manner by using AdaBoost and Random Forest classifiers. According to the results, the Random Forest classifier outperforms the AdaBoost classifier with an accuracy of 89.50%, compared to the former's 78.4%. These results highlight the potential of dark data to yield insightful information by demonstrating how well these classifiers improve accident prediction models. In addition to emphasizing the value of dark data for decision-makers and urban planners looking to improve prediction models and access hidden information, the study offers a methodology for using it. Our research highlights the increasing significance of dark data in enhancing decision-making procedures and forecast precision as data quantities increase.

Downloads

Published

2024-09-01

How to Cite

Masroor Shah, Fazal Malik, Muhammad Suliman, Noor Rahman, Irfan Ullah, Sana Ullah, Romaan Khan, & Salman Alam. (2024). Dark Data in Accident Prediction: Using AdaBoost and Random Forest for Improved Accuracy. Journal of Computing & Biomedical Informatics, 7(02). Retrieved from https://jcbi.org/index.php/Main/article/view/531