Predictive Modeling and Analysis of Traffic Accidents

Data Analysis Overview
This is an overview of the STATS19 predictive modeling and road analysis project

About the Project

The project focuses on analyzing road accident data from 2019 in the UK to identify patterns and predictors of accident severity. The goal is to enhance public road safety by employing predictive analytics to determine the most effective models for predicting the severity of road accidents. Three datasets were utilized: Accidents, Vehicles, and Casualties, each providing unique insights into the conditions and outcomes of the incidents.

Techniques and Steps

The analysis utilized R programming language, specifically leveraging libraries such as ggvis, dplyr, and rpart. Data visualization was extensively used to illustrate findings, employing ggplot2 for graphical representation.

Tools Used

The analysis utilized R programming language, specifically leveraging libraries such as ggvis, dplyr, and rpart. Data visualization was extensively used to illustrate findings, employing ggplot2 for graphical representation.

Conclusion

The study assessed the performance of the KNN, LDA, and Decision Tree models. It was concluded that the LDA model demonstrated the most consistent and effective performance across different sets of variables, making it the preferred choice for predicting the severity of road accidents. The project underscores the potential of predictive analytics in enhancing traffic safety measures.

Additional Resources

GitHub Repository: Private (Available upon request)