top of page
PURPOSE
The goal of this research is to analyze chicago car accident reports data in order to classify the primary cause of an accident and answer the following questions:
Q1 - What is the distribution of car accident causes?
Q2 - What regions do the most car accidents occur?
Q3 - What effect do external factors have on the amount of car crashes and car crashes with injuries?
The time of the days effect on car accidents.
The weather’s effect on car accidents.
ACCIDENT CLASSIFIER: Projects
QUESTIONS
ACCIDENT CLASSIFIER: Projects
ACCIDENT CLASSIFIER: Projects
Model
Random Forest, X Boosting & LinearSVC classifiers where implimented after re-sampling with SMOTE since the dataset was heavily imbalanced and they all gave roughly the same results give or take 5%. So I opted to go with X Boosting classifier using PCA as its feature selection parameter. The features included where:
Driver’s Action, Driver’s Vision, Roadway Surface Condition, Device Condition, First Crash Type, Posted Speed Limit, Age, Physical Condition.
ACCIDENT CLASSIFIER: Text
ACCIDENT CLASSIFIER: Projects
Future Work
Road Type Division: Segregate the different types of streets/roads to understand the unique properties of accidents that occurs in each
More Data: Gather more data like if a driver was on the phone, exceeded the posted speed limit or or has a good amount of driving experience .
Region Division: Deeper analysis on the primary causes of accidents in the North, South, East West and Central regions of the city.
ACCIDENT CLASSIFIER: Text
bottom of page