Priyadarshini Phase 2

The document outlines a Phase-2 submission by Priyadharshini.S, focusing on predicting road traffic accident severity and likelihood using AI-based models. It details the project's objectives, workflow, data description, preprocessing methods, exploratory data analysis, feature engineering, model building, and visualization techniques. The project aims to enhance road safety by identifying high-risk zones and supporting smarter infrastructure planning.


PHASE-2 SUBMISSION

Student Name: PRIYADHARSHINI.S

Institution: 422223149019
Department: Computer Science and Engineering (Cyber Security)
Date of Submission: 04/05/2025
GitHub Repository Link: https://github.com/pakalavan1/phase2.git

1. Problem Statement

Road traffic accidents are a major global concern, leading to fatalities, injuries, and economic losses. Traditional safety measures based on manual historical analysis are insufficient.

Type of Problem: Classification and Regression

Refined Understanding:

Based on deeper exploration, the focus is on predicting accident severity and accident likelihood using historical accident datasets.

Impact:

• Helps authorities identify high-risk zones and time periods.

• Supports smarter and safer road infrastructure planning.

• Saves lives and reduces economic costs.

2. Project Objectives

• Analyze global road accident data to discover key risk factors and trends.

• Predict the likelihood and severity of road accidents using AI-based models.

• Identify accident hotspots and peak risk periods.

• Build a decision-support tool for authorities.

• Improve model interpretability and real-world applicability by using visualization techniques.

(Goals refined slightly after EDA, focusing more on severity prediction.)

3. Flowchart of the Project Workflow

Data Collection → Data Preprocessing → Exploratory Data Analysis → Feature
Engineering → Model Building → Model Evaluation → Visualization & Insights
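The workflow above can be sketched as a single scikit-learn pipeline. This is only an illustrative sketch, not the project's actual code: the data is synthetic, the column names (speed_kmh, hour, weather, severe) are hypothetical, and a random forest stands in for whichever model the project finally uses.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Data Collection: synthetic stand-in for the accident data (hypothetical columns).
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "speed_kmh": rng.normal(60, 15, n),
    "hour": rng.integers(0, 24, n),
    "weather": rng.choice(["clear", "rain", "fog"], n),
})
# Toy label so the model has a signal to learn: "severe" when speed is high.
df["severe"] = (df["speed_kmh"] > 65).astype(int)

numeric = ["speed_kmh", "hour"]
categorical = ["weather"]

# Data Preprocessing / Feature Engineering: impute, scale, and one-hot encode.
preprocess = ColumnTransformer([
    ("num", Pipeline([
        ("impute", SimpleImputer(strategy="median")),
        ("scale", StandardScaler()),
    ]), numeric),
    ("cat", OneHotEncoder(handle_unknown="ignore"), categorical),
])

# Model Building: preprocessing and classifier chained into one estimator.
workflow = Pipeline([
    ("preprocess", preprocess),
    ("model", RandomForestClassifier(n_estimators=50, random_state=0)),
])

# Model Evaluation: hold out 25% of the rows and report accuracy on them.
X_train, X_test, y_train, y_test = train_test_split(
    df[numeric + categorical], df["severe"], test_size=0.25, random_state=0
)
workflow.fit(X_train, y_train)
accuracy = workflow.score(X_test, y_test)
print(f"Hold-out accuracy: {accuracy:.2f}")
```

Keeping preprocessing inside the pipeline means the imputer, scaler, and encoder are fitted on the training split only, which avoids leaking test data into the model.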

4. Data Description

• Dataset Name: Global Road Accidents Dataset

• Source: Kaggle (https://doi.org/10.34740/kaggle/dsv/10575045)

• Type: Structured (Tabular Data)

• Records and Features: Several thousand records with fields such as time, location, environmental factors, and accident severity.

• Nature: Static (downloaded and used locally)

• Target Variable: Severity of accident (for regression) or accident occurrence (for classification)

5. Data Preprocessing

• Missing Values: Handled using mean or median imputation, or by removing the affected records.

• Duplicates/Outliers: Removed, with outliers detected using statistical methods (IQR, Z-score).

• Data Type Consistency: Ensured standard datetime formats and consistent speed units (km/h).

• Categorical Encoding: Label encoding and one-hot encoding used.

• Normalization/Standardization: Applied Min-Max scaling and Z-score standardization.
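
A minimal pandas sketch of the preprocessing steps listed above. The table is made up for illustration and the column names (speed_kmh, weather) are assumptions, not the dataset's real schema. Only one-hot encoding is shown; label encoding would follow the same pattern.

```python
import numpy as np
import pandas as pd

# Tiny illustrative table; values and column names are hypothetical.
raw = pd.DataFrame({
    "speed_kmh": [45.0, 55.0, np.nan, 60.0, 300.0, 55.0, 50.0],
    "weather": ["clear", "rain", "rain", "fog", "clear", "rain", "clear"],
})

# Duplicates: drop rows that repeat an earlier row exactly.
df = raw.drop_duplicates().copy()

# Missing values: fill gaps in the numeric column with its median.
df["speed_kmh"] = df["speed_kmh"].fillna(df["speed_kmh"].median())

# Outliers: keep rows within 1.5 * IQR of the quartiles.
q1, q3 = df["speed_kmh"].quantile([0.25, 0.75])
iqr = q3 - q1
df = df[df["speed_kmh"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)]

# Categorical encoding: one-hot encode the weather column.
clean = pd.get_dummies(df, columns=["weather"])

# Scaling: Min-Max to [0, 1], and Z-score to mean 0 / standard deviation 1.
speed = clean["speed_kmh"]
clean["speed_minmax"] = (speed - speed.min()) / (speed.max() - speed.min())
clean["speed_z"] = (speed - speed.mean()) / speed.std(ddof=0)

print(clean)
```

In this toy table the 300 km/h reading falls outside the IQR fence and is dropped, while the missing speed is replaced by the median before the fence is computed.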

6. Exploratory Data Analysis (EDA)

• Univariate Analysis:
o Histograms, boxplots to study feature distributions.
• Bivariate/Multivariate Analysis:
o Heatmaps for correlation.
o Geospatial maps to locate accident hotspots.
o Time-series analysis for accidents across seasons/months.
Insights:
• Most accidents occur during rainy evenings at intersections.
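
The analyses in this section can be approximated numerically before any plotting. The sketch below uses six invented records and hypothetical column names (timestamp, rain_mm, severity); it computes the correlation matrix that a heatmap would display and the monthly and evening counts that a time-series chart would show.

```python
import pandas as pd

# Six invented accident records; column names are hypothetical.
records = pd.DataFrame({
    "timestamp": pd.to_datetime([
        "2024-01-05 18:30", "2024-01-20 19:10", "2024-07-02 08:00",
        "2024-07-15 18:45", "2024-07-28 20:15", "2024-11-11 07:30",
    ]),
    "rain_mm": [12.0, 8.0, 0.0, 15.0, 20.0, 1.0],
    "severity": [3, 2, 1, 3, 4, 1],
})

# Bivariate analysis: the correlation matrix is what a heatmap colours in.
corr = records[["rain_mm", "severity"]].corr()

# Time-series analysis: number of accidents per calendar month.
per_month = records.groupby(records["timestamp"].dt.month).size()

# Share of accidents that happened in the evening (18:00 to 20:59).
evening_share = records["timestamp"].dt.hour.between(18, 20).mean()

print(corr)
print(per_month)
print(f"Evening share: {evening_share:.2f}")
```

Passing corr to seaborn.heatmap would give the correlation heatmap mentioned above; the per-month counts can be plotted directly as a bar or line chart.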

7. Feature Engineering
[List names and responsibilities.

● Clearly mention who worked on:

○ Data cleaning

○ EDA

○ Feature engineering

○ Model development]

8. Model Building

[List names and responsibilities.

● Clearly mention who worked on:

○ Data cleaning

○ EDA

○ Feature engineering

○ Model development]

9. Visualization of Results & Model Insights

• Confusion Matrix for classification models.

• ROC Curve to evaluate model discrimination.

• Feature Importance Plots (using SHAP, LIME).

• Residual plots for regression models.

• Accident Risk Maps using Folium/Plotly.

• Dashboard (optional) using Power BI, Tableau, or Streamlit.
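
The numbers behind the first, second, and fourth plots above can be computed with scikit-learn and NumPy. This is a sketch on hand-made labels and scores, not on the project's results; the 0.5 decision threshold and all values are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, roc_auc_score, roc_curve

# Hand-made classification example: true labels and predicted probabilities.
y_true = np.array([0, 0, 1, 1, 0, 1, 0, 1])
y_score = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.7])
y_pred = (y_score >= 0.5).astype(int)  # assumed decision threshold of 0.5

# Confusion matrix: rows are true classes, columns are predicted classes.
cm = confusion_matrix(y_true, y_pred)

# ROC curve points and the area under the curve.
fpr, tpr, thresholds = roc_curve(y_true, y_score)
auc = roc_auc_score(y_true, y_score)

# Hand-made regression example: residual = actual - predicted severity.
actual = np.array([2.0, 3.0, 1.0, 4.0])
predicted = np.array([2.5, 2.5, 1.5, 3.5])
residuals = actual - predicted

print(cm)
print(f"AUC: {auc:.3f}")
print(residuals)
```

Plotting fpr against tpr gives the ROC curve, and a scatter of predicted against residuals gives the residual plot; residuals scattered evenly around zero suggest the regression model has no systematic bias.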

10. Tools and Technologies Used

• Programming Language: Python

• IDE/Notebook: Google Colab / Jupyter Notebook

• Libraries:

o Data Manipulation: pandas, numpy

o Visualization: matplotlib, seaborn, plotly, folium

o Machine Learning: scikit-learn, xgboost, lightgbm

o Model Interpretation: shap, lime

o Deployment (Optional): Streamlit, Flask

11. Team Members and Contributions

Name | Role | Contributions
Priyadharshini.S | Project Manager | Oversaw project timeline and deliverables; coordinated team communication and milestones
Kavinaya selshiya.D | Data Scientist | Collected and cleaned traffic accident datasets; performed exploratory data analysis (EDA)
Suji.N | Machine Learning Engineer | Developed and trained AI/ML models; tuned models for accident prediction accuracy
