0% found this document useful (0 votes)

125 views7 pages

Data Mining and Warehousing: Predicting The Outcome of ODI Matches

The document discusses predicting the outcome of One Day International cricket matches using data mining techniques. It explores using k-Nearest Neighbors, Decision Trees, and Naive Bayes classifiers on a dataset of match statistics scraped from cricinfo between 2006-2011. The kNN algorithm performed best, correctly predicting the winner in over 70% of validation matches. Factors analyzed included home field advantage, toss result, batting order, match timing, opponent, and venue. The models can help teams strategize to increase chances of victory.

Uploaded by

Vandhana Rathod

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

125 views7 pages

Data Mining and Warehousing: Predicting The Outcome of ODI Matches

Uploaded by

Vandhana Rathod

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

IT-633 Data Mining and warehousing

Report

IT-633
Data Mining and Warehousing

Predicting the outcome of ODI matches

14 November, 2016

Autumn, 2015-16

DA-IICT, Gandhinagar
IT-633 Data Mining and warehousing

Report

Problem Definition:

Analyzing time oriented data and forecasting are among the most important problems
that analysts face across many fields. It is one of the core topics of research in data
mining. Here different approaches for predicting the outcome of One-Day International
(ODI) cricket match has been presented.This study helps us in finding consistent
approach that allows one to predict the match outcome with a great accuracy. Here we
have studied a prediction system that takes in historical match data as well as the
occurring state of a match, and predicts future match event results in a victory or loss.
A range of variables that could define the outcome of an ODI cricket match has to be
explored. We have worked on the following algorithm for predicting the match outcome:
k-Nearest Neighbors (kNN) Decision Tree, and Naive Bayes. We describe our model
and algorithms and finally present quantitative results.

Motivation:

Cricket is the second most popular sports in the world. The ICC cricket World Cup is the
second largest single sporting event in the world, drawing a cumulative television
audience of 2-3 billion people. There is huge commercial interest in strategic planning
for ensuring victory and in game outcome prediction. This has motivated thorough and
methodical analysis of individual and team performance, as well as prediction of future
games, across all formats of the game. Board, coach and captain can use this tool to
shape their strategies and plans. For instance, if tool predicts a WIN for coming match,
they could go confident in ground with a proper game plan and if it predicts a LOSS,
they could adjust their strategies accordingly by being more alert and careful while
playing to turn the match in must win game. Moreover, this study will help analysts to
discover winning pattern of Indian team against all other oppositions

Related Work:

One of the earliest and pioneering works in cricket was by Duckworth and Lewis where
they introduce the Duckworth Lewis or D-L method, which allows fair adjustment of
scores in proportion to the time lost due to match interruption (often due to adverse
weather conditions such as rain, poor visibility etc.). This proposal has been adopted by
the International Cricket Council (ICC) as a means to reset targets in matches where
time is lost due to match interruptions.
IT-633 Data Mining and warehousing

Report

Home field advantage, winning the toss, game plan (batting first or fielding first), match
type (day or day & night), competing team, venue familiarity and season in which the
match is played will be key features studied for the research . For purposes of study
three algorithms are used: k-nearest Neighbour, Decision Tree and Naïve Bayes.

Experiment & Outcomes:

Dataset:

To retrieve all the required statistics, the entire dataset has been scraped from the
cricinfo website.The dataset includes all the matches played between 2006 and
2011. The dataset contains the basic match details including the two competing
teams, the outcome of the toss, first batting, target, day/night, top scorer, and the
winner of the match for all the matches.
We have restricted our study to only top 2 ODI-playing teams, namely, India and
England. Since the impact of the nature of the game cannot be foreseen, a total of
22 matches which were either interrupted by rain or ended up in a draw/tie, have
been removed from the dataset. Finally, we divided the dataset into two parts,
namely, the training data and the validation data. The training dataset contains all
the matches played during the years 2006 and 2007, and the validation dataset
contains all the matches played in the year 2008 and 2011. There are a total of 14
matches in training dataset and 8 matches in validation dataset
IT-633 Data Mining and warehousing

Report

India=1,England=0;Yes=1,No=0
.

Binary Classifiers:
Using various binary and numeric features and the outcome of the match as the
label, we evaluated a large number of binary classifiers using their
models, Decision Trees and kNN

● kNN Training Set:

IT-633 Data Mining and warehousing

Report

● kNN Validation Set:

Naive Bayes:
IT-633 Data Mining and warehousing

Report

Decision Tree:
IT-633 Data Mining and warehousing

Report

Conclusion:
This study brings an exceptional contribution to the literature relating to the new time
series prediction problem i.e. predicting the outcome of One Day International cricket
match. Several unique approaches adopted for dataset formation and classification
model learning have established a worthy statistical approach. Whole dissertation
revolves around formation of accurate dataset and then finding the smart attributes out
of it. It can be observed that, being a simplest algorithm, kNN has outperformed the
other classification algorithms (viz. Decision Tree and Naive Bayes).

References:
[1] Vignesh Veppur Sankaranarayanan, Junaed Sattar and Laks V. S.
Lakshmanan.Autoplay: A Data Mining Approach to ODI Cricket Simulation and
Prediction:pp 1-9,2014.
[2] Mehvish Khan, Riddhi Shah, Role of External Factors on Outcome of a One Day
International Cricket (ODI) Match and Predictive Analysis: pp 192-197, June 2015.
[3] Madan Gopal Jhanwar and Vikram Pudi, Predicting the Outcome of ODI Cricket
Matches: A Team Composition Based Approach: pp 1-10.

Predicting ODI Cricket Outcomes
No ratings yet
Predicting ODI Cricket Outcomes
6 pages
Cricket JETIR2005307
No ratings yet
Cricket JETIR2005307
5 pages
Dalal 2024 Ijca 923744
No ratings yet
Dalal 2024 Ijca 923744
7 pages
Prediction of IPL Match Outcome Using Machine Lear
No ratings yet
Prediction of IPL Match Outcome Using Machine Lear
8 pages
Paper 9073
No ratings yet
Paper 9073
11 pages
Predicting Cricket Match 490021 1 en
No ratings yet
Predicting Cricket Match 490021 1 en
13 pages
Dynamic Cricket Match Outcome Prediction
No ratings yet
Dynamic Cricket Match Outcome Prediction
12 pages
Balasundaram 2020
No ratings yet
Balasundaram 2020
5 pages
Cricket Score & Win Prediction
No ratings yet
Cricket Score & Win Prediction
4 pages
The Cricket Winner Prediction With Application of Machine Learning and Data Analytics
No ratings yet
The Cricket Winner Prediction With Application of Machine Learning and Data Analytics
6 pages
Cricket Score Prediction Using Machine Learning
No ratings yet
Cricket Score Prediction Using Machine Learning
6 pages
Cricket Match Outcome Prediction
No ratings yet
Cricket Match Outcome Prediction
43 pages
Nadeem Report
No ratings yet
Nadeem Report
19 pages
1 PB PDF
No ratings yet
1 PB PDF
3 pages
Project New
No ratings yet
Project New
13 pages
Quantifying and Analyzing The Performance of Cricket Player Using Machine Learning
No ratings yet
Quantifying and Analyzing The Performance of Cricket Player Using Machine Learning
7 pages
Madan Gopal Jhanwar
No ratings yet
Madan Gopal Jhanwar
11 pages
Performance Analysis of A Cricketer by Data Visualization
No ratings yet
Performance Analysis of A Cricketer by Data Visualization
10 pages
IPL Match Predictions with ML
No ratings yet
IPL Match Predictions with ML
5 pages
Ijsred V8i2p177
No ratings yet
Ijsred V8i2p177
6 pages
The Cricket Winner Prediction With Applications of ML and Data Analytics
No ratings yet
The Cricket Winner Prediction With Applications of ML and Data Analytics
18 pages
SSRN Id3572740
No ratings yet
SSRN Id3572740
5 pages
IPL Score Prediction (Journal) - 4nm18cs142-169-191-215.
No ratings yet
IPL Score Prediction (Journal) - 4nm18cs142-169-191-215.
10 pages
Application of Machine Learning in Cricket and Predictive Analytics of IPL 2020
No ratings yet
Application of Machine Learning in Cricket and Predictive Analytics of IPL 2020
26 pages
IPL Match Prediction Using ML
No ratings yet
IPL Match Prediction Using ML
7 pages
YMER210577
No ratings yet
YMER210577
3 pages
Shetty 2020
No ratings yet
Shetty 2020
6 pages
Prediction of The Outcome of A Twenty-20 Cricket Match
No ratings yet
Prediction of The Outcome of A Twenty-20 Cricket Match
8 pages
Report
No ratings yet
Report
42 pages
Prediction and Analysis of Franchise Cricket
No ratings yet
Prediction and Analysis of Franchise Cricket
8 pages
Jsa - 2018 - 4 4 - Jsa 4 4 Jsa196 - Jsa 4 Jsa196
No ratings yet
Jsa - 2018 - 4 4 - Jsa 4 4 Jsa196 - Jsa 4 Jsa196
11 pages
Quantitative Assessment of Player Performance... (Madan Gopal Jhanwar, MS, 201202018)
No ratings yet
Quantitative Assessment of Player Performance... (Madan Gopal Jhanwar, MS, 201202018)
69 pages
Predicting BPLMatch Winners An Empirical Study Using Machine Learning Approach
No ratings yet
Predicting BPLMatch Winners An Empirical Study Using Machine Learning Approach
9 pages
Fin Irjmets1697356356
No ratings yet
Fin Irjmets1697356356
4 pages
Predictive Analysis of Sports Data Using Google Prediction API
No ratings yet
Predictive Analysis of Sports Data Using Google Prediction API
1 page
Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning
No ratings yet
Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning
12 pages
Source File
No ratings yet
Source File
42 pages
Cricket Team Prediction New
No ratings yet
Cricket Team Prediction New
22 pages
IPL Score Prediction with ML Models
No ratings yet
IPL Score Prediction with ML Models
8 pages
IPL Match Winner Prediction Using ML
No ratings yet
IPL Match Winner Prediction Using ML
7 pages
Predicting Players' Performance in One Day International Cricket Matches Using Machine Learning
No ratings yet
Predicting Players' Performance in One Day International Cricket Matches Using Machine Learning
17 pages
ODI Cricket Prediction via AI
No ratings yet
ODI Cricket Prediction via AI
8 pages
Ref 5 PDF
No ratings yet
Ref 5 PDF
9 pages
The Paper About The Method of Cricket Match Outcome
No ratings yet
The Paper About The Method of Cricket Match Outcome
67 pages
A Comparative Study of Data Mining Techniques On Football Match Prediction
No ratings yet
A Comparative Study of Data Mining Techniques On Football Match Prediction
8 pages
Sports Analytics
No ratings yet
Sports Analytics
10 pages
ML
No ratings yet
ML
8 pages
Capstone Presentation
No ratings yet
Capstone Presentation
10 pages
Ijaerv13n5 96
No ratings yet
Ijaerv13n5 96
3 pages
Ipl 2
No ratings yet
Ipl 2
6 pages
Paper3 TeamselectionusingRandomForestAlgorithm
No ratings yet
Paper3 TeamselectionusingRandomForestAlgorithm
8 pages
IPL Match Predictions via ML
No ratings yet
IPL Match Predictions via ML
6 pages
EPL Match Outcome Prediction Using ML
No ratings yet
EPL Match Outcome Prediction Using ML
3 pages
Journal Pone 0284318
No ratings yet
Journal Pone 0284318
15 pages
IPL Cricket Match Prediction Using ML
No ratings yet
IPL Cricket Match Prediction Using ML
6 pages
3072 6115 1 SM
No ratings yet
3072 6115 1 SM
6 pages
Cricket Prediction Using Machine Learning Algorithms
No ratings yet
Cricket Prediction Using Machine Learning Algorithms
4 pages
Cricket Player Data Analysis Using Clustering Technique
No ratings yet
Cricket Player Data Analysis Using Clustering Technique
5 pages
Sample Exam Istqb CTFL 2018
No ratings yet
Sample Exam Istqb CTFL 2018
60 pages
Fine-Grained NIKE Protocols & Bounds
No ratings yet
Fine-Grained NIKE Protocols & Bounds
22 pages
Tailoring AutoCAD P&ID and Plant 3D
100% (8)
Tailoring AutoCAD P&ID and Plant 3D
194 pages
3rd Sem - Mid Term
No ratings yet
3rd Sem - Mid Term
6 pages
Form For Registration of Electives: Structure and Syllabus of The Concerned Program
No ratings yet
Form For Registration of Electives: Structure and Syllabus of The Concerned Program
4 pages
VLSI Lab Questions
No ratings yet
VLSI Lab Questions
5 pages
Entropy 23 00018 v2 41
No ratings yet
Entropy 23 00018 v2 41
1 page
Canvas: Java SE 6
No ratings yet
Canvas: Java SE 6
3 pages
SRA Proposal Template
100% (1)
SRA Proposal Template
8 pages
XL Interpolators
No ratings yet
XL Interpolators
6 pages
AN Grade 3 1
90% (10)
AN Grade 3 1
2 pages
Chapter 4 Questions With Answers - Project Integration Management
No ratings yet
Chapter 4 Questions With Answers - Project Integration Management
3 pages
AlienVault PCI DSS 3.0 Compliance
No ratings yet
AlienVault PCI DSS 3.0 Compliance
5 pages
Unit 4-Exception
No ratings yet
Unit 4-Exception
22 pages
9825A Quick Reference Guide
No ratings yet
9825A Quick Reference Guide
29 pages
Software Project Planning
100% (1)
Software Project Planning
4 pages
Revision WS-Tuples & Dictionary
No ratings yet
Revision WS-Tuples & Dictionary
5 pages
C++ Best Practices by Jason Turner
100% (4)
C++ Best Practices by Jason Turner
184 pages
UIT11e Ch09 PPT
No ratings yet
UIT11e Ch09 PPT
37 pages
Android Dashboard Location Service
No ratings yet
Android Dashboard Location Service
19 pages
Pacifican Job Application Form
No ratings yet
Pacifican Job Application Form
2 pages
Oracle 10g Dataguard Best Practice and Setup Steps
No ratings yet
Oracle 10g Dataguard Best Practice and Setup Steps
6 pages
Deepak Garg: Education
No ratings yet
Deepak Garg: Education
2 pages
Pipelined Adders
No ratings yet
Pipelined Adders
9 pages
ITE 340 / ISM 321 Management Information Systems: Review of Attempt 2
No ratings yet
ITE 340 / ISM 321 Management Information Systems: Review of Attempt 2
4 pages
Statistics Honours & Pass Syllabus
No ratings yet
Statistics Honours & Pass Syllabus
79 pages
Srinivas Pyla Resume
No ratings yet
Srinivas Pyla Resume
3 pages
Sample Assignment Crowdfunding
No ratings yet
Sample Assignment Crowdfunding
2 pages
2011 - A HiL Test Bench For Verification and Validation Purposes of Model-Based Developed Applications Using Simulink and OPC DA Technology PDF
No ratings yet
2011 - A HiL Test Bench For Verification and Validation Purposes of Model-Based Developed Applications Using Simulink and OPC DA Technology PDF
6 pages
LC72322 Microcontroller Specs
No ratings yet
LC72322 Microcontroller Specs
13 pages

Data Mining and Warehousing: Predicting The Outcome of ODI Matches

Uploaded by

Data Mining and Warehousing: Predicting The Outcome of ODI Matches

Uploaded by

IT-633 Data Mining and warehousing

​Predicting the outcome of ODI matches

Experiment & Outcomes:

● kNN Training Set:

● kNN Validation Set:

You might also like

Predicting the outcome of ODI matches