Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
IN CRICKET
Team Members:
18BCE0348 (NISHANT SHUKLA)
Slot: F1 + TF1
Cricket is played in many countries of the world. There are a lot of domestic
and international matches organised by the International Cricket Council (ICC)
all over the world. Also, cricket is popular among the statistical science
community, but its inconsistent nature makes it difficult to predict by using
common probability models.
In this paper, a method has been proposed in which the final score can be
predicted of the first innings and the winning probability of the batting team in
the second innings can be estimated. In the former case Linear Regression
Classifier has been used and in the latter Random forest regression has been
implemented. Unlike the current procedure for projecting the score, the factors
like the venue of the match, the number of wickets fallen and the batting team
have been considered in the estimation and in the second innings, the target
given to the batting team has been included along with the factors taken in the
first innings, for probability estimation.
Only for
577 The user
T20
R lokhande Dataset different can be
Prediction of Comparing matches.
and analysis and matches rewarded
cricket score two Future
P chavan tabular files from if
and winning algorithms work will
(2018) algorithm cricsheet. predicted
be for all
org correctly
formats
More
Matches
Considering accuracy if
Sonu Kumar, Score between Limited
all factors Regression deep
Sneha Roy prediction in 2006-17 dataset,ac
affecting the analysis neural
(2017 ) cricket From cur-acy
match network is
cricinfo
used
“Home
Comparing
advantage
Ananda Predicting run rate Last 12 Only for
Linear ” and
Bandulasiri the winner in model with years ODI
regression benefit of
(2016) ODI cricket regression ODI data matches
winning
model
toss
Future
work will
Score and Comparing
be to
winning bayes 2002- Higher
Vishal single, increase
prediction in Naïve bayes classifier 2014 all accuracy
prateek Bhatia accuracy
cricket classifier with matches for bayes
(2015) and
through data regression played classifier
include
mining model
toss as a
parameter
This project will take all factors in consideration for predicting scores even the
weather of the day/night, and it will be for all formats of the game.
Y=A+Bx
While training the model we are given:-
Random Forest regression – The basic concept behind Random Forest is that
it combines multiple decision trees to determine the final output.
GitHub:-
-
b. Is your project based on any other reference project (Stanford Univ. or MIT)?
- Yes, Using Machine Learning to Predict the Outcome of English County twenty
over Cricket Matches Stylianos Kampakis, University College London.
-
c. How does your project differ from the reference project?*
- The reference project does not make prediction for all formats of the game whereas
my project takes each format along with various factors affecting the game.
l = len(y_pred)
for i in range(0,l):
if(abs(y_pred[i]-y_test[i]) <= thresold):
right += 1
return ((right/l)*100)
import pandas as pd
# Importing the dataset
dataset = pd.read_csv('data/odi.csv')
X = dataset.iloc[:,[7,8,9,12,13]].values
y = dataset.iloc[:, 14].values
# Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.25,
random_state = 0)
# Feature Scaling
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)
Output screen:-
Here the final score was predicted after we entered the input values which are:-
Features: [runs,wickets,overs,striker,non-striker]
We input the current runs, wickets and scores of both striker and non-striker end.
R-squared is a statistical measure of how close the data are to the fitted regression
line.
After getting the output we wait for the expected result which will be to predict the final
score of a team after recognising the patterns from the dataset and measure its accuracy using
custom values which will take the current scenario of the game as input.
# Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.25,
random_state = 0)
# Feature Scaling
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)
Output Screen:-
Finally, Random forest is better than linear regression as random Forest is able to discover
more complex dependencies at the cost of more time for fitting.
So chances are that Random forest will be able to predict scores correctly
7. References
1- International Journal of Trend in Research and Development, Volume 5(4), ISSN: 2394-
9333 www.ijtrd.com IJTRD | July – Aug 2018 Available Online@www.ijtrd.com 91
Prediction of Live Cricket Score and Winning 1Rameshwari A. Lokhande and 2Pramila M.
Chawan, 1Student, 2Professor, 1,2Computer and IT Dept, Veermata Jeejabai Technological
Institute, Mumbai, India
2- Using Machine Learning to Predict the Outcome of English County twenty over Cricket
Matches Stylianos Kampakis, University College London, stylianos.kampakis@gmail.com
William Thomas, University College London
3- Score and Winning Prediction in Cricket through Data Mining Tejinder Singh, Vishal Singla,
Parteek Bhatia Computer Science & Engineering Computer Science & Engineering Computer
Science & Engineering Thapar University Thapar University Thapar University Patiala,
Punjab, India Patiala, Punjab, India Patiala, Punjab, India teji.tsk@gmail.com
vsingla160@gmail.com parteek.bhatia@thapar.edu
4- Predicting the Winner in One Day International Cricket Ananda Bandulasiri, Ph.D.
***************************************************************************