Abstract

Background/Objectives: The IPL (Indian Premier League) is one of the most viewed cricket matches in the world. With a perpetual increase in the popularity and advertising associated with it, forecasting the IPL matches is becoming a need for the advertisers and the sponsors. This paper is centered on the implementation of machine learning to foretell the winner of an IPL match. Methods/Statistical analysis: The cricket in the T-20 format is highly unpredictable - many features contribute to the result of a cricket match, and each attribute feature has a weighted impact on the outcome of a game. In this paper, first, a meaningful dataset through data mining was defined; next, essential features using various methods like feature engineering and Analytic Hierarchy Process were derived. Besides, a key issue on data symmetry and the inability of models to handle it was identified, which extends to all types of classification models that compare two or more classes using similar features for both the classes. This concept in the paper is termed as model ambiguity that occurs due to the model’s asymmetric nature. Alongside, different machine learning classification algorithms like Naive Bayes, SVM, k- Nearest Neighbor, Random Forest, Logistic Regression, ExtraTreesClassifier, XGBoost were adopted to train the models for predicting the winner. Findings: As per the investigation, tree-based classifiers provided better results with the derived model. The highest accuracy of 60.043% with Random Forest, with a standard deviation of 6.3% and an ambiguity of 1.4%, was observed. Novelty/Applications: Apart from reporting a more accurate result, the derived model has also solved the problem of multicollinearity and identified the issue of data symmetry (termed as model ambiguity). It can be leveraged by brands, sponsors, and advertisers to keep up their marketing strategies. Keywords: The Indian Premier League; machine learning; analytic hierarchy process; winner prediction; IPL

Highlights

  • IntroductionThe IPL (Indian Premier League) is a 20-20 cricket league in India where eight teams (representing eight cities in India) play against each other

  • The IPL (Indian Premier League) is a 20-20 cricket league in India where eight teams play against each other

  • The models returning different results for the same input fed in two configurations was observed. This concept in the paper is termed as model ambiguity, which occurs due to the model’s inability to interpret data symmetry because of its asymmetric nature

Read more

Summary

Introduction

The IPL (Indian Premier League) is a 20-20 cricket league in India where eight teams (representing eight cities in India) play against each other. This game is India’s biggest cricket festival - the most celebrated and the most viewed, where the action is just not limited to the cricket field. The clatters, promotional events, cheerleaders, advertisements, fan clubs, interactions, and betting are celebrated along with the players and the matches. The entire revenue cycle of the IPL revolves around advertising. The IPL cricket league has proved to be a ’game-changer’ for both cricket and the entire Indian advertising industry [2]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call