Cricket Meets Machine Learning: A Classification Approach to IPL Match Prediction

Authors

DOI:

https://doi.org/10.53573/rhimrj.2026.v13n04.006

Keywords:

Indian Premier League (IPL), Machine Learning, Classification Algorithms, Match Outcome Prediction, Data Analytics, Sports Analytics, Predictive Modeling, Random Forest, Logistic Regression, Decision Tree

Abstract

The Indian Premier League is one of the most well-known Twenty20 cricket tournaments, which produces a large amount of organized match data. The goal of this research paper is to create a classification system based on machine learning that uses past data from the Indian Premier League between 2008 and 2025 to forecast match outcomes. The dataset features characteristics such as team names, the winner of the toss, the decision made by the toss, and the venue. Various data cleaning and transformation operations have been performed to preprocess the dataset, which include dealing with missing values and encoding categorical variables. Several machine learning classifiers like Logistic Regression, Decision Tree, Random Forest, and Support Vector Machine have been utilized to create prediction models. To assess the models' performance, the dataset was split into training and test sets. Several evaluation metrics like accuracy score, confusion matrix, and classification report were utilized to compare various models' effectiveness. According to the experiment results, ensemble techniques like Random Forest provide more accurate predictions than other models.

References

Kaggle. IPL Ball By Ball 2008–2024 Dataset. https://www.kaggle.com/datasets/dgsports/ipl-ball-by-ball-2008-to-2024

Bhattacharya, A., & Basu, S. (2021). Predictive Analytics in T20 Cricket: Player Performance Modeling Using Machine Learning. International Journal of Sports Analytics, 7(2), 45–60.

Kumar, R., & Singh, P. (2020). Impact of Contextual Metrics on Player Evaluation in IPL. Journal of Sports Data Science, 5(1), 12–28.

Jamiewelsh2. IPL Ball-by-Ball Dataset (2008–2020). Kaggle. https://www.kaggle.com/jamiewelsh2/ipl-ball-by-ball

Lal, S., & Sharma, V. (2022). Machine Learning Approaches to Auction Strategy in Indian Premier League. International Journal of Sports Management and Analytics, 9(3), 75–89.

Cieslak, A., & Ghosh, S. (2020). Ensemble Methods in Sports Analytics: Application to Cricket. Journal of Computational Sports Science, 3(2), 100–115.

ESPNCricinfo. IPL Stats and Records. https://www.espncricinfo.com/

Colorstech. IPL 2008–2024 Dataset for Data Analysis & Interactive Dashboards. https://colorstech.net/data-analytics/ipl-2008-to-2024-dataset-for-data-analysis-interactive-dashboards

Sinha, A. (2020). Application of Machine Learning in Cricket and Predictive Analytics of IPL 2020. Preprints Research Journal.

Tripathi, A., Islam, R., Khandor, V., & Murugan, V. (2020). Prediction of IPL Matches Using Machine Learning While Tackling Ambiguity in Results. Indian Journal of Science and Technology.

Barot, H., Kothari, A., Bide, P., Ahir, B., & Kankaria, R. (2020). Analysis and Prediction for the Indian Premier League. International Conference on Emerging Technologies.

Srikantaiah, K. C., Khetan, A., Kumar, B., Tolani, D., & Patel, H. (2021). Prediction of IPL Match Outcome Using Machine Learning Techniques. arXiv Research Paper.

Menon, A., Khator, D., Prajapati, D., & Ekbote, A. (2022). IPL Prediction Using Machine Learning. Indian Journal of Computer Science.

Downloads

Published

2026-04-15

How to Cite

Sona, S. R., & Iswarya, M. (2026). Cricket Meets Machine Learning: A Classification Approach to IPL Match Prediction . RESEARCH HUB International Multidisciplinary Research Journal, 13(4), 42–49. https://doi.org/10.53573/rhimrj.2026.v13n04.006