Cricket Meets Machine Learning: A Classification Approach to IPL Match Prediction
DOI:
https://doi.org/10.53573/rhimrj.2026.v13n04.006Keywords:
Indian Premier League (IPL), Machine Learning, Classification Algorithms, Match Outcome Prediction, Data Analytics, Sports Analytics, Predictive Modeling, Random Forest, Logistic Regression, Decision TreeAbstract
The Indian Premier League is one of the most well-known Twenty20 cricket tournaments, which produces a large amount of organized match data. The goal of this research paper is to create a classification system based on machine learning that uses past data from the Indian Premier League between 2008 and 2025 to forecast match outcomes. The dataset features characteristics such as team names, the winner of the toss, the decision made by the toss, and the venue. Various data cleaning and transformation operations have been performed to preprocess the dataset, which include dealing with missing values and encoding categorical variables. Several machine learning classifiers like Logistic Regression, Decision Tree, Random Forest, and Support Vector Machine have been utilized to create prediction models. To assess the models' performance, the dataset was split into training and test sets. Several evaluation metrics like accuracy score, confusion matrix, and classification report were utilized to compare various models' effectiveness. According to the experiment results, ensemble techniques like Random Forest provide more accurate predictions than other models.
References
Kaggle. IPL Ball By Ball 2008–2024 Dataset. https://www.kaggle.com/datasets/dgsports/ipl-ball-by-ball-2008-to-2024
Bhattacharya, A., & Basu, S. (2021). Predictive Analytics in T20 Cricket: Player Performance Modeling Using Machine Learning. International Journal of Sports Analytics, 7(2), 45–60.
Kumar, R., & Singh, P. (2020). Impact of Contextual Metrics on Player Evaluation in IPL. Journal of Sports Data Science, 5(1), 12–28.
Jamiewelsh2. IPL Ball-by-Ball Dataset (2008–2020). Kaggle. https://www.kaggle.com/jamiewelsh2/ipl-ball-by-ball
Lal, S., & Sharma, V. (2022). Machine Learning Approaches to Auction Strategy in Indian Premier League. International Journal of Sports Management and Analytics, 9(3), 75–89.
Cieslak, A., & Ghosh, S. (2020). Ensemble Methods in Sports Analytics: Application to Cricket. Journal of Computational Sports Science, 3(2), 100–115.
ESPNCricinfo. IPL Stats and Records. https://www.espncricinfo.com/
Colorstech. IPL 2008–2024 Dataset for Data Analysis & Interactive Dashboards. https://colorstech.net/data-analytics/ipl-2008-to-2024-dataset-for-data-analysis-interactive-dashboards
Sinha, A. (2020). Application of Machine Learning in Cricket and Predictive Analytics of IPL 2020. Preprints Research Journal.
Tripathi, A., Islam, R., Khandor, V., & Murugan, V. (2020). Prediction of IPL Matches Using Machine Learning While Tackling Ambiguity in Results. Indian Journal of Science and Technology.
Barot, H., Kothari, A., Bide, P., Ahir, B., & Kankaria, R. (2020). Analysis and Prediction for the Indian Premier League. International Conference on Emerging Technologies.
Srikantaiah, K. C., Khetan, A., Kumar, B., Tolani, D., & Patel, H. (2021). Prediction of IPL Match Outcome Using Machine Learning Techniques. arXiv Research Paper.
Menon, A., Khator, D., Prajapati, D., & Ekbote, A. (2022). IPL Prediction Using Machine Learning. Indian Journal of Computer Science.