Abstract

Abstract Each year, millions of people fill out a bracket to predict the outcome of the popular NCAA men’s college basketball tournament, known as March Madness. In this paper we present a new methodology for team ranking and use it to predict the NCAA basketball tournament. We evaluate our model in Kaggle’s March Machine Learning Mania competition, in which contestants were required to predict the results of all possible games in the tournament. Our model combines two methods: the least squares pairwise comparison model and isotonic regression. We use existing team rankings (such as seeds, Sagarin and Pomeroy ratings) and look for a monotonic, non-linear relationship between the ranks’ differences and the probability to win a game. We use the isotonic property to get new rankings that are consistent with both the observed outcomes of past tournaments and previous knowledge about the order of the teams. In the 2016 and 2017 competitions, submissions based on our methodology consistently placed in the top 5% out of over 800 other submissions. Using simulations, we show that the suggested model is usually better than commonly used linear and logistic models that use the same variables.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call