The growing sports betting market works on the premise that sports are unpredictable, making it more likely to be wrong than right, as the user has to choose between win, draw, or lose. So could football, the world’s most popular sport, be predictable? This article studies this question using deep neural networks to predict the outcome of football matches using publicly available data. Data from 24,760 matches from 13 leagues over 2 to 10 years were used as input for the neural network and to generate a state-of-the-art validated feature, the pi-rating, and the parameters proposed in this work, such as relative attack, defence, and mid power. The data were pre-processed to improve the network’s interpretation and deal with missing or inconsistent data. With the validated pi-rating, data organisation methods were evaluated to find the most fitting option for this prediction system. The final network has four layers with 100, 80, 5, and 3 neurons, respectively, applying the dropout technique to reduce overfitting errors. The results showed that the most influential features are the proposed relative defending, playmaking, and midfield power, and the home team goal expectancy features, surpassing the pi-rating. Finally, the proposed model obtained an accuracy of 52.8% in 2589 matches, reaching 80.3% in specific situations. These results prove that football can be predictable and that some leagues are more predictable than others.
Read full abstract