Abstract
Based on men's singles data from the 2023 Wimbledon Tennis Championships, this study applies advanced statistical and machine learning techniques (including Principal Component Analysis PCA, Uniform Streaming Approximation and Projection UMAP, Random Forests, and Extreme Random Trees Extra-Trees) to explore the dynamics of lead changes during matches. By constructing mathematical models, the study predicts lead changes at key moments of the race that have a decisive impact on the race strategy and outcome. The experimental results show that the Extreme Random Trees model performs well on all evaluation metrics, especially in terms of accuracy, recall and F1 score, demonstrating its ability to handle complex interactions and variations in the data. In addition, the CatBoost model performed well in identifying actual leading changes, but with lower accuracy, possibly due to overfitting certain data patterns. The hybrid model, on the other hand, demonstrated a balanced performance, although it failed to outperform the extreme random tree model in terms of overall effectiveness.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.