Abstract

Abstract The International Swimming Federation (FINA) uses a very simple points system with the aim to rank swimmers across all swimming events. The points acquired is a function of the ratio of the recorded time and the current world record for that event. With some world records considered ‘better’ than others however, bias is introduced between events, with some being much harder to attain points where the world record is hard to beat. A model based on extreme value theory is introduced, where swim times are modelled through their rate of occurrence, and with the distribution of the best times following a generalised Pareto distribution. Within this framework, the strength of a particular swim is judged based on its position compared to the whole distribution of swim times, rather than just the world record. This model also accounts for the date of the swim, as training methods improve over the years, as well as changes in technology, such as full body suits. The parameters of the generalised Pareto distribution, for each of the 34 individual long course events, will be shown to vary with covariates, leading to a novel single unified description of swim quality over all events and time. This structure, which allows information to be shared across all strokes, distances, and genders, improves the predictive power as well as the model robustness compared to equivalent independent models. A by-product of the model is that it is possible to estimate other features of interest, such as the ultimate possible time, the distribution of new world records for any event, and to correct swim times for the effect of full body suits. The methods will be illustrated using a dataset of the best 500 swim times for each event in the period 2001–2018.

Highlights

  • On the face of it, comparing the performances of two swimmers in a given competition appears straightforward, compare their swim-times

  • This paper utilises extreme value theory to model the very best swim-times as being observations from a generalised Pareto distribution (GPd) so that the strength of a particular swim is judged on its position compared to the whole distribution of swim-times across all events, rather than just the world record for that event

  • The rankings are determined by the r-value of a swim-time x, that is, the rate at which observations better than x occur in the given event

Read more

Summary

Introduction

On the face of it, comparing the performances of two swimmers in a given competition appears straightforward, compare their swim-times. This paper utilises extreme value theory to model the very best swim-times as being observations from a generalised Pareto distribution (GPd) so that the strength of a particular swim is judged on its position compared to the whole distribution of swim-times across all events, rather than just the world record for that event.

Extremes of identically distributed variables
Extreme values of non-identically distributed variables
The Data
Separate Event Model
Parametric Model
Semi-Parametric model
Assessment of model M7b fit
Rankings
Ultimate times
Expected new world record time
Time until world record is next set for an event
Probability that a record is next set in a particular event
Adjusting Swim-Suit Influenced Times
Discussion
A Spline Construction
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call