Abstract

The r largest order statistics approach is widely used in extreme value analysis because it may use more information from the data than just the block maxima. In practice, the choice of r is critical. If r is too large, bias can occur; if too small, the variance of the estimator can be high. The limiting distribution of the r largest order statistics, denoted by GEV_r, extends that of the block maxima. Two specification tests are proposed to select r sequentially. The first is a score test for the GEV_r distribution. Due to the special characteristics of the GEV_r distribution, the classical chi-square asymptotics cannot be used. The simplest approach is to use the parametric bootstrap, which is straightforward to implement but computationally expensive. An alternative fast weighted bootstrap or multiplier procedure is developed for computational efficiency. The second test uses the difference in estimated entropy between the GEV_r and GEV_{r-1} models, applied to the r largest order statistics and the r-1 largest order statistics, respectively. The asymptotic distribution of the difference statistic is derived. In a large scale simulation study, both tests held their size and had substantial power to detect various misspecification schemes. A new approach to address the issue of multiple, sequential hypotheses testing is adapted to this setting to control the false discovery rate or familywise error rate. The utility of the procedures is demonstrated with extreme sea level and precipitation data.

Highlights

  • The r largest order statistics is an extension of the block maxima approach that is often used in extreme value modeling

  • The second test uses the difference in estimated entropy between the GEVr and GEVr−1 models, applied to the r largest order statistics and the r − 1 largest order statistics, respectively

  • We proposed two model specification tests for a fixed number of largest order statistics as the basis for selecting r for the r largest order statistics approach in extreme value analysis

Read more

Summary

Introduction

The r largest order statistics is an extension of the block maxima approach that is often used in extreme value modeling. The approach is based on the limiting distribution of the r largest order statistics which extends the generalized extreme value (GEV) distribution (e.g., Weissman 1978). This distribution, denoted by GEVr , has the same parameters as the GEV distribution, which makes it useful to estimate the GEV parameters when the r largest values are available for each block. Testing the exponentiality of the spacings on the Gumbel scale provides an approximate diagnosis of the joint distribution of the r largest order statistics when B is large. The Appendix contains the details of random number generation from the GEVr distribution and a sketch of the proof of the asymptotic distribution of the ED test statistic

Model and data setup
Score test
Parametric bootstrap
Return an approximate p value of Vn as
Multiplier bootstrap
Entropy difference test
Automated sequential testing procedure
Lowestoft sea levels
Annual maximum precipitation
50 Year Return Level
Findings
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call