Abstract

Abstract. This paper describes the creation of HadISD: an automatically quality-controlled synoptic resolution dataset of temperature, dewpoint temperature, sea-level pressure, wind speed, wind direction and cloud cover from global weather stations for 1973–2011. The full dataset consists of over 6000 stations, with 3427 long-term stations deemed to have sufficient sampling and quality for climate applications requiring sub-daily resolution. As with other surface datasets, coverage is heavily skewed towards Northern Hemisphere mid-latitudes. The dataset is constructed from a large pre-existing ASCII flatfile data bank that represents over a decade of substantial effort at data retrieval, reformatting and provision. These raw data have had varying levels of quality control applied to them by individual data providers. The work proceeded in several steps: merging stations with multiple reporting identifiers; reformatting to netCDF; quality control; and then filtering to form a final dataset. Particular attention has been paid to maintaining true extreme values where possible within an automated, objective process. Detailed validation has been performed on a subset of global stations and also on UK data using known extreme events to help finalise the QC tests. Further validation was performed on a selection of extreme events world-wide (Hurricane Katrina in 2005, the cold snap in Alaska in 1989 and heat waves in SE Australia in 2009). Some very initial analyses are performed to illustrate some of the types of problems to which the final data could be applied. Although the filtering has removed the poorest station records, no attempt has been made to homogenise the data thus far, due to the complexity of retaining the true distribution of high-resolution data when applying adjustments. Hence non-climatic, time-varying errors may still exist in many of the individual station records and care is needed in inferring long-term trends from these data. This dataset will allow the study of high frequency variations of temperature, pressure and humidity on a global basis over the last four decades. Both individual extremes and the overall population of extreme events could be investigated in detail to allow for comparison with past and projected climate. A version-control system has been constructed for this dataset to allow for the clear documentation of any updates and corrections in the future.

Highlights

  • The Integrated Surface Database (ISD) held at NOAA’s National Climatic Data Center is an archive of synoptic reports from a large number of global surface stations (Smith et al, 2011; Lott, 2004; see http://www.ncdc.noaa.gov/oa/ climate/isd/index.php)

  • All records passing quality control (QC) are available in HadISD versions “.all”, but further checks are performed on stations for inclusion in HadISD versions “.clim”, to ensure adequacy for long-term climate monitoring

  • We note that the raw ISD data may have differing levels of QC applied by National Met Services before ingestion into the ISD

Read more

Summary

Introduction

The Integrated Surface Database (ISD) held at NOAA’s National Climatic Data Center is an archive of synoptic reports from a large number of global surface stations (Smith et al, 2011; Lott, 2004; see http://www.ncdc.noaa.gov/oa/ climate/isd/index.php). In this paper we outline the steps undertaken to provide a new quality-controlled version, called HadISD, which is based on the raw ISD records, in netCDF format for selected variables for a subset of the stations with long records. This new dataset will allow the easy study of the behaviour of short-timescale climate phenomena in recent decades, with the subsequent comparison to past climates and future climate projections. In future periodic updates are planned to keep the dataset up-to-date

Compositing stations
Selection and retrieval of an initial set of stations
Quality control steps and analysis
Test 1: inter-station duplicate check
Test 2: duplicate months check
Test 3: odd cluster check
Test 4: frequent value check
Test 5: diurnal cycle check
Test 6: distributional gap check
Test 7: known records check
Test 9: climatological outlier check
4.1.10 Test 10: spike check
4.1.11 Test 11: temperature and dewpoint temperature cross-check
4.1.12 Test 12: cloud coverage logical checks
4.1.13 Test 13: unusual variance check
4.1.14 Test 14: nearest neighbour data checks
Complete match to least temporally complete month’s record for T
4.1.15 Test 15: station clean-up
Test order
Fine-tuning
Validation and analysis of quality control results
Notes on geographical prevalence
Global overview of the quality control procedure
Final station selection
31 December 1990–31 December 2005 31 December 2000
Median diurnal temperature range
Temperature variations over 24 hours
Findings
Summary
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call