The continuous sample of working lives: improving its representativeness

Juan Manuel Pérez-Salamero González,Carlos Vidal-Meliá,Marta Regúlez-Castillo

doi:10.1007/s13209-017-0154-0

Juan Manuel Pérez-Salamero González, Carlos Vidal-Meliá + Show 1 more

Open Access

https://doi.org/10.1007/s13209-017-0154-0

Copy DOI

Journal: SERIEs	Publication Date: Mar 1, 2017
Citations: 2	License type: open-access

Affiliation: University of Valencia, University of the Basque Country

Abstract

This paper studies the representativeness of the Continuous Sample of Working Lives (CSWL), a set of anonymized microdata containing information on individuals from Spanish Social Security records. We examine several CSWL waves (2005–2013) and show that it is not representative for the population with a pension income. We then develop a methodology to draw a large dataset from the CSWL that is much more representative of the retired population in terms of pension type, gender and age. This procedure also makes it possible for users to choose between goodness of fit and subsample size. In order to illustrate the practical significance of our methodology, the paper also contains an application in which we generate a large subsample distribution from the 2010 CSWL. The results are striking: with a very small reduction in the size of the original CSWL, we significantly reduce errors in estimating pension expenditure for 2010, with a p value greater or equal to 0.999.

Full Text