Optimal design subsampling from Big Datasets

Laura Deldossi,Chiara Tommasi

doi:10.1080/00224065.2021.1889418

Optimal design subsampling from Big Datasets

Laura Deldossi, Chiara Tommasi

Open Access

https://doi.org/10.1080/00224065.2021.1889418

Copy DOI

Journal: Journal of Quality Technology	Publication Date: Feb 19, 2021
Citations: 8

Affiliation: Università Cattolica del Sacro Cuore, University of Milan

#Redundant Observations #Big Datasets + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.

Full Text