Abstract

Since 1985, the Demographic and Health Surveys (DHS) Program has conducted more than 400 surveys in over 90 countries. These surveys provide decision makers with key measures of population demographics, health and nutrition, enabling informed policy evaluation. Though standard health indicators are routinely published in survey final reports, much of the value of DHS is derived from the ability to download and analyse standardised microdata datasets for subgroup analysis, pooled multi-country analysis, and extended research studies. We have developed an open-source, freely available R package, ‘rdhs’, to facilitate management and processing of DHS survey data. The package provides a suite of tools to (1) access standard survey indicators through the DHS Program API, (2) identify all survey datasets that include a particular topic or indicator relevant to a particular analysis, (3) directly download survey datasets from the DHS website, (4) load datasets and data dictionaries into R, and (5) extract variables and pool harmonised datasets for multi-survey analysis. We detail the core functionality of ‘rdhs’ by demonstrating how the package can be used first to compare trends in the prevalence of anaemia among women between countries, and then to conduct a secondary analysis of the relationship between education and anaemia.

Highlights

  • The Demographic and Health Surveys (DHS) Program has collected and disseminated population survey data from over 90 countries for more than 30 years[1]

  • The rdhs package was designed to facilitate the management and processing of DHS survey data in the R statistical software environment[6]. It does so both by acting as an application programming interface (API) client, giving access to all data provided through the DHS API, and by helping to download the standardised recoded microdatasets from the DHS website and read them into conventional R data structures

  • Between 1987 and March 2019 the DHS Program conducted and published data from 315 surveys, which represents over 12,000 dataset files that can be freely downloaded for further analysis

Summary

Introduction

The Demographic and Health Surveys (DHS) Program has collected and disseminated population survey data from over 90 countries for more than 30 years[1]. The rdhs package was designed to facilitate the management and processing of DHS survey data in the R statistical software environment[6]. It does so both by acting as an application programming interface (API) client, giving access to all data provided through the DHS API, and by helping to download the standardised recoded microdatasets from the DHS website and read them into conventional R data structures. The package also caches the data dictionaries associated with the survey datasets, enabling fast querying for survey variables of interest across multiple surveys.
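As a rough illustration of the API-client and download workflow described above, the following R sketch looks up surveys tagged with an anaemia survey characteristic and lists their dataset files. It is a minimal sketch, not the authors' worked example: the helper name `find_anaemia_datasets`, the country codes, the start year, and the e-mail and project placeholders are all assumptions made for illustration. It also assumes that the rdhs package is installed, that network access is available, and that microdata downloads use credentials registered with the DHS Program.

```r
# Hypothetical helper (not part of rdhs): list the dataset files for surveys
# that collected anaemia data. Calling it needs rdhs and network access.
find_anaemia_datasets <- function(country_ids, start_year) {
  # Survey characteristics are topic tags attached to each survey.
  sc <- rdhs::dhs_survey_characteristics()
  is_anaemia <- grepl("anemia", sc$SurveyCharacteristicName, ignore.case = TRUE)
  anaemia_ids <- sc$SurveyCharacteristicID[is_anaemia]

  # Surveys carrying that characteristic in the chosen countries and period.
  surveys <- rdhs::dhs_surveys(
    surveyCharacteristicIds = anaemia_ids,
    countryIds = country_ids,
    surveyType = "DHS",
    surveyYearStart = start_year
  )

  # Individual (women's) recode files in flat format for those surveys.
  rdhs::dhs_datasets(
    surveyIds = surveys$SurveyId,
    fileFormat = "flat",
    fileType = "IR"
  )
}

# Illustrative usage; runs only in an interactive session.
if (interactive()) {
  datasets <- find_anaemia_datasets(c("CD", "TZ"), 2010)

  # Placeholder credentials: replace with your own DHS account details.
  rdhs::set_rdhs_config(email = "you@example.org",
                        project = "Your registered DHS project")

  # Download the files, search the cached data dictionaries, and pool
  # the matching variables across surveys.
  downloads <- rdhs::get_datasets(datasets$FileName)
  questions <- rdhs::search_variable_labels(datasets$FileName,
                                            search_terms = "anemia")
  extract <- rdhs::extract_dhs(questions, add_geo = FALSE)
  pooled <- rdhs::rbind_labelled(extract)
}
```

The API queries are wrapped in a function so that the metadata lookup, which needs no credentials, stays separate from the download and extraction steps, which do.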

Methods
Discussion