Abstract

Human settlements are usually nucleated around manmade central points or distinctive natural features, forming clusters that vary in shape and size. However, population distribution in geo-sciences is often represented in the form of pixelated rasters. Rasters indicate population density at predefined spatial resolutions, but are unable to capture the actual shape or size of settlements. Here we suggest a methodology that translates high-resolution raster population data into vector-based population clusters. We use open-source data and develop an open-access algorithm tailored for low and middle-income countries with data scarcity issues. Each cluster includes unique characteristics indicating population, electrification rate and urban-rural categorization. Results are validated against national electrification rates provided by the World Bank and data from selected Demographic and Health Surveys (DHS). We find that our modeled national electrification rates are consistent with the rates reported by the World Bank, while the modeled urban/rural classification has 88% accuracy. By delineating settlements, this dataset can complement existing raster population data in studies such as energy planning, urban planning and disease response.

Highlights

  • Background & SummaryThe 2030 Agenda for Sustainable Development has set the target of universal energy access[1] (SDG 7.1)

  • Electricity access inequality is present within the countries of the region, as urban electrification rates tend to be significantly higher than the rural ones[6,7,9,10,11]

  • In September of 2020 a constrained version of WorldPop was released for www.nature.com/scientificdata sub-Saharan African countries[60]. This dataset uses the same methods as the unconstrained WorldPop dataset but to High Resolution Settlement Layer (HRSL) and GHS-POP it uses a built-up layer to remove all cells that do not coincide with building footprints

Read more

Summary

Background & Summary

The 2030 Agenda for Sustainable Development has set the target of universal energy access[1] (SDG 7.1). They consist of pixelated areas, each pixel treated on its own, separated from adjacent cells[17] This can have two implications; 1) different modelling results present themselves in the same settlement even in cases where these settlements are too small for this to be the actual case, and 2) the resolution of the population dataset can create biases (e.g., data represented at different spatial scales for the same study area might not generate consistent results[41]). This issue is labeled as the Modifiable Areal Unit Problem (MAUP)[42,43,44,45]. We generate, validate and publish open population “clusters” for 44 countries in SSA, for the first time

Methods
Limitations
Findings
Code availability
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call