Abstract

Earth-observing remote sensing data, including aerial photography and satellite imagery, offer a snapshot of the world from which we can learn about the state of natural resources and the built environment. The components of energy systems that are visible from above can be automatically assessed with these remote sensing data when processed with machine learning methods. Here, we focus on the information gap in distributed solar photovoltaic (PV) arrays, of which there is limited public data on solar PV deployments at small geographic scales. We created a dataset of solar PV arrays to initiate and develop the process of automatically identifying solar PV locations using remote sensing imagery. This dataset contains the geospatial coordinates and border vertices for over 19,000 solar panels across 601 high-resolution images from four cities in California. Dataset applications include training object detection and other machine learning algorithms that use remote sensing imagery, developing specific algorithms for predictive detection of distributed PV systems, estimating installed PV capacity, and analysis of the socioeconomic correlates of PV deployment.

Highlights

  • Background & SummaryFor many years, aerial photography was the primary source of commercial high-resolution imagery, including multispectral color orthoimagery

  • Aerial photography was the primary source of commercial high-resolution imagery, including multispectral color orthoimagery

  • In object detection[5,6,7,8], the goal is to identify all instances in the imagery of a particular object type[9,10,11,12] such as roads[13,14,15,16,17,18], buildings[19,20,21,22,23,24,25], vehicles[26,27] solar PV arrays[28], etc

Read more

Summary

Background & Summary

Aerial photography was the primary source of commercial high-resolution imagery, including multispectral color orthoimagery (imagery that has been orthorectified so the image lacks spatial distortion). To provide a publicly available means of generating this granular information for any geographic region of interest, we created a dataset originally collected to train machine learning object detection algorithms to develop the process of automatically identifying solar PV locations using high-resolution orthoimagery. This dataset contains the geospatial coordinates and border vertices for over 19,000 solar panels from four cities in California: Fresno, Stockton, Oxnard, and Modesto.

Methods
Data Records
Technical Validation
Decimal Longitude
Name of the image file containing this solar array
Array Vertices of the Polygon in Pixels
Usage Notes
Number of Solar Arrays Percent Arrays J
Data Citations
Author Contributions
Findings
Additional Information
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call