Abstract

High-speed Internet access (’broadband’) is available nearly everywhere in urban and suburban parts of the contiguous U.S. However, there are many rural areas which do not have broadband access yet. We implemented a machine learning framework consisting of multiple supervised models for accurately predicting broadband expansion using geographic and demographic data on a census block group level. We then utilized these models to gain an understanding of how these features relate to broadband expansion. Also, we discussed how this data must be considered differently when investigating unsubsidized broadband expansion rather than government subsidized expansion. Understanding relationships between geographic and demographic features with respect to broadband speed and accessibility is reliant upon publicly available data provided by the Federal Communications Commission (FCC), United States Census Bureau, and the Universal Service Administrative Company. The project included identifying, cleaning, and aggregating the data as well as creating a cloud-based-storage infrastructure with Google BigQuery. We found that by considering geographic and demographic data, effective models can be constructed to both understand previous broadband expansion and predict future expansion. Notably, we found that road, housing and population density, winter temperature, and elevation are of significant importance in determining whether an area receives broadband.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.