Abstract

Automatically extracting geographic information, especially information represented by Points of Interest (POIs), is critical for identifying locations and provides the basis for various location-based services. Geospatial POI data are currently available through open map services (e.g., Google Maps and OpenStreetMap). However, the data behind these services are either collected through expensive commercial purchase and company investment or gathered from volunteered contributions of high uncertainty. Given the rapid growth of geospatial data on the Web, we propose an automatic approach that extracts geographic information from web search engine results to build POI resources, mitigating the drawbacks of these traditional means. In the first step, we submit the POI types extracted from Google Maps, together with the street names obtained from OpenStreetMap, to the Google search engine and retrieve potential POI addresses by parsing the search results. In the second step, we query the Google search engine again with the retrieved addresses to extract potential place names. In the third step, we query the Google search engine once more with both the place names and their corresponding addresses to verify whether the place names are correct. The output of the work is a place-name dataset. To evaluate the approach, we apply it to 20 blocks each in Chicago and Houston in the U.S. In the experiments, we use Google Maps, which has high data quality, as the reference and compare our results with those from OpenStreetMap and Wikimapia. The final results indicate that the proposed approach can effectively produce place-name datasets on a par with Google Maps and that it outperforms OpenStreetMap and Wikimapia.
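
The three search rounds can be read as a small pipeline. The following Python sketch is only an illustration under stated assumptions, not the authors' implementation: the `search` callable stands in for a search engine client and is hypothetical, the address pattern is a simplified guess at a U.S. street address, and the "Name - address" snippet layout used for name extraction is assumed for the example.

```python
import re
from typing import Callable, Iterable, List, Set, Tuple

# Hypothetical interface: maps a query string to a list of result snippets.
SearchFn = Callable[[str], List[str]]


def extract_addresses(search: SearchFn, poi_types: Iterable[str],
                      streets: Iterable[str]) -> Set[str]:
    """Round 1: query 'POI type + street name' and parse candidate addresses."""
    poi_types = list(poi_types)
    found: Set[str] = set()
    for street in streets:
        # Assumed address shape: house number, optional direction, street name.
        pattern = re.compile(
            rf"\b\d{{1,5}}\s+(?:[NSEW]\s+)?{re.escape(street)}\b")
        for poi_type in poi_types:
            for snippet in search(f"{poi_type} {street}"):
                found.update(pattern.findall(snippet))
    return found


def extract_names(search: SearchFn,
                  addresses: Iterable[str]) -> Set[Tuple[str, str]]:
    """Round 2: query each address and collect candidate place names."""
    candidates: Set[Tuple[str, str]] = set()
    for address in addresses:
        for snippet in search(address):
            # Assumed snippet layout: 'Place Name - address ...'.
            head, sep, _ = snippet.partition(" - ")
            if sep and address in snippet:
                candidates.add((head.strip(), address))
    return candidates


def verify(search: SearchFn, candidates: Iterable[Tuple[str, str]],
           min_hits: int = 1) -> Set[Tuple[str, str]]:
    """Round 3: keep a pair only if name and address co-occur in results."""
    verified: Set[Tuple[str, str]] = set()
    for name, address in candidates:
        snippets = search(f'"{name}" "{address}"')
        hits = sum(1 for s in snippets if name in s and address in s)
        if hits >= min_hits:
            verified.add((name, address))
    return verified


def build_place_names(search: SearchFn, poi_types: Iterable[str],
                      streets: Iterable[str]) -> Set[Tuple[str, str]]:
    """Chain the three rounds into a set of (place name, address) pairs."""
    addresses = extract_addresses(search, poi_types, streets)
    candidates = extract_names(search, addresses)
    return verify(search, candidates)
```

Passing the search function in as a parameter keeps the sketch independent of any particular search API and lets the pipeline be exercised against canned results.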
