Abstract
Geospatial data conflation is the process of combining two datasets to create a better one. It has received increased research attention due to the emergence of new data sources and the need to combine information from these sources in spatial analyses. Many conflation methods exist to date, ranging from simple ones based on spatial join, to sophisticated methods based on statistics and optimization models. This paper focuses on the optimization-based conflation approach. It treats feature-matching in conflation as an optimization problem of finding a plan to match features in two datasets that minimizes the total discrepancy. Optimization based conflation methods may overcome some limitations of conventional methods, such as sub-optimality and greediness. However, they have often been deemed impractical in day-to-day analysis because they induce high computational costs (especially in combining large geospatial data).In this paper, we demonstrate the feasibility of performing optimization-based conflation for large geographic data in Geographic Information Systems. This is accomplished by utilizing efficient network flow-based conflation models and a divide-and-conquer strategy that allows the conflation models to scale to large data. Experiments show that the network-flow based model achieves average recall and precision rates of 97.7% and 90.8%, respectively in small test areas, and outperforms the traditional assignment problem by about 9% each. For larger data, it took the original network-flow model (without divide-and-conquer) nearly two days to conflate the road network in a portion of Los Angeles area near the LAX international airport. By contrast, the same model can be used to conflate the road networks of the entire Los Angeles County, CA in under 3 h with the divide and conquer strategy.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.