Abstract

Automatic detection and locating of objects such as poles, traffic signs, and building corners in street scenes captured from a mobile mapping system has many applications. Template matching is a technique that could automatically recognise the counterparts or correspondents of an object from multi-view images. In this study, we aim at finding correspondents of an object from wide baseline panoramic images with large geometric deformations from sphere projection and significant systematic errors from multi-camera rig geometry. Firstly, we deduce the camera model and epipolar model of a multi-camera rig system. Then, epipolar errors are analysed to determine the search area for pixelwise matching. A low-cost laser scanner is optionally used to constrain the depth of an object. Lastly, several classic feature descriptors are introduced to template matching and evaluated on the multi-view panoramic image dataset. We propose a template matching method combining a fast variation of a scale-invariant feature transform (SIFT) descriptor. Our method experimentally achieved the best performance in terms of accuracy and efficiency comparing to other feature descriptors and the most recent robust template matching methods.

Highlights

  • A ground mobile mapping system (MMS) mounted with multiple sensors such as a mono/stereo camera, panoramic camera, laser scanner, and GPS/INS has shown a wide range of implications in city planning, 2D/3D map making, traffic surveillance, and autonomous cars [1,2,3,4]

  • SURF descriptor performed similar to the dual-space intensity and better thanAthsetHo OefGficdieensccryi,pmtoor.re complex descriptors result in less efficiency, and the scale-invariant feature transform (SIFT) descriptor is muchAlsowtoeerftfihcainenthcye, dmuoarle-spcoamcepilnetxendseitsycriinpteoxrescuretsiounlt

  • Our AccSIFT descriptor improved the efficiency by 450% comparing to the original version, and is even faster than the dTuaabl-lesp3.aCceominptaerinsosnityo.f the different descriptors on the Kashiwa data

Read more

Summary

Introduction

A ground mobile mapping system (MMS) mounted with multiple sensors such as a mono/stereo camera, panoramic camera, laser scanner, and GPS/INS (global positioning system/inertial navigation system) has shown a wide range of implications in city planning, 2D/3D map making, traffic surveillance, and autonomous cars [1,2,3,4]. An intuitive solution is to automatically recognize the designated object instances with high precision and recall rate, calculate their geolocation by GPS/INS and triangulation according to specific points in objects (for example, the geometric center of a manhole cover or the base of a pole). This solution can hardly be approached in practice with the difficulty of accurately extracting and segmenting objects. Available online: https://www.ptgrey.com/ladybug3-360degree-firewire-spherical-camera-systems (accessed on 1 May 2018)

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call