Abstract

Visual identification of objects using cameras requires precise detection, localization, and recognition of the objects in the field-of-view. The visual identification problem is very challenging when the objects look identical and features between distinct objects are indistinguishable, even with state-of-the-art computer vision techniques. The problem becomes significantly more challenging when the objects themselves do not carry rich geometric and photometric features, for example, in visual identification and tracking of light emitting diodes (LED) for visible light communication (VLC) applications. In this paper, we present a camera based visual identification solution where objects or regions of interest are tagged with an actively transmitting LED. Motivated by the concept of pilot symbols, typically used for synchronization and channel estimation in radio communication systems, the LED actively transmits unique pilot symbols which are detected by the camera across a series of image frames using our proposed spatio-temporal correlation based algorithm. We setup the visual identification as a problem of localization of the LED on the camera image, which involves identifying the (<i>pixels</i>) and the <i>unique ID</i> corresponding to the LED. In this paper, we present the algorithm and trace-based evaluation of the identification accuracy under real-world conditions including indoor, outdoor, static and mobile scenarios. In addition to micro-benchmarking the localization accuracy of our technique across different parameter configurations, we show that our technique outperforms comparative techniques, including, color based detection, support-vector machine based (SVM) machine learning, and you only look once (YOLO), which is a state-of-the-art convolutional neural network (CNN) deep learning based object identification tool.

Highlights

  • The advent of camera-based automation in mobile systems, advances in autonomous robotic systems and pervasive use of visual perception as an essential modality in cyber-physical systems, have urged the need for visual identification of objects in a given scene with high accuracy and precision

  • The constantly changing scenery, due to motion, further complicates the process as the visual features are ‘available’ only for a short duration. We propose that such objects in the scene, those which can lead to such vision bottlenecks, be tagged with a light emitting diode (LED) which constantly transmits a unique ID and a camera is used to localize this light emitting diodes (LED)

  • Outdoor, static and motion cases, and comparing with traditional ML and non-ML techniques for LED detection, we showed optical correlation outperforms the comparative techniques

Read more

Summary

Introduction

The advent of camera-based automation in mobile systems, advances in autonomous robotic systems and pervasive use of visual perception as an essential modality in cyber-physical systems, have urged the need for visual identification of objects in a given scene with high accuracy and precision. This problem has long been studied and addressed along the dimensions of object detection/recognition and localization using computer vision. The advancements in deep learning have improved vision based recognition fidelity.

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.