Abstract
As a basic research topic of computer vision, object detection is facing the challenges of being extended for video object detection. For deformation anomaly, low-quality candidate regions feature can be caused by motion blur, rare poses and video defocus, resulting to poor detection performance in videos. In this paper, we propose a video object detector, which consists of an image-based detector using deep reinforcement learning and a correction module using objects tracking algorithms. We treat videos as continuous images. The agent will search on the feature map of each image by a policy network to generate high-quality candidate regions. These candidate regions will be corrected by a correction module. The bounding box at the current image from the tracklet proposed by the object tracking algorithms will correct the candidate regions. Experiments are conducted on the MOT 15 dataset. Our detector achieves better detection performance, exactly 68.28% with 5.01 points improvement compared to the Faster-R CNN as the single-frame baseline.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.