Object Detection in Video with Spatiotemporal Sampling Networks

Gedas Bertasius,Lorenzo Torresani,Jianbo Shi

doi:10.1007/978-3-030-01258-8_21

Object Detection in Video with Spatiotemporal Sampling Networks

Gedas Bertasius, Lorenzo Torresani + Show 1 more

Open Access

https://doi.org/10.1007/978-3-030-01258-8_21

Copy DOI

Publication Date: Jan 1, 2018

Citations: 221

Affiliation: Philadelphia University, University of Pennsylvania, Dartmouth College

#Object Detection In Videos #Detection In Videos + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We propose a Spatiotemporal Sampling Network (STSN) that uses deformable convolutions across time for object detection in videos. Our STSN performs object detection in a video frame by learning to spatially sample features from the adjacent frames. This naturally renders the approach robust to occlusion or motion blur in individual frames. Our framework does not require additional supervision, as it optimizes sampling locations directly with respect to object detection performance. Our STSN outperforms the state-of-the-art on the ImageNet VID dataset and compared to prior video object detection methods it uses a simpler design, and does not require optical flow data for training.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.