Abstract

Target-agnostic person tracking and re-identification across multiple non-overlapping cameras is an open vision problem. It is the task of maintaining the correct identity of people at different time instances and possibly different cameras. This study focuses on existing algorithms that facilitate online person tracking by using discriminative spatio-temporal features from video data, and presents the open issues and future research directions. The initial take on the problem introduces person tracking as a pure association problem, where the influence of human appearance, biometric and location information on re-identification are addressed explicitly. These constraints are modeled and used to understand and associate detections in real world environments. Next, a spatio-temporal model using LSTM networks for propagating associations and recovering from errors by taking advantage of the spatial and temporal information in videos is described. The spatio-temporal context indicates a way for discriminative appearance learning. The novelty of the mentioned approaches is that they do not require to learn target-specific appearance models and collect samples to distinguish different people from each other. The methods are evaluated on large-scale tracking datasets. State-of-the-art performance is achieved using motion metadata such as person bounding box and camera number, and shows better associations for the challenging exit-entry cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.