Abstract

Tracking and recognition of objects, such as faces, in video is commonly accomplished in independent fashion. However, important information is contained in both problems that could be used to increase the overall recognition accuracy. We propose a unified integer program (IP) based framework for multi-object tracking and recognition in video, where the two tasks are conducted jointly, using a set of natural constraints. In the domain of multiple face recognition, pairing constraints limit the number of objects that can be labeled with the same identity while temporal constraints allow the important information about objects identities's to be used to improve tracking. Despite its appeal, the solving the IP objective can be inefficient in real-world scenarios. For this reason, we employ an approximate Generalized Assignment Problem (GAP) solution to the IP problem, which is both theoretically appealing and computationally highly efficient. We finally demonstrate that the IP and GAP methods of conducting multi-object tracking and recognition can be successfully applied to real world videos where the traditional methods of conducting tracking and recognition separately fail to produce satisfactory results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.