Abstract

In order to better balance the detection accuracy and tracking speed, we propose an online balanced multi-object tracking method (BalMOT), which integrates object detection and appearance extraction into a single network, and can simultaneously output detection and appearance embedding. We also model the training of classification, regression, and embedding features as a multi-task training problem and each part is weighted based on the task-independent uncertainty method. In addition, we introduce the transition layer to optimize the repeated gradient information in the network and reduce the training cost. Through the training, our BalMOT system reaches 71.9% multiple object tracking accuracy (MOTA) on the MOT17 challenge dataset, and the speed fluctuates between 17.4 ~ 22.3 frames per second (FPS) according to the size of the input image.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.