Abstract

Video or image-based people counting in real-time has multiple applications in intelligent transportation, density estimation or class management, and so on. This problem is usually carried out by detecting people using conventional detectors. However, this approach can be failed when people stay in various postures or are occluded by each other. In this paper, we notice that even a main part of human body is occluded, their face and head are still observable. We then propose a method that counts people based on face and head detection and pairing. Instead of deploying only face or head detector, we apply both detectors as in many cases the human does not turn his/her face to camera then head detector takes advantage. Otherwise, face detector produces reliable results. The fact of combining both head and face detection results will lead to duplicated responses for one person. We then propose a simple yet effective alignment technique to pair a face with a head of a person. Subsequently, the remaining heads and faces which are not paired with any other faces or heads will be added to our people counter to increase the true positive rate. We evaluate our proposed method on four datasets (Hollywood, Casablanca, Wider Face, and our own dataset). The experimental results show an improvement of average precision and recall comparing to the original head or face detectors.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call