Improvement of People Counting by Pairing Head and Face Detections from Still Images

Thi-Oanh Ha, Thanh-Hai Tran, Huong-Giang Doan, Van-Toi Nguyen, Hong-Quan Nguyen, Phuong-Dung Nguyen, Hoang-Nhat Tran, Thi-Lan Le, Hai Vu

doi:10.1109/mapr53640.2021.9585270

Abstract

Real-time people counting from video or still images has many applications, including intelligent transportation, density estimation, and class management. The problem is usually addressed by detecting people with conventional detectors. However, this approach can fail when people adopt various postures or occlude one another. In this paper, we observe that even when a major part of the human body is occluded, the face and head remain observable. We therefore propose a method that counts people by detecting faces and heads and pairing them. Instead of deploying only a face detector or only a head detector, we apply both: in many cases a person does not face the camera, and the head detector then has the advantage; otherwise, the face detector produces reliable results. Combining head and face detections, however, yields duplicate responses for the same person. We therefore propose a simple yet effective alignment technique that pairs a face with the head of the same person. The remaining heads and faces that are not paired with any face or head are then added to the people count to increase the true positive rate. We evaluate the proposed method on four datasets (Hollywood, Casablanca, Wider Face, and our own dataset). The experimental results show improved average precision and recall compared with the original head or face detectors.
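The counting rule described in the abstract (pair each face with a head, then add the unpaired heads and faces) can be illustrated with a minimal sketch. The abstract does not specify the alignment technique, so the pairing criterion used here is an assumption: a face is matched to a head when a sufficient fraction of the face box lies inside the head box, with matches assigned greedily. The names `containment`, `count_people`, and `min_overlap`, and the threshold value 0.5, are hypothetical and not taken from the paper.

```python
from typing import List, Tuple

# Axis-aligned bounding box as (x1, y1, x2, y2), with x1 < x2 and y1 < y2.
Box = Tuple[float, float, float, float]


def containment(face: Box, head: Box) -> float:
    """Fraction of the face box's area that lies inside the head box."""
    ix1, iy1 = max(face[0], head[0]), max(face[1], head[1])
    ix2, iy2 = min(face[2], head[2]), min(face[3], head[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    face_area = (face[2] - face[0]) * (face[3] - face[1])
    return inter / face_area if face_area > 0 else 0.0


def count_people(heads: List[Box], faces: List[Box],
                 min_overlap: float = 0.5) -> int:
    """Count people as: pairs + unpaired heads + unpaired faces.

    Assumption: greedy one-to-one matching on the containment score,
    which stands in for the paper's unspecified alignment technique.
    """
    # Score every head/face combination and keep plausible pairs only.
    candidates = [
        (containment(face, head), hi, fi)
        for hi, head in enumerate(heads)
        for fi, face in enumerate(faces)
    ]
    candidates = [c for c in candidates if c[0] >= min_overlap]
    candidates.sort(key=lambda c: c[0], reverse=True)

    used_heads, used_faces = set(), set()
    pairs = 0
    for _score, hi, fi in candidates:
        if hi in used_heads or fi in used_faces:
            continue  # each detection may belong to at most one pair
        used_heads.add(hi)
        used_faces.add(fi)
        pairs += 1

    unpaired_heads = len(heads) - pairs
    unpaired_faces = len(faces) - pairs
    return pairs + unpaired_heads + unpaired_faces


# Example: one person seen by both detectors, one head-only, one face-only.
heads = [(0, 0, 100, 100), (200, 0, 300, 100)]
faces = [(20, 30, 80, 90), (400, 10, 450, 60)]
print(count_people(heads, faces))
```

In this example the first face lies entirely inside the first head box, so the two detections count as one person, while the second head and the second face are unpaired and each add one more. A greedy assignment is used only for brevity; an optimal one-to-one assignment would be a reasonable alternative.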

Full Text