Abstract

Face detectors using the single shot multibox detector have a small number of discontinuous fixed anchor scales that cannot fully match continuous faces, and these may degrade the accuracy when detecting tiny faces that are not fully matched. To address this issue, we utilize the capabilities and complementary features of the heterogeneous submodel to help the main model extract more abundant features and scale-invariant features from the heterogeneous submodel. First, we propose a deep feature fusion model of the heterogeneous convolutional neural networks to refine and integrate different levels of semantics and obtain a semantically enhanced feature pyramid. We then present a composite anchor loss function that trains the generated network end-to-end, which solves the accuracy-diversity tradeoff of these hybrid submodels. Finally, we apply the proposed approach to the a benchmark for face detection in unconstrained settings (FDDB) and WIDER FACE benchmarks and present the results using their respective standard evaluation protocols. Experiments result in our method surpassing the existing single-shot face detector and further improves the average accuracy of the original model.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.