Abstract

There is an increasing interest in exploiting human pose estimation (HPE) software in human–machine interaction systems. Nevertheless, adopting such a computer vision application in real industrial scenarios is challenging. Overcoming occlusion limitations requires multiple cameras, which in turn require multiple, distributed, and synchronized HPE software nodes running on resource-constrained edge devices. We address this challenge by presenting a real-time distributed 3D HPE platform, which consists of a set of 3D HPE software nodes on edge devices (i.e., one per camera) that redundantly estimate the human pose from different points of view. A centralized aggregator collects the pose information through a shared communication network and merges it, in real time, through a pipeline of filtering, clustering, and association algorithms. The aggregator addresses network communication issues (e.g., delay and bandwidth variability) through a two-level synchronization, and supports both single- and multi-person pose estimation. We present the evaluation results with a real case study (i.e., HPE for human–machine interaction in an intelligent manufacturing line), in which the platform's accuracy and scalability are compared with state-of-the-art approaches and with a marker-based infrared motion capture system.
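The aggregation step described above can be illustrated with a minimal sketch: per-camera 3D pose estimates arriving with their own timestamps are first grouped into one synchronization window and then fused. All names and the fixed window size below are illustrative assumptions, and the plain per-joint average stands in for the paper's actual filtering, clustering, and association pipeline.

```python
import numpy as np

def merge_poses(node_estimates, window_s=0.05):
    """Toy aggregator step (illustrative, not the paper's algorithm):
    group per-camera 3D pose estimates whose timestamps fall within one
    synchronization window, then fuse the redundant estimates by
    averaging each joint position across cameras."""
    if not node_estimates:
        return None
    # Use the first estimate's timestamp as the window reference.
    t_ref = node_estimates[0][0]
    # Keep only estimates close enough in time (first-level sync).
    in_window = [kp for t, kp in node_estimates if abs(t - t_ref) <= window_s]
    # Fuse: per-joint mean over the redundant camera views.
    return np.mean(np.stack(in_window), axis=0)

# Two cameras observing the same person, with slightly different
# estimates of two joints (each row is an (x, y, z) position in metres).
cam_a = (0.000, np.array([[0.0, 0.0, 1.8], [0.2, 0.0, 1.4]]))
cam_b = (0.020, np.array([[0.1, 0.0, 1.8], [0.2, 0.1, 1.4]]))
fused = merge_poses([cam_a, cam_b])  # -> averaged (2, 3) joint array
```

In a real deployment the window reference would come from clock synchronization across edge devices, and a single average would not suffice for multi-person scenes, where association must first decide which detections belong to the same person.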
