Pose estimation is a fundamental technique in computer vision, allowing systems to recognize and analyze the positions and movements of objects or individuals in three-dimensional space. Its applications extend to healthcare for physical therapy, sports for performance analysis, and surveillance for behavior monitoring. The development of advanced algorithms and hardware has significantly improved its accuracy and real-time processing capabilities, driving innovation across industries. This study examines the integration of depth data with ArUco marker detection to improve pose estimation precision. By comparing translation vector (tvec) calculations using RGB-only methods and depth-enhanced approaches, this paper demonstrates that depth data provides superior consistency, particularly at extended distances. The results suggest that depth integration could address the limitations of traditional marker systems, improving accuracy and robustness. This work highlights the potential of depth-based enhancements and outlines future directions for comprehensive evaluation and system refinement.
Read full abstract