Abstract
The friendly communication can be more promoted between the human and computer if the function of gesture recognition is implemented to the computer system as the input interface along with the keyboards and mice. We propose a mouse-like function for estimating hand shape from input images with a monocular camera, with which a computer user feels no restraint or awkwardness. Our system involves conversion of sequential images from Cartesian coordinates to log-polar coordinates. Temporal and spatial subtractions and color information are used to extract the hand region. The origin of log-polar coordinates is chosen as the center of the acquired image, but once the hand has been extracted, the estimated centroid position of the hand region in the next frame, obtained from the current hand position and speed, is used as the origin to convert. Recognition of the hand shape is carried out by multiple regression analysis using higher order local autocorrelation features of log-polar coordinate space. Mouse-like functions are realized according to the hand shape and motion trajectory. Compared to conventional Cartesian coordinates, conversion to log-polar coordinates enables us to reduce image date and computation time, remove the variability by the scaling, and improve antinoise characteristics.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.