Hand pose estimation, hand tracking and recognition of hand signs are very important for human-computer interaction (HCI), understanding human grasping and robotics.
A decade ago the tasks seemed to be almost unsolvable with the data provided by a single RGB camera. Due to recent advances in sensing technologies and appearance of range cameras, there are new data sources available, making the solution for the mentioned above problems much more feasible.
There are several approaches possible, which can be divided into two main types: model-based approaches and machine learning approaches, depending on the requirements of a concrete application. Moreover, combining them should lead to more stable solutions.
Three persons performing static signs from the ASL alphabet were recorded using Intel Creative Gesture Camera. For each subject, RGB images, depth images and confidence maps were recorded. Additionally, binary masks are provided for a hand performing a sign on depth images.
When using the dataset, please cite:
Alina Kuznetsova, Laura Leal-Taixé, Bodo Rosenhahn
"Real-time sign language recognition using a consumer depth camera"
IEEE International Conference on Computer Vision Workshops (ICCVW),
3rd Workshop on Consumer Depth Cameras for Computer Vision (CDC4CV),
December 2013