The Beckman Institute and the Department of Electrical and Computer Engineering
University of Illinois at Urbana-Champaign
Rajeev Sharma
Department of Computer Science and Engineering
Pennsylvania State University at University Park
The project involves the development of computer vision techniques that are able to extract the user hand from the background, track the hand/arm motion, distinguish a meaningful gesture from unintentional hand movements using context, and resolve the conflicts between gestures from multiple users. A key challenge of gesture recognition in multimodal setting of a VR is to find ways of improving the performance of gesture recognition using, for example, speech recognition and gaze direction. Another challenge is to develop appropriate computational architectures for integrating two interaction modalities such as speech and hand gestures.
R. Sharma, T. S. Huang, V. I. Pavlovic, Y. Zhao, Z. Lo, S. Chu, K. Schulten, A. Dalke, J. Phillips, M. Zeller, and W. Humphrey. "Speech/gesture interface to a visual computing environment for molecular biologist." In Proc. International Conference on Pattern Recognition, pp. 964-968, 1996, Vienna, Austria.
T. S. Huang, V. I. Pavlovic, and R. Sharma. "Speech/gesture-based human computer interface in virtual environments." In Proc. Workshop on the Integration of Gesture in Language and Speech (WIGLS) , pp. 41-57, October 1996, Wilmington, DE.
V. I. Pavlovic, R. Sharma, and T. S. Huang. "Gestural Interface to a Visual Computing Environment for Molecular Biologists." In Second International Conference on Automatic Face and Gesture Recognition , pp. 30-35, October 1996, Killington, VT.
Yusuf Azoz. "Vision-Based Human Arm Tracking For Gesture Analysis." MS Thesis, Pennsylvania State University, Department of Electrical Engineering, 1997.
R. Sharma, V. I. Pavlovic, and T. S. Huang. "A multimodal framework for interacting with virtual environments." In C. A. Ntuen and E. H. Park, editors, Human Interaction with Complex Systems. pp. 53-71, Kluwer Academic Publishers, 1996.
The communication mode that seems most relevant to the manipulation of physical objects is the hand motion, also called hand gestures. To keep the interaction natural, it is necessary that there be a minimal number of devices attached to the user. Receiving of speech signals using statically mounted arrays of microphones has achieved this goal. To accomplish the same level of naturalness for HCI using hand gestures, it is possible to use computer vision techniques for analyzing free hand gestures. Although, some progress has been made in developing computer vision-based gesture recognition techniques, the problem is far from being solved. It is hoped that computer vision-based gesture recognition can be greatly improved by exploiting the other sensor modalities that might also be present in a multimodal human-computer interface.
J. Streeck. "Gesture as communication I: its coordination with gaze and speech." Communication monographs, 60:275--299, December 1993.
V. I. Pavlovic, R. Sharma, and T. S. Huang. "Visual interpretation of hand gestures for human-computer interaction: A review." IEEE Transaction on Pattern Analysis and Machine Intelligence, 19(7), July 1997.
M. T. Vo and C. Wood, "Building an application framework for speech and pen input integration in multimodal learning interfaces", Proc. Int'l Conference on Acoustics, Speech, and Signal Processing, 3545-3548, 1996
P. R. Cohen, M. Johnston, D. McGee, S. Oviatt, and Jay Pittman, "QuickSet: Multimodal Interaction for Simulation Set-Up and Control", Proc. of the 5th Applied Natural Language Processing Meeting, 1997, Washington, DC.
F. K. H. Quek. "Eyes in the interface." Image and Vision Computing, vol. 13, August 1995.
J. M. Rehg and T. Kanade. "Model-based tracking of self-occluding articulated objects." In Proc. IEEE International Conference on Computer Vision, pp. 612--617, June 1995, Cambridge, MA.
T. E. Starner and A. Pentland. "Visual recognition of american sign language using hidden Markov models." In Proc. International Workshop on Automatic Face and Gesture Recognition , (Zurich, Switzerland), pp. 189--194, June 1995.
Virtual Environments: Integration of gesture recognition into the control of Virtual Environments could benefit from collaboration with researchers involved in human factor studies for virtual environments. This collaboration could shed some light on how to eventually evaluate the use of glove-free gestures as a interaction modality in virtual environments.