A framework for sign language recognition using support vector machines and active learning for skin segmentation and boosted temporal sub-units

Awad, George M.

Abstract

This dissertation describes new techniques that can be used in a sign language recognition (SLR) system, and more generally in human gesture systems. Any SLR system consists of three main components: Skin detector, Tracker, and Recognizer. The skin detector is responsible for segmenting skin objects like the face and hands from video frames. The tracker keeps track of the hand location (more specifically the bounding box) and detects any occlusions that might happen between any skin objects. Finally, the recognizer tries to classify the performed sign into one of the sign classes in our vocabulary using the set of features and information provided by the tracker. In this work, we propose a new technique for skin segmentation using SVM (support vector machine) active learning combined with region segmentation information. Having segmented the face and hands, we need to track them across the frames. So, we have developed a unified framework for segmenting and tracking skin objects and detecting occlusions, where both components of segmentation and tracking help each other. Good tracking helps to reduce the search space for skin objects, and accurate segmentation increases the overall tracker accuracy. Instead of dealing with the whole sign for recognition, the sign can be broken down into elementary subunits, which are far less in number than the total number of signs in the vocabulary. This motivated us to propose a novel algorithm to model and segment these subunits, then try to learn the informative combinations of subunits/features using a boosting framework. Our results reached above 90% recognition rate using very few training samples.

Item Type:

Thesis (PhD)

Date of Award:

2007

Refereed:

Uncontrolled Keywords:

sign language recognition system; slr; support vector machine; svm; segmentation; tracking

Subjects:

Computer Science > Machine learning

DCU Faculties and Centres:

DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing

Use License:

This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License

ID Code:

16936

Deposited On:

03 May 2012 14:28 by INVALID USER. Last Modified 19 Jul 2018 14:55

DORAS | DCU Research Repository

A framework for sign language recognition using support vector machines and active learning for skin segmentation and boosted temporal sub-units

Downloads