Yu, Dahai, Ghita, Ovidiu, Sutherland, Alistair and Whelan, Paul F. ORCID: 0000-0001-9230-7656 (2006) Dictionary-based lip reading classification. In: CIICT 2006 - China-Ireland International Conference on Information and Communications Technologies, 18-19 October 2006, Hangzhou, China.
Abstract
Visual lip reading recognition is an essential stage in many multimedia systems such as “Audio Visual Speech
Recognition” [6], “Mobile Phone Visual System for deaf people”, “Sign Language Recognition System”, etc.
The use of lip visual features to help audio or hand recognition is appropriate because this information is robust
to acoustic noise. In this paper, we describe our work towards developing a robust technique for lip reading
classification that extracts the lips in a colour image by using EMPCA feature extraction and k-nearest-neighbor
classification. In order to reduce the dimensionality of the feature space the lip motion is characterized by three
templates that are modelled based on different mouth shapes: closed template, semi-closed template, and wide-open
template. Our goal is to classify each image sequence based on the distribution of the three templates and
group the words into different clusters. The words that form the database were grouped into three different
clusters as follows: group 1 (‘I’, ‘high’, ‘lie’, ‘hard’, ‘card’, ‘bye’), group 2 (‘you’, ‘owe’, ‘word’), group 3 (‘bird’).
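As a rough illustration of the idea in the abstract (not the authors' implementation), the sketch below labels each frame of a sequence with one of the three templates using k-nearest-neighbour voting, then summarises the sequence by the fraction of frames assigned to each template. The feature vectors are made-up 2-D points standing in for EMPCA projections, which are not computed here; the function names `knn_label` and `template_distribution` and the choice of k = 3 are assumptions made for this example.

```python
import math
from collections import Counter

# The three mouth-shape templates named in the abstract.
TEMPLATES = ("closed", "semi-closed", "wide-open")

def knn_label(feature, training, k=3):
    """Return the majority template label among the k nearest training frames."""
    nearest = sorted(training, key=lambda item: math.dist(feature, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

def template_distribution(sequence, training, k=3):
    """Fraction of frames in a sequence assigned to each template."""
    counts = Counter(knn_label(frame, training, k) for frame in sequence)
    return [counts[t] / len(sequence) for t in TEMPLATES]

# Hypothetical labelled frames: (feature vector, template label).
# Real features would come from EMPCA; these values are invented.
training = [
    ((0.0, 0.0), "closed"), ((0.1, 0.0), "closed"), ((0.0, 0.1), "closed"),
    ((1.0, 1.0), "semi-closed"), ((1.1, 1.0), "semi-closed"), ((1.0, 1.1), "semi-closed"),
    ((2.0, 2.0), "wide-open"), ((2.1, 2.0), "wide-open"), ((2.0, 2.1), "wide-open"),
]

# One hypothetical image sequence (a spoken word) of four frames.
sequence = [(0.05, 0.05), (1.05, 0.95), (2.0, 2.05), (2.1, 2.1)]

distribution = template_distribution(sequence, training)
print(dict(zip(TEMPLATES, distribution)))
```

Under these assumptions, words whose sequences yield similar distributions would fall into the same cluster; how the distributions are actually compared and grouped is described in the paper itself, not here.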
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | Lip Reading; Template Model; EMPCA; K-Nearest Neighbour Classification |
Subjects: | Computer Science > Multimedia systems; Computer Science > Information retrieval |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering; Research Institutes and Centres > Centre for Digital Video Processing (CDVP); Research Institutes and Centres > Research Institute for Networks and Communications Engineering (RINCE); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
ID Code: | 313 |
Deposited On: | 12 Mar 2008 by DORAS Administrator. Last Modified 16 Jan 2019 12:10 |
Documents
Full text available as: PDF (490kB)