Dictionary-based lip reading classification

Yu, Dahai, Ghita, Ovidiu, Sutherland, Alistair and Whelan, Paul F. ORCID: 0000-0001-9230-7656 (2006) Dictionary-based lip reading classification. In: CIICT 2006 - China-Ireland International Conference on Information and Communications Technologies, 18-19 October 2006, Hangzhou, China.

Abstract

Visual lip reading recognition is an essential stage in many multimedia systems such as “Audio Visual Speech Recognition” [6], “Mobile Phone Visual System for deaf people”, “Sign Language Recognition System”, etc. The use of lip visual features to help audio or hand recognition is appropriate because this information is robust to acoustic noise. In this paper, we describe our work towards developing a robust technique for lip reading classification that extracts the lips in a colour image by using EMPCA feature extraction and k-nearest-neighbor classification. In order to reduce the dimensionality of the feature space, the lip motion is characterized by three templates that are modelled on different mouth shapes: a closed template, a semi-closed template, and a wide-open template. Our goal is to classify each image sequence based on the distribution of the three templates and group the words into different clusters. The words that form the database were grouped into three clusters as follows: group 1 (‘I’, ‘high’, ‘lie’, ‘hard’, ‘card’, ‘bye’), group 2 (‘you’, ‘owe’, ‘word’), group 3 (‘bird’).
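The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each frame has already been reduced to a short feature vector (for example by EMPCA, which is not reproduced here), and the 2-D training points, the value k=3, and the names `knn_classify` and `template_distribution` are all hypothetical choices made for illustration. Each frame is labelled with one of the three templates by k-nearest-neighbour voting, and a sequence is then summarised by the fraction of its frames assigned to each template.

```python
import math
from collections import Counter

# The three mouth-shape templates named in the abstract.
TEMPLATES = ("closed", "semi-closed", "wide-open")

# Hypothetical labelled feature vectors (toy 2-D values, not real data).
TRAINING = [
    ((0.0, 0.0), "closed"), ((0.1, 0.1), "closed"), ((0.0, 0.2), "closed"),
    ((1.0, 1.0), "semi-closed"), ((1.1, 0.9), "semi-closed"), ((0.9, 1.1), "semi-closed"),
    ((2.0, 2.0), "wide-open"), ((2.1, 1.9), "wide-open"), ((1.9, 2.1), "wide-open"),
]

# Hypothetical image sequence: one feature vector per frame.
SEQUENCE = [(0.05, 0.05), (1.0, 1.05), (2.0, 2.0), (2.05, 1.95)]


def knn_classify(sample, training, k=3):
    """Return the majority label among the k training points nearest to sample."""
    nearest = sorted(training, key=lambda item: math.dist(sample, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def template_distribution(frames, training, k=3):
    """Return the fraction of frames assigned to each template, in TEMPLATES order."""
    labels = [knn_classify(frame, training, k) for frame in frames]
    counts = Counter(labels)
    return [counts[template] / len(labels) for template in TEMPLATES]


if __name__ == "__main__":
    for template, share in zip(TEMPLATES, template_distribution(SEQUENCE, TRAINING)):
        print(f"{template}: {share:.2f}")
```

Under these assumptions, words whose sequences yield similar template distributions would fall into the same cluster; how the paper compares distributions and forms the three groups is not stated in the abstract and is left out of the sketch.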

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Uncontrolled Keywords:	Lip Reading; Template Model; EMPCA; K-Nearest Neighbour Classification;
Subjects:	Computer Science > Multimedia systems; Computer Science > Information retrieval
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering; Research Initiatives and Centres > Centre for Digital Video Processing (CDVP); Research Initiatives and Centres > Research Institute for Networks and Communications Engineering (RINCE); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code:	313
Deposited On:	12 Mar 2008 by DORAS Administrator. Last Modified 16 Jan 2019 12:10

Documents

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
490kB
