A new manifold representation for visual speech recognition
Yu, Dahai, Ghita, Ovidiu, Sutherland, Alistair and Whelan, Paul F.ORCID: 0000-0001-9230-7656
(2007)
A new manifold representation for visual speech recognition.
In: IMVIP 2007 - 11th International Machine Vision and Image Processing Conference, 5-7 September 2007, Maynooth, Ireland.
In this paper, we propose a new manifold representation for visual speech recognition. The developed system consists of three main steps:
a. Lip extraction from input video data.
b. Generate the Expectation-Maximization PCA (EMPCA) manifolds for the entire image sequence and perform manifold interpolation and re-sampling.
c. Classify the manifolds using a HMM classifier to identify the words described by the lips motions in the input video sequence.