A new manifold representation for visual speech recognition
Yu, Dahai, Ghita, Ovidiu, Sutherland, Alistair and Whelan, Paul F.ORCID: 0000-0001-9230-7656
(2007)
A new manifold representation for visual speech recognition.
In: CAIP 2007 - 12th International Conference on Computer Analysis of Images and Patterns, 27-29 August 2007, Vienna, Austria.
ISBN 978-3-540-74271-5
In this paper, we propose a new manifold representation capable of being applied for visual speech recognition. In this regard, the real time input video data is compressed using Principal Component Analysis (PCA) and the low-dimensional points calculated for each frame define the manifolds. Since the number of frames that from the video sequence is dependent on the word complexity, in order to use these manifolds for visual speech classification it is required to re-sample them into a fixed number of keypoints that are used as input for classification. In this paper two classification schemes, namely the k Nearest Neighbour (kNN) algorithm that is used in conjunction with the two-stage PCA and Hidden-Markov-Model (HMM) classifier are evaluated. The classification results for a group of English words indicate that the proposed approach is able to produce accurate classification results.