Chatbri, Houssem, Oliveira, Marlon, McGuinness, Kevin, Little, Suzanne, Kameyama, Keisuke, Kwan, Paul, Sutherland, Alistair and O'Connor, Noel E. (2017) Educational video classification by using a transcript to image transform and supervised learning. In: 7th International Conference on Image Processing Theory, Tools and Applications (IPTA), 28 Nov - 1 Dec 2017, Montreal, Canada. ISBN 978-1-5386-1842-4 (In Press)
Abstract
In this work, we present a method for automatic topic classification of educational videos using a speech transcript transform. Our method works as follows: First, speech recognition is used to generate video transcripts. Then, the transcripts are converted into images using a statistical co-occurrence transformation that we designed. Finally, a classifier is used to produce video category labels for a transcript image input. For our classifiers, we report results using a convolutional neural network (CNN) and a principal component analysis (PCA) model.
In order to evaluate our method, we used the Khan Academy on a Stick dataset that contains 2,545 videos, where each video is labeled with one or two of 13 categories. Experiments show that our method is effective and strongly competitive against other supervised learning-based methods.
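The abstract does not spell out how the statistical co-occurrence transformation is defined, so the following is only a minimal sketch of the general idea of turning a transcript into a fixed-size image that a classifier could consume. The character alphabet, the `window` parameter, the rescaling to 0..255, and the function name `transcript_to_image` are all assumptions made for illustration; none of them is taken from the paper.

```python
from collections import Counter
import string

# Hypothetical symbol set: lowercase letters plus space. The symbol set used
# in the paper is not given in the abstract.
ALPHABET = string.ascii_lowercase + " "
INDEX = {ch: i for i, ch in enumerate(ALPHABET)}


def transcript_to_image(text, window=1):
    """Map a transcript to a square grayscale image of co-occurrence counts.

    Pixel (i, j) counts how often symbol j appears within `window` positions
    after symbol i, rescaled so the most frequent pair maps to 255.
    This is an illustrative stand-in, not the authors' transform.
    """
    # Lowercase, replace symbols outside the alphabet with spaces,
    # then collapse runs of whitespace into single spaces.
    cleaned = "".join(ch if ch in INDEX else " " for ch in text.lower())
    cleaned = " ".join(cleaned.split())

    # Count ordered symbol pairs that fall within the window.
    counts = Counter()
    for pos, ch in enumerate(cleaned):
        for offset in range(1, window + 1):
            if pos + offset < len(cleaned):
                counts[(INDEX[ch], INDEX[cleaned[pos + offset]])] += 1

    # Build the image; an empty transcript yields an all-zero image.
    size = len(ALPHABET)
    peak = max(counts.values(), default=0)
    image = [[0] * size for _ in range(size)]
    for (i, j), n in counts.items():
        image[i][j] = round(255 * n / peak)
    return image


if __name__ == "__main__":
    img = transcript_to_image("the derivative of x squared is two x")
    print(len(img), "x", len(img[0]), "image")
```

Because every transcript maps to an image of the same size regardless of its length, the output could be fed to a fixed-input classifier such as a CNN or a PCA-based model, which is the role the transform plays in the pipeline described above.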
Metadata
Item Type: | Conference or Workshop Item (Speech) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Additional Information: | Research Centre: Insight Centre for Data Analytics |
Uncontrolled Keywords: | Educational video classification; transcript features; convolutional neural networks (CNN); principal component analysis (PCA) |
Subjects: | Computer Science > Machine learning; Computer Science > Artificial intelligence; Computer Science > Multimedia systems; Computer Science > Digital video |
DCU Faculties and Centres: | UNSPECIFIED |
Published in: | International Conference on Image Processing Theory, Tools and Applications (IPTA), Proceedings. ISBN 978-1-5386-1842-4 |
Copyright Information: | © 2017 IEEE |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
Funders: | Irish Research Council for Science Engineering and Technology |
ID Code: | 22181 |
Deposited On: | 10 Jan 2018 12:52 by Houssem Chatbri. Last Modified 19 Jul 2018 15:12 |
Documents
Full text available as: PDF (755kB)
Available Versions of this Item
- Educational video classification by using a transcript to image transform and supervised learning. (deposited 10 Jan 2018 12:52) [Currently Displayed]