Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Short user-generated videos classification using accompanied audio categories

Guo, Jinlin and Gurrin, Cathal orcid logoORCID: 0000-0003-4395-7702 (2012) Short user-generated videos classification using accompanied audio categories. In: AMVA’12, 2 Nov. 2012, Nara, Japan. ISBN 978-1-4503-1585-2

Abstract
This paper investigates the classification of short user-generated videos (UGVs) using the accompanied audio data since short UGVs accounts for a great proportion of the Internet UGVs and many short UGVs are accompanied by singlecategory soundtracks. We define seven types of UGVs corresponding to seven audio categories respectively. We also investigate three modeling approaches for audio feature representation, namely, single Gaussian (1G), Gaussian mixture (GMM) and Bag-of-Audio-Word (BoAW) models. Then using Support Vector Machine (SVM) with three different distance measurements corresponding to three feature representations, classifiers are trained to categorize the UGVs. The accompanying evaluation results show that these approaches are effective for categorizing the short UGVs based on their audio track. Experimental results show that a GMM representation with approximated Bhattacharyya distance (ABD) measurement produces the best performance, and BoAW representation with chi-square kernel also reports comparable results.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Refereed:Yes
Uncontrolled Keywords:User-generated Video; MFCC; Video Classification
Subjects:Computer Science > Machine learning
Computer Science > Algorithms
Computer Science > Information retrieval
Computer Science > Digital video
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering
Research Institutes and Centres > CLARITY: The Centre for Sensor Web Technologies
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis. . Association for Computing Machinery. ISBN 978-1-4503-1585-2
Publisher:Association for Computing Machinery
Official URL:http://doi.acm.org/10.1145/2390214.2390220
Copyright Information:© ACM, 2012. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in "Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis" http://doi.acm.org/10.1145/2390214.2390220
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:17529
Deposited On:06 Mar 2013 11:00 by Jinlin Guo . Last Modified 02 Nov 2018 15:33
Documents

Full text available as:

[thumbnail of file.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
217kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record