Short user-generated videos classification using
accompanied audio categories

Guo, Jinlin; Gurrin, Cathal

Guo, Jinlin and Gurrin, Cathal ORCID: 0000-0003-4395-7702 (2012) Short user-generated videos classification using accompanied audio categories. In: AMVA’12, 2 Nov. 2012, Nara, Japan. ISBN 978-1-4503-1585-2

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

This paper investigates the classification of short user-generated videos (UGVs) using the accompanied audio data since short UGVs accounts for a great proportion of the Internet UGVs and many short UGVs are accompanied by singlecategory soundtracks. We define seven types of UGVs corresponding to seven audio categories respectively. We also investigate three modeling approaches for audio feature representation, namely, single Gaussian (1G), Gaussian mixture (GMM) and Bag-of-Audio-Word (BoAW) models. Then using Support Vector Machine (SVM) with three different distance measurements corresponding to three feature representations, classifiers are trained to categorize the UGVs. The accompanying evaluation results show that these approaches are effective for categorizing the short UGVs based on their audio track. Experimental results show that a GMM representation with approximated Bhattacharyya distance (ABD) measurement produces the best performance, and BoAW representation with chi-square kernel also reports comparable results.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Workshop
Refereed:	Yes
Uncontrolled Keywords:	User-generated Video; MFCC; Video Classification
Subjects:	Computer Science > Machine learning Computer Science > Algorithms Computer Science > Information retrieval Computer Science > Digital video
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Institutes and Centres > CLARITY: The Centre for Sensor Web Technologies DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in:	Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis. . Association for Computing Machinery. ISBN 978-1-4503-1585-2
Publisher:	Association for Computing Machinery
Official URL:	http://doi.acm.org/10.1145/2390214.2390220
Copyright Information:	© ACM, 2012. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in "Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis" http://doi.acm.org/10.1145/2390214.2390220
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:	17529
Deposited On:	06 Mar 2013 11:00 by Jinlin Guo . Last Modified 02 Nov 2018 15:33

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
217kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Short user-generated videos classification using accompanied audio categories

Downloads