Browse DORAS
Browse Theses
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Speech-music discrimination from MPEG-1 bitstream

Jarina, Roman and Murphy, Noel and O'Connor, Noel E. and Marlow, Seán (2001) Speech-music discrimination from MPEG-1 bitstream. In: SSIP 2001 - WSES International Conference on Speech, Signal and Image Processing, 1-6 September 2001, Malta.

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


This paper describes a proposed algorithm for speech/music discrimination, which works on data directly taken from MPEG encoded bitstream thus avoiding the computationally difficult decoding-encoding process. The method is based on thresholding of features derived from the modulation envelope of the frequency-limited audio signal. The discriminator is tested on more than 2 hours of audio data, which contain clean and noisy speech from several speakers and a variety of music content. The discriminator is able to work in real time and despite its simplicity, results are very promising.

Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Uncontrolled Keywords:audio; video; classification; speech; music; signal processing; MPEG;
Subjects:Computer Science > Digital video
Computer Science > Information retrieval
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Digital Video Processing (CDVP)
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:332
Deposited On:13 Mar 2008 by DORAS Administrator. Last Modified 03 Feb 2009 16:04

Download statistics

Archive Staff Only: edit this record