Rhythm detection for speech-music discrimination in MPEG compressed domain
Jarina, Roman, O'Connor, Noel E. (ORCID: 0000-0002-4033-9135), Marlow, Seán and Murphy, Noel
(2002)
Rhythm detection for speech-music discrimination in MPEG compressed domain.
In: DSP 2002 - 14th IEEE International Conference on Digital Signal Processing, 1-3 July 2002, Santorini, Greece.
A novel approach to speech-music discrimination based on rhythm (or beat) detection is introduced. Rhythmic pulses are detected by applying a long-term autocorrelation method to band-passed signals. This approach is combined with a second one whose features describe the energy peaks of the signal. The discriminator uses only three features, all computed from data taken directly from an MPEG-1 bitstream. It was tested on more than 3 hours of audio data and achieved an average recognition rate of 97.7%.
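The idea of finding rhythmic pulses by long-term autocorrelation can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that a per-frame energy envelope for one frequency band is already available (how that envelope is derived from the MPEG-1 bitstream is not described in this record), and the function names, lag range, and synthetic test signals are hypothetical choices made only for illustration.

```python
import random


def normalized_autocorrelation(envelope, max_lag):
    """Autocorrelation of the mean-removed envelope for lags 0..max_lag.

    Values are divided by the zero-lag energy, so r[0] is 1.0 for any
    non-constant envelope. A constant envelope yields all zeros.
    """
    n = len(envelope)
    mean = sum(envelope) / n
    centered = [v - mean for v in envelope]
    energy = sum(c * c for c in centered)
    if energy == 0.0:
        return [0.0] * (max_lag + 1)
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / energy
        for lag in range(max_lag + 1)
    ]


def rhythm_strength(envelope, min_lag, max_lag):
    """Return (lag, value) of the highest autocorrelation peak in the lag range.

    A high value suggests a regular pulse with a period of `lag` frames.
    """
    r = normalized_autocorrelation(envelope, max_lag)
    best_lag = max(range(min_lag, max_lag + 1), key=lambda lag: r[lag])
    return best_lag, r[best_lag]


# Hypothetical band envelope with a pulse every 20 frames (music-like beat).
pulsed = [1.0 if i % 20 == 0 else 0.0 for i in range(400)]

# Hypothetical envelope with no regular pulse (stand-in for non-rhythmic audio).
random.seed(0)
noisy = [random.random() for _ in range(400)]

pulsed_lag, pulsed_strength = rhythm_strength(pulsed, 10, 50)
noisy_lag, noisy_strength = rhythm_strength(noisy, 10, 50)
print("pulsed:", pulsed_lag, round(pulsed_strength, 3))
print("noisy: ", noisy_lag, round(noisy_strength, 3))
```

In this sketch the peak value serves as a single rhythm feature: an envelope with a steady beat produces a strong peak at the beat period, while an irregular envelope produces only weak peaks. A real discriminator would still need a decision rule, and the abstract states that the rhythm feature is combined with energy-peak features rather than used alone.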
Metadata
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Conference
Refereed:
Yes
Uncontrolled Keywords:
audio signal processing; correlation methods; feature extraction; music; speech processing