Browse DORAS
Browse Theses
Latest Additions
Creative Commons License
Except where otherwise noted, content on this site is licensed for use under a:

Spoken content retrieval: A survey of techniques and technologies

Larson, Martha and Jones, Gareth J.F. (2012) Spoken content retrieval: A survey of techniques and technologies. Foundations and Trends in Information Retrieval, 5 (4-5). pp. 235-422. ISSN 1554-0669

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings. This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). SCR research initially investigated planned speech structured in document-like units, but has subsequently shifted focus to more informal spoken content produced spontaneously, outside of the studio and in conversational settings. This survey provides an overview of the field of SCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition and user interaction issues. It is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight on how these fields are integrated to support research and development, thus addressing the core challenges of SCR.

Item Type:Article (Published)
Uncontrolled Keywords:spoken content retrieval; speech retrieval
Subjects:Computer Science > Multimedia systems
Computer Science > Information retrieval
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:
Copyright Information:©2012 now publishing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland, European Framework Programme 7
ID Code:17158
Deposited On:15 Aug 2012 10:41 by Gareth Jones. Last Modified 15 Aug 2012 10:41

Download statistics

Archive Staff Only: edit this record