Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

DCU at NTCIR-10 spokenDoc2 passage retrieval task

Eskevich, Maria orcid logoORCID: 0000-0002-1242-0753 and Jones, Gareth J.F. orcid logoORCID: 0000-0003-2923-8365 (2013) DCU at NTCIR-10 spokenDoc2 passage retrieval task. In: NTCIR-10 Conference, 18-21 June 2013, Tokyo, Japan.

Abstract
We describe details of our runs and the results obtained for the "2nd round of IR for Spoken Documents (SpokenDoc2)" task. We participated in the passage retrieval from the Corpus of Spoken Document Processing Workshop (SDPWS) task. For our participation in the NTCIR-9 SpokenDoc task, we investigated the use of different content-based segmentation methods that attempt to identify topically coherent units for retrieval. For NTCIR-10 we compare content-based segmentation (the TextTiling algorithm) to division of the content into segments of a fixed number of Inter-Pausal Units (IPUs) using a sliding window, and subsequent combination of overlapping segments into single units in the ranked list of results. Another focus of our submissions to NTCIR-10 is the potential for use of external data for document expansion. For this we used a DBpedia collection for IPU expansion for all segmentation methods.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Speech search; Passage retrieval; Automatic segmentation; Document expansion
Subjects:Computer Science > Multimedia systems
Computer Science > Information retrieval
DCU Faculties and Centres:Research Institutes and Centres > Centre for Digital Video Processing (CDVP)
Research Institutes and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: Proceedings of NTCIR 10. .
Official URL:http://research.nii.ac.jp/ntcir/workshop/OnlinePro...
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland
ID Code:20368
Deposited On:13 Jan 2015 14:42 by Gareth Jones . Last Modified 10 Oct 2018 09:18
Documents

Full text available as:

[thumbnail of NTCIR-10.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
308kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record