Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Domain-specific query translation for multilingual information access using machine translation augmented with dictionaries mined from Wikipedia

Jones, Gareth J.F. orcid logoORCID: 0000-0003-2923-8365, Fantino, Fabio, Newman, Eamonn orcid logoORCID: 0000-0002-0310-0539 and Zhang, Ying (2008) Domain-specific query translation for multilingual information access using machine translation augmented with dictionaries mined from Wikipedia. In: CLIA 2008 - 2nd International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies, 11 Jan 2008, Hyderabad, India.

Abstract
Accurate high-coverage translation is a vital component of reliable cross language information access (CLIA) systems. While machine translation (MT) has been shown to be effective for CLIA tasks in previous evaluation workshops, it is not well suited to specialized tasks where domain specific translations are required. We demonstrate that effective query translation for CLIA can be achieved in the domain of cultural heritage (CH). This is performed by augmenting a standard MT system with domainspecific phrase dictionaries automatically mined from the online Wikipedia. Experiments using our hybrid translation system with sample query logs from users of CH websites demonstrate a large improvement in the accuracy of domain specific phrase detection and translation.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Workshop
Refereed:Yes
Additional Information:Workshop in conjunction with IJCNLP 2008 - The Third International Joint Conference on Natural Language Processing, 7-12 Jan, 2008, Hyderabad, India.
Subjects:Computer Science > Information retrieval
DCU Faculties and Centres:Research Institutes and Centres > Centre for Digital Video Processing (CDVP)
Official URL:http://search.iiit.ac.in/CLIA2008/
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:386
Deposited On:01 Apr 2008 by DORAS Administrator . Last Modified 25 Oct 2018 11:57
Documents

Full text available as:

[thumbnail of cross_ling_info_2008.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
432kB
[thumbnail of cross_ling_info_pres_2008.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
941kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record