Domain-specific query translation for multilingual information access using machine translation augmented with dictionaries mined from Wikipedia

Jones, Gareth J.F. and Fantino, Fabio and Newman, Eamonn and Zhang, Ying (2008) Domain-specific query translation for multilingual information access using machine translation augmented with dictionaries mined from Wikipedia. In: CLIA 2008 - 2nd International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies, 11 Jan 2008, Hyderabad, India.

Full text available as:

PDF (422Kb)
PDF (919Kb)

Abstract

Accurate high-coverage translation is a vital component of reliable cross-language information access (CLIA) systems. While machine translation (MT) has been shown to be effective for CLIA tasks in previous evaluation workshops, it is not well suited to specialized tasks where domain-specific translations are required. We demonstrate that effective query translation for CLIA can be achieved in the domain of cultural heritage (CH). This is performed by augmenting a standard MT system with domain-specific phrase dictionaries automatically mined from the online Wikipedia. Experiments using our hybrid translation system with sample query logs from users of CH websites demonstrate a large improvement in the accuracy of domain-specific phrase detection and translation.

Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Refereed: Yes
Additional Information: Workshop in conjunction with IJCNLP 2008 - The Third International Joint Conference on Natural Language Processing, 7-12 Jan, 2008, Hyderabad, India.
Subjects: Computer Science > Information retrieval
DCU Faculties and Centres: Research Initiatives and Centres > Centre for Digital Video Processing (CDVP)
Official URL: http://search.iiit.ac.in/CLIA2008/
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 386
Deposited On: 01 Apr 2008 by DORAS Administrator. Last Modified: 04 Feb 2009 13:55
