Multilingual search for cultural heritage archives via combining multiple translation resources
Jones, Gareth J.F. and Zhang, Ying and Newman, Eamonn and Fantino, Fabio and Debole, Franca (2007) Multilingual search for cultural heritage archives via combining multiple translation resources. In: LaTeCH 2007 - ACL Workshop on Language Technology for Cultural Heritage Data, 28 June 2007, Prague, Czech Republic.
Full text available as:
The linguistic features of material in Cultural Heritage (CH) archives may be in various languages requiring a facility for effective multilingual search. The specialised
language often associated with CH content introduces problems for automatic translation to support search applications. The MultiMatch project is focused on enabling
users to interact with CH content across different media types and languages. We present results from a MultiMatch study exploring various translation techniques for
the CH domain. Our experiments examine translation techniques for the English language CLEF 2006 Cross-Language
Speech Retrieval (CL-SR) task using Spanish, French and German queries. Results compare effectiveness of our query
translation against a monolingual baseline and show improvement when combining a domain-specific translation lexicon with a standard machine translation system.
Archive Staff Only: edit this record