Gupta, Rohit, Orăsan, Constantin, Liu, Qun ORCID: 0000-0002-7000-1792 and Mitkov, Ruslan ORCID: 0000-0003-2522-066X (2016) A dynamic programming approach to improving translation memory matching and retrieval using paraphrases. In: 19th International Conference on Text, Speech, and Dialogue (TSD 2016), 12-16 Sept 2016, Brno, Czech Republic.
Abstract
Translation memory tools lack semantic knowledge like paraphrasing when they perform matching and retrieval. As a result, paraphrased segments are often not retrieved. One of the primary reasons
for this is the lack of a simple and efficient algorithm to incorporate
paraphrasing in the TM matching process. Gupta and Or˘asan [1] proposed an algorithm which incorporates paraphrasing based on greedy
approximation and dynamic programming. However, because of greedy
approximation, their approach does not make full use of the paraphrases
available. In this paper we propose an efficient method for incorporating
paraphrasing in matching and retrieval based on dynamic programming
only. We tested our approach on English-German, English-Spanish and
English-French language pairs and retrieved better results for all three
language pairs compared to the earlier approach
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | Edit distance with paraphrasing; Translation memory; TM matching and retrieval; Computer aided translation; Paraphrasing |
Subjects: | UNSPECIFIED |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > ADAPT |
Published in: | Sojka, Petr, Horák, Aleš, Kopeček, Ivan and Pala, Karel, (eds.) Text, Speech, and Dialogue. 19th International Conference, TSD 2016 Proceedings. Lecture Notes in Computer Science (LNCS) 9924. Springer. |
Publisher: | Springer |
Official URL: | https://doi.org/10.1007/978-3-319-45510-5_30 |
Copyright Information: | © 2016 Springer |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | People Programme (Marie Curie Actions) of the European Union’s Seventh Framework Programme FP7/2007-2013/ under REA grant agreement No. 317471. |
ID Code: | 23207 |
Deposited On: | 26 Apr 2019 09:05 by Thomas Murtagh . Last Modified 26 Apr 2019 09:05 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
275kB |
Metrics
Altmetric Badge
Dimensions Badge
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record