A Dynamic Programming Approach to Improving Translation Memory Matching and Retrieval Using Paraphrases
Abstract
Translation memory tools lack semantic knowledge like paraphrasing when they perform matching and retrieval. As a result, paraphrased segments are often not retrieved. One of the primary reasons for this is the lack of a simple and efficient algorithm to incorporate paraphrasing in the TM matching process. Gupta and Orăsan [1] proposed an algorithm which incorporates paraphrasing based on greedy approximation and dynamic programming. However, because of greedy approximation, their approach does not make full use of the paraphrases available. In this paper we propose an efficient method for incorporating paraphrasing in matching and retrieval based on dynamic programming only. We tested our approach on English-German, English-Spanish and English-French language pairs and retrieved better results for all three language pairs compared to the earlier approachCitation
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (eds), Text, Speech, and Dialogue: olume 9924 of the series Lecture Notes in Computer Science pp 259-269Publisher
SpringerAdditional Links
http://link.springer.com/chapter/10.1007%2F978-3-319-45510-5_30Type
Chapter in bookLanguage
enISBN
97833194550999783319455105 on-line