Bilingual contexts from comparable corpora to mine for translations of collocations

Taslimipoor, Shiva
Mitkov, Ruslan
Corpas Pastor, Gloria
Fazly, Afsaneh
Abstract
Because parallel data are scarce for many languages, we propose a methodology that uses comparable corpora to find translation equivalents for collocations, a type of multi-word expression that is particularly difficult to translate. Finding translations is known to be harder for collocations than for single words. Our method extracts bilingual contexts (English-Spanish in our case) and builds a distributional word representation model from them. We show that constructing bilingual contexts is effective for learning translation equivalents and that our method outperforms a simplified distributional similarity baseline at finding them.
Type
Conference contribution
Language
en
Description
Proceedings of the 17th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing2016
ISSN
0302-9743
ISBN
9783319754871