Show simple item record

dc.contributor.authorHidalgo-Ternero, Carlos Manuel
dc.contributor.authorPastor, Gloria Corpas
dc.date.accessioned2021-01-14T09:57:03Z
dc.date.available2021-01-14T09:57:03Z
dc.date.issued2020-11-25
dc.identifier.citationHidalgo-Ternero, C. and Corpas Pastor, G. (2020) Bridging the “gApp”: improving neural machine translation systems for multiword expression detection, Yearbook of Phraseology, 11(1), pp. 61–80. DOI: https://doi.org/10.1515/phras-2020-0005en
dc.identifier.issn1868-632Xen
dc.identifier.doi10.1515/phras-2020-0005en
dc.identifier.urihttp://hdl.handle.net/2436/623878
dc.descriptionThis is the published version of an article published by De Gruyter in Yearbook of Phraseology on 25/11/2020, available online: https://doi.org/10.1515/phras-2020-0005en
dc.description.abstractThe present research introduces the tool gApp, a Python-based text preprocessing system for the automatic identification and conversion of discontinuous multiword expressions (MWEs) into their continuous form in order to enhance neural machine translation (NMT). To this end, an experiment with semi-fixed verb–noun idiomatic combinations (VNICs) will be carried out in order to evaluate to what extent gApp can optimise the performance of the two main free open-source NMT systems —Google Translate and DeepL— under the challenge of MWE discontinuity in the Spanish into English directionality. In the light of our promising results, the study concludes with suggestions on how to further optimise MWE-aware NMT systems.en
dc.formatapplication/pdfen
dc.language.isoenen
dc.publisherWalter de Gruyter GmbHen
dc.relation.urlhttps://www.degruyter.com/view/journals/yop/11/1/article-p61.xmlen
dc.subjecttext preprocessing systemen
dc.subjectneural machine translation (NMT)en
dc.subjectmultiword expression (MWE)en
dc.subjectverb–noun idiomatic combinations (VNICs)en
dc.subjectdiscontinuityen
dc.titleBridging the “gApp”: improving neural machine translation systems for multiword expression detectionen
dc.typeJournal articleen
dc.identifier.eissn1868-6338
dc.identifier.journalYearbook of Phraseologyen
dc.date.updated2021-01-11T09:20:34Z
rioxxterms.funderSpanish Ministry of Educationen
rioxxterms.identifier.projectFPU16/02032en
rioxxterms.versionVoRen
rioxxterms.licenseref.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/en
rioxxterms.licenseref.startdate2021-11-25en
dc.source.volume11
dc.source.issue1
dc.source.beginpage61
dc.source.endpage80
dc.description.versionPublished version
refterms.dateFCD2021-01-14T09:56:09Z
refterms.versionFCDVoR


Files in this item

Thumbnail
Name:
Ternero_Corpas_Bridging the ...
Embargo:
2021-11-25
Size:
1.249Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

https://creativecommons.org/licenses/by-nc-nd/4.0/
Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by-nc-nd/4.0/