Loading...
Thumbnail Image
Item

Cross-lingual Dependency Parsing of Related Languages with Rich Morphosyntactic Tagsets

Agić, Željko
Tiedemann, Jörg
Merkler, Danijela
Krek, Simon
Dobrovoljc, Kaja
Moze, Sara
Editors
Other contributors
Affiliation
Epub Date
Issue Date
2014
Submitted date
Subjects
Alternative
Abstract
This paper addresses cross-lingual dependency parsing using rich morphosyntactic tagsets. In our case study, we experiment with three related Slavic languages: Croatian, Serbian and Slovene. Four different dependency treebanks are used for monolingual parsing, direct cross-lingual parsing, and a recently introduced crosslingual parsing approach that utilizes statistical machine translation and annotation projection. We argue for the benefits of using rich morphosyntactic tagsets in cross-lingual parsing and empirically support the claim by showing large improvements over an impoverished common feature representation in form of a reduced part-of-speech tagset. In the process, we improve over the previous state-of-the-art scores in dependency parsing for all three languages.
Citation
Language Technology for Closely Related Languages and Language Variants (LT4CloseLang), pages 13–24, October 29, 2014, Doha, Qatar
Research Unit
PubMed ID
PubMed Central ID
Embedded videos
Type
Conference contribution
Language
en
Description
Series/Report no.
ISSN
EISSN
ISBN
9781937284961
ISMN
Gov't Doc #
Sponsors
Rights
Research Projects
Organizational Units
Journal Issue
Embedded videos