Loading...
Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions
Taslimipoor, Shiva ; Desantis, Anna ; Cherchi, Manuela ; Mitkov, Ruslan ; Monti, Johanna
Taslimipoor, Shiva
Desantis, Anna
Cherchi, Manuela
Mitkov, Ruslan
Monti, Johanna
Editors
Other contributors
Affiliation
Epub Date
Issue Date
2016-12-05
Submitted date
Alternative
Abstract
This paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.
Citation
Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016)
Publisher
Journal
Research Unit
DOI
PubMed ID
PubMed Central ID
Embedded videos
Additional Links
Type
Conference contribution
Language
en
Description
Napoli, Italy, December 5-7, 2016
Series/Report no.
ISSN
1613-0073