Loading...
Thumbnail Image
Publication

Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions

Taslimipoor, Shiva
Desantis, Anna
Cherchi, Manuela
Mitkov, Ruslan
Monti, Johanna
Alternative
Abstract
This paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.
Citation
Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016)
Publisher
Journal
Research Unit
DOI
PubMed ID
PubMed Central ID
Embedded videos
Type
Conference contribution
Language
en
Description
Napoli, Italy, December 5-7, 2016
Series/Report no.
ISSN
1613-0073
EISSN
ISBN
ISMN
Gov't Doc #
Sponsors
Rights
Research Projects
Organizational Units
Journal Issue
Embedded videos