Browsing Research Institute in Information and Language Processing by Publisher "ceur-ws"
Now showing items 1-1 of 1
Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressionsThis paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.