Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions

2.50
Hdl Handle:
http://hdl.handle.net/2436/620360
Title:
Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions
Authors:
Taslimipoor, Shiva; Desantis, Anna; Cherchi, Manuela; Mitkov, Ruslan; Monti, Johanna
Abstract:
This paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.
Publisher:
ceur-ws
Journal:
Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016)
Issue Date:
Dec-2016
URI:
http://hdl.handle.net/2436/620360
Additional Links:
http://ceur-ws.org/Vol-1749/
Type:
Article
Language:
en
ISSN:
1613-0073
Appears in Collections:
Computational Linguistics Group

Full metadata record

DC FieldValue Language
dc.contributor.authorTaslimipoor, Shivaen
dc.contributor.authorDesantis, Annaen
dc.contributor.authorCherchi, Manuelaen
dc.contributor.authorMitkov, Ruslanen
dc.contributor.authorMonti, Johannaen
dc.date.accessioned2017-02-01T15:19:16Z-
dc.date.available2017-02-01T15:19:16Z-
dc.date.issued2016-12-
dc.identifier.issn1613-0073en
dc.identifier.urihttp://hdl.handle.net/2436/620360-
dc.description.abstractThis paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.en
dc.language.isoenen
dc.publisherceur-wsen
dc.relation.urlhttp://ceur-ws.org/Vol-1749/en
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectNatural Language Processingen
dc.subjectMultiword Expressionsen
dc.titleLanguage resources for Italian: Towards the development of a corpus of annotated Italian multiword expressionsen
dc.typeArticleen
dc.identifier.journalProceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016)en
dc.date.accepted2016-11-
rioxxterms.funderInternalen
rioxxterms.identifier.projectUoW010217STen
rioxxterms.versionAMen
rioxxterms.licenseref.urihttps://creativecommons.org/CC BY-NC-ND 4.0en
rioxxterms.licenseref.startdate2017-02-01en
This item is licensed under a Creative Commons License
Creative Commons
All Items in WIRE are protected by copyright, with all rights reserved, unless otherwise indicated.