Wolverhampton Intellectual Repository and E-Theses
>
Research Institutes
>
Research Institute in Information and Language Processing
>
Computational Linguistics Group
>
Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions
2.50
- Hdl Handle:
- http://hdl.handle.net/2436/620360
- Title:
- Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions
- Authors:
- Abstract:
- This paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented.
- Publisher:
- Journal:
- Issue Date:
- Dec-2016
- URI:
- http://hdl.handle.net/2436/620360
- Additional Links:
- http://ceur-ws.org/Vol-1749/
- Type:
- Article
- Language:
- en
- ISSN:
- 1613-0073
- Appears in Collections:
- Computational Linguistics Group
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Taslimipoor, Shiva | en |
dc.contributor.author | Desantis, Anna | en |
dc.contributor.author | Cherchi, Manuela | en |
dc.contributor.author | Mitkov, Ruslan | en |
dc.contributor.author | Monti, Johanna | en |
dc.date.accessioned | 2017-02-01T15:19:16Z | - |
dc.date.available | 2017-02-01T15:19:16Z | - |
dc.date.issued | 2016-12 | - |
dc.identifier.issn | 1613-0073 | en |
dc.identifier.uri | http://hdl.handle.net/2436/620360 | - |
dc.description.abstract | This paper describes the first resource annotated for multiword expressions (MWEs) in Italian. Two versions of this dataset have been prepared: the first with a fast markup list of out-of-context MWEs, and the second with an in-context annotation, where the MWEs are entered with their contexts. The paper also discusses annotation issues and reports the inter-annotator agreement for both types of annotations. Finally, the results of the first exploitation of the new resource, namely the automatic extraction of Italian MWEs, are presented. | en |
dc.language.iso | en | en |
dc.publisher | ceur-ws | en |
dc.relation.url | http://ceur-ws.org/Vol-1749/ | en |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Natural Language Processing | en |
dc.subject | Multiword Expressions | en |
dc.title | Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions | en |
dc.type | Article | en |
dc.identifier.journal | Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016) | en |
dc.date.accepted | 2016-11 | - |
rioxxterms.funder | Internal | en |
rioxxterms.identifier.project | UoW010217ST | en |
rioxxterms.version | AM | en |
rioxxterms.licenseref.uri | https://creativecommons.org/CC BY-NC-ND 4.0 | en |
rioxxterms.licenseref.startdate | 2017-02-01 | en |
All Items in WIRE are protected by copyright, with all rights reserved, unless otherwise indicated.