A sequence labelling approach for automatic analysis of ello: tagging pronouns, antecedents, and connective phrases
Ha, Le An
Olivares-López, Raúl Ignacio
MetadataShow full item record
AbstractEncapsulators are linguistic units which establish coherent referential connections to the preceding discourse in a text. In this paper, we address the challenge of automatically analysing the pronominal encapsulator ello in Spanish text. Our method identifies, for each occurrence, the antecedent of the pronoun (including its grammatical type), the connective phrase which combines with the pronoun to express a discourse relation linking the antecedent text segment to the following text segment, and the type of semantic relation expressed by the complex discourse marker formed by the connective phrase and pronoun. We describe our annotation of a corpus to inform the development of our method and to finetune an automatic analyser based on bidirectional encoder representation transformers (BERT). On testing our method, we find that it performs with greater accuracy than three baselines (0.76 for the resolution task), and sets a promising benchmark for the automatic annotation of occurrences of the pronoun ello, their antecedents, and the semantic relations between the two text segments linked by the connective in combination with the pronoun.
CitationParodi, G., Evans, R., Ha, L.A., Mitkov, R., Vergara, C.J.J., Olivares-López,, R.I. (2022) A sequence labelling approach for automatic analysis of ello: tagging pronouns, antecedents, and connective phrases. Language Resources and Evaluation 56, pp. 139–164. https://doi.org/10.1007/s10579-021-09559-z
JournalLanguage Resources and Evaluation
DescriptionThis is an accepted manuscript of an article published by Springer in Language Resources and Evaluation on 04/09/2021, available online: https://doi.org/10.1007/s10579-021-09559-z The accepted version of the publication may differ from the final published version.
Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by-nc-nd/4.0/