Item

Grammatical annotation of historical Portuguese: Generating a corpus-based diachronic dictionary

Bick, Eckhard
Zampieri, Marcos
Editors
Other contributors
Affiliation
Epub Date
Issue Date
2016-09-03
Submitted date
Alternative
Abstract
In this paper, we present an automatic system for the morphosyntactic annotation and lexicographical evaluation of historical Portuguese corpora. Using rule-based orthographical normalization, we were able to apply a standard parser (PALAVRAS) to historical data (Colonia corpus) and to achieve accurate annotation for both POS and syntax. By aligning original and standardized word forms, our method allows to create tailor-made standardization dictionaries for historical Portuguese with optional period or author frequencies.
Citation
Bick E., Zampieri M. (2016) Grammatical Annotation of Historical Portuguese: Generating a Corpus-Based Diachronic Dictionary. In: Sojka P., Horák A., Kopeček I., Pala K. (eds) Text, Speech, and Dialogue. TSD 2016. Lecture Notes in Computer Science, vol 9924. Springer, Cham
Publisher
Journal
Research Unit
DOI
PubMed ID
PubMed Central ID
Embedded videos
Type
Chapter in book
Language
en
Description
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9924)
Series/Report no.
ISSN
EISSN
ISBN
9783319455099
ISMN
Gov't Doc #
Sponsors
Rights
Attribution-NonCommercial-NoDerivs 3.0 United States
Research Projects
Organizational Units
Journal Issue
Embedded videos