Loading...
Modeling morpheme triplets with a three-level hierarchical Dirichlet process
Kumyol, Serkan ; Can, Burcu
Kumyol, Serkan
Can, Burcu
Authors
Editors
Other contributors
Affiliation
Epub Date
Issue Date
2017-03-13
Submitted date
Alternative
Abstract
Morphemes are not independent units and attached to each other based on morphotactics. However, they are assumed to be independent from each other to cope with the complexity in most of the models in the literature. We introduce a language independent model for unsupervised morphological segmentation using hierarchical Dirichlet process (HDP). We model the morpheme dependencies in terms of morpheme trigrams in each word. Trigrams, bigrams and unigrams are modeled within a three-level HDP, where the trigram Dirichlet process (DP) uses the bigram DP and bigram DP uses unigram DP as the base distribution. The results show that modeling morpheme dependencies improve the F-measure noticeably in English, Turkish and Finnish.
Citation
Kumyol, S. and Can, B. (2016) Modeling morpheme triplets with a three-level hierarchical Dirichlet process, 2016 International Conference on Asian Language Processing (IALP), 21-23 November, 2016, Tainan, Taiwan.
Publisher
Journal
Research Unit
PubMed ID
PubMed Central ID
Embedded videos
Additional Links
Type
Conference contribution
Language
en
Description
This is an accepted manuscript of an article published by IEEE in 2016 International Conference on Asian Language Processing (IALP) on 13/03/2017, available online: https://ieeexplore.ieee.org/document/7876007
The accepted version of the publication may differ from the final published version.
Series/Report no.
ISSN
2159-1962
EISSN
2159-1970