Tree structured Dirichlet processes for hierarchical morphological segmentation
Abstract
This article presents a probabilistic hierarchical clustering model for morphological segmentation. In contrast to existing approaches to morphology learning, our method allows learning hierarchical organization of word morphology as a collection of tree structured paradigms. The model is fully unsupervised and based on the hierarchical Dirichlet process. Tree hierarchies are learned along with the corresponding morphological paradigms simultaneously. Our model is evaluated on Morpho Challenge and shows competitive performance when compared to state-of-the-art unsupervised morphological segmentation systems. Although we apply this model for morphological segmentation, the model itself can also be used for hierarchical clustering of other types of data.Citation
Can, B. and Manandhar, S. (2018) Tree structured Dirichlet processes for hierarchical morphological segmentation, Computational Linguistics, 44(2), pp. 349-374.Publisher
MIT PressJournal
Computational LinguisticsAdditional Links
https://www.mitpressjournals.org/doi/full/10.1162/coli_a_00318Type
Journal articleLanguage
enDescription
© 2018 The Authors. Published by MIT Press/ACL. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: https://doi.org/10.1162/coli_a_00318ISSN
0891-2017EISSN
1530-9312ae974a485f413a2113503eed53cd6c53
10.1162/COLI_a_00318
Scopus Count
Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by-nc-nd/4.0/