What matters more: the size of the corpora or their quality? The case of automatic translation of multiword expressions using comparable corpora.
AbstractThis study investigates (and compares) the impact of the size and the similarity/quality of comparable corpora on the specific task of extracting translation equivalents of verb-noun collocations from such corpora. The comprehensive evaluation of different configurations of English and Spanish corpora sheds some light on the more general and perennial question: what matters more – the quantity or quality of corpora?
TypeChapter in book