Published January 1, 2020 | Version v1
Journal article Open

Learning Word-vector Quantization: A Case Study in Morphological Disambiguation

  • 1. Cukurova Univ, Dept Comp Engn, TR-01330 Adana, Turkey

Description

We introduce a new classifier, Learning Word-vector Quantization (LWQ), to resolve morphological ambiguities in Turkish, an agglutinative language. We first prepare a new, morphologically annotated corpus and then derive its datasets through a series of processing steps. Using these datasets, LWQ finds optimal word-vector positions by moving the vectors in Euclidean space. LWQ performs morphological disambiguation in two steps: first, it generates all candidate analyses of an ambiguous word using a morphological analyzer; second, it chooses the best candidate according to its total distance to neighboring words that are not ambiguous. To evaluate LWQ's performance, we conducted many tests on the corpus, considering the consistency of classification. In the experiments, LWQ selects the correct parse output with a 98.4% correct classification ratio, which is an excellent result compared with the literature.
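
The second disambiguation step can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the parse labels, and the toy 2-D vectors are all hypothetical, and the sketch assumes that each candidate analysis and each unambiguous neighbor already has a learned vector. It only shows the selection rule stated in the description, namely picking the candidate with the smallest total Euclidean distance to its unambiguous neighbors.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def disambiguate(candidates, neighbors, vectors):
    """Pick the candidate parse closest, in total, to the unambiguous neighbors.

    candidates: candidate parses of the ambiguous word (hypothetical labels,
                standing in for morphological analyzer output)
    neighbors:  parses of nearby words that are not ambiguous
    vectors:    mapping from parse label to its vector (assumed already learned)
    """
    def total_distance(candidate):
        return sum(euclidean(vectors[candidate], vectors[n]) for n in neighbors)

    return min(candidates, key=total_distance)


# Made-up vectors for illustration only; real vectors would come from training.
vectors = {
    "yüz+Noun": (0.0, 0.0),
    "yüz+Verb": (5.0, 5.0),
    "güzel+Adj": (1.0, 0.0),
    "bir+Det": (0.0, 1.0),
}

candidates = ["yüz+Noun", "yüz+Verb"]   # analyses of the ambiguous word
neighbors = ["güzel+Adj", "bir+Det"]    # unambiguous context words

best = disambiguate(candidates, neighbors, vectors)
print(best)
```

With these toy vectors the noun reading lies much closer to both neighbors, so it is the one selected. How the vectors are moved during training is not specified in the description and is left out of the sketch.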

Files

bib-a8f57948-1de9-4572-860c-0e984466d420.txt (194 Bytes)
md5:d43da8059f313ff2a20a56712fe11cdc