Published January 1, 2018 | Version v1
Conference paper Open

Evaluation of Semantic Relatedness Measures for Turkish Language

  • 1. Cankaya Univ, Dept Comp Engn, Eskisehir Yolu 29 Km Etimesgut, Ankara, Turkey
  • 2. Hacettepe Univ, Inst Informat, Ankara, Turkey

Description

The problem of quantifying semantic relatedness level of two words is a fundamental sub-task for many natural language processing systems. While there is a large body of research on measuring semantic relatedness in the English language, the literature lacks detailed analysis for these methods in agglutinative languages. In this research, two new evaluation resources for the Turkish language are constructed. An extensive set of experiments involving multiple tasks: word association, semantic categorization, and automatic WordNet relationship discovery are performed to evaluate different semantic relatedness measures in the Turkish language. As Turkish is an agglutinative language, the morphological processing component is important for distributional similarity algorithms. For languages with rich morphological variations and productivity, methods ranging from simple stemming strategies to morphological disambiguation exists. In our experiments, different morphological processing methods for the Turkish language are investigated.

Files

bib-72f9f480-2f71-4513-8e2b-31b914a22508.txt

Files (178 Bytes)

Name Size Download all
md5:c50b030346abafa965e72cea63905b88
178 Bytes Preview Download