Published January 1, 2011 | Version v1
Journal article Open

Resources for Turkish morphological processing

  • 1. Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
  • 2. Bogazici Univ, Dept Elect & Elect Engn, TR-34342 Istanbul, Turkey

Description

We present a set of language resources and tools (a morphological parser, a morphological disambiguator, and a text corpus) for exploiting Turkish morphology in natural language processing applications. The morphological parser is a state-of-the-art implementation of Turkish morphology based on finite-state transducers. The disambiguator is based on the averaged perceptron algorithm and has the best accuracy reported for Turkish in the literature. The text corpus was compiled from the web and contains about 500 million tokens, making it the largest published Turkish web corpus.

Files

bib-ca3a0269-71d8-4be9-b9c5-f16abcca6b04.txt

Files (141 Bytes)

Name Size
md5:69c14e72e89417c7c347d6abcc72228c
141 Bytes