Published January 1, 2011
| Version v1
Journal article
Open
Resources for Turkish morphological processing
Creators
- 1. Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
- 2. Bogazici Univ, Dept Elect & Elect Engn, TR-34342 Istanbul, Turkey
Description
We present a set of language resources and tools-a morphological parser, a morphological disambiguator, and a text corpus-for exploiting Turkish morphology in natural language processing applications. The morphological parser is a state-of-the-art finite-state transducer-based implementation of Turkish morphology. The disambiguator is based on the averaged perceptron algorithm and has the best accuracy reported for Turkish in the literature. The text corpus has been compiled from the web and contains about 500 million tokens. This is the largest Turkish web corpus published.
Files
bib-ca3a0269-71d8-4be9-b9c5-f16abcca6b04.txt
Files
(141 Bytes)
| Name | Size | Download all |
|---|---|---|
|
md5:69c14e72e89417c7c347d6abcc72228c
|
141 Bytes | Preview Download |