Konferans bildirisi Açık Erişim

A Named Entity Recognition Dataset for Turkish

   Kucuk, Dilek; Kucuk, Dogan; Arici, Nursal

Named entity recognition is one of the important topics in the research area of natural language processing. Named entity recognition studies conducted on Turkish texts are quite limited, compared to the studies on other languages. Besides, the lack of common data sets makes the comparison of different approaches harder. In this study, a dataset comprising news articles in Turkish annotated with named entities is presented. The annotations comprise the basic named entity types of person, location, and organization names. Additionally, to be used as reference in future studies, a rule-based named entity recognition system is evaluated on the final form of this data set and the corresponding evaluation results are presented. It is envisioned that our study will contribute to the advancement of named entity recognition studies on Turkish texts.

Dosyalar (163 Bytes)
Dosya adı Boyutu
bib-34741bea-237c-4469-ac0b-55976b819f55.txt
md5:6e796722998295413c006f13218f5860
163 Bytes İndir
38
10
görüntülenme
indirilme
Görüntülenme 38
İndirme 10
Veri hacmi 1.6 kB
Tekil görüntülenme 32
Tekil indirme 10

Alıntı yap