Published January 1, 2013
| Version v1
Conference paper
Open
Named Entity Recognition on Real Data: A Preliminary Investigation for Turkish
Creators
- 1. Huawei Technol Co Ltd, R&D Dept, TR-34768 Istanbul, Turkey
- 2. Tech Univ Istanbul, Inst Sci & Technol, TR-34469 Istanbul, Turkey
- 3. Tech Univ Istanbul, Dept Comp Engn, TR-34469 Istanbul, Turkey
Description
Named Entity Recognition (NER) is a well-studied area in natural language processing (NLP), and the results reported in the literature are generally very high (approximately >95%) for most languages. Today, most practical natural language applications (e.g. web mining, sentiment analysis, machine translation) focus on real natural language data such as Web 2.0 or speech data. Nevertheless, the NER task is rarely investigated on this type of data, which differs severely from formal written text. In this paper, we present three new Turkish data sets from different domains within this focus area (namely Twitter, a speech-to-text interface, and a hardware forum), annotated specifically for NER, and report our first results on them. We believe the paper sheds light on the difficulty of these new domains for NER and on possible future work.
Files
| Name | Size | MD5 |
|---|---|---|
| bib-32df8f02-6d62-47d5-86a7-49b53e475ec9.txt | 232 Bytes | 53839e40532ae81303682c1e701e1a17 |