Yayınlanmış 1 Ocak 2012 | Sürüm v1
Konferans bildirisi Açık

Construction of the Turkish National Corpus (TNC)

  • 1. Mersin Univ, Fen Edebiyat Fak, TR-33343 Mersin, Turkey
  • 2. Yasar Univ, Muhendislik Fak, TR-35100 Izmir, Turkey

Açıklama

This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced, large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense, TNC generally follows the framework of British National Corpus, yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process, different types of open-source software are used for specific tasks, and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features, web-based corpus management system, carefully planned workflow and its web-based user-friendly search interface.

Dosyalar

bib-06f92fba-cee3-4ddf-93c4-dd9e16d1c389.txt

Dosyalar (275 Bytes)

Ad Boyut Hepisini indir
md5:adb96628cf5092dcc084938d4a3a123d
275 Bytes Ön İzleme İndir