Conference paper Open Access

Turkish Paraphrase Corpus

   Demir, Seniz; El-Kahlout, Ilknur Durgar; Unal, Erdem; Kaya, Hamza

Paraphrases are alternative syntactic forms in the same language that express the same semantic content. Speakers of all languages are inherently familiar with paraphrases at different levels of granularity (lexical, phrasal, and sentential). For quite some time, paraphrasing has been receiving growing attention from the research community, and its potential use in several natural language processing applications (such as text summarization and machine translation) is being investigated. In this paper, we present what is, to the best of our knowledge, the first Turkish paraphrase corpus. The corpus is gleaned from four different sources and currently contains 1270 paraphrase pairs. All paraphrase pairs are carefully annotated by native Turkish speakers with the identified semantic correspondences between paraphrases. Work on expanding the corpus is still under way.

Files (162 Bytes)
File name Size
bib-1b525b92-e56c-4886-b0b7-30d0c77693e0.txt
md5:107a9def02df5231437f27afc8b1fa4c
162 Bytes