Konferans bildirisi Açık Erişim

Turkish Treebanking: Unifying and Constructing Efforts

Turk, Utku; Atmaca, Furkan; Ozates, Saziye Betul; Koksal, Abdullatif; Ozturk, Balkiz; Gungor, Tunga; Ozgur, Arzucan


Dublin Core

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Turk, Utku</dc:creator>
  <dc:creator>Atmaca, Furkan</dc:creator>
  <dc:creator>Ozates, Saziye Betul</dc:creator>
  <dc:creator>Koksal, Abdullatif</dc:creator>
  <dc:creator>Ozturk, Balkiz</dc:creator>
  <dc:creator>Gungor, Tunga</dc:creator>
  <dc:creator>Ozgur, Arzucan</dc:creator>
  <dc:date>2019-01-01</dc:date>
  <dc:description>In this paper, we present the re-annotation of the Turkish PUD Treebank and the first annotation of the Turkish National Corpus Universal Dependency (henceforth TNC-UD) Treebank as part of our efforts for unifying and extending the Turkish universal dependency treebanks. In accordance with the Universal Dependencies' guidelines and the necessities of Turkish grammar, both treebanks, the Turkish PUD Treebank and TNC-UD, were revised with regards to their syntactic relations. The TNC-UD is planned to have 10,000 sentences. In this paper, we present the first 500 sentences along with the re-annotation of the PUD Treebank. Moreover, this paper also offers the parsing results of a graph-based neural parser on the previous and re-annotated PUD, as well as the TNC-UD. In light of the comparisons, even though we observe a slight decrease in the attachment scores of the Turkish PUD treebank, we demonstrate that the annotation of the TNC-UD improves the parsing accuracy of Turkish. In addition to the treebanks, we have also constructed a custom annotation software with advanced filtering and morphological editing options. Both of the treebanks, including a full edit-history and the annotation guidelines, as well as the custom software are publicly available online under an open license.</dc:description>
  <dc:identifier>https://aperta.ulakbim.gov.trrecord/74453</dc:identifier>
  <dc:identifier>oai:zenodo.org:74453</dc:identifier>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>http://www.opendefinition.org/licenses/cc-by</dc:rights>
  <dc:title>Turkish Treebanking: Unifying and Constructing Efforts</dc:title>
  <dc:type>info:eu-repo/semantics/conferencePaper</dc:type>
  <dc:type>publication-conferencepaper</dc:type>
</oai_dc:dc>
39
4
görüntülenme
indirilme
Görüntülenme 39
İndirme 4
Veri hacmi 764 Bytes
Tekil görüntülenme 36
Tekil indirme 4

Alıntı yap