Konferans bildirisi Açık Erişim

Turkish Treebanking: Unifying and Constructing Efforts

Turk, Utku; Atmaca, Furkan; Ozates, Saziye Betul; Koksal, Abdullatif; Ozturk, Balkiz; Gungor, Tunga; Ozgur, Arzucan


DataCite XML

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="URL">https://aperta.ulakbim.gov.tr/record/74453</identifier>
  <creators>
    <creator>
      <creatorName>Turk, Utku</creatorName>
      <givenName>Utku</givenName>
      <familyName>Turk</familyName>
      <affiliation>Bogazici Univ, Dept Linguist, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Atmaca, Furkan</creatorName>
      <givenName>Furkan</givenName>
      <familyName>Atmaca</familyName>
      <affiliation>Bogazici Univ, Dept Linguist, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Ozates, Saziye Betul</creatorName>
      <givenName>Saziye Betul</givenName>
      <familyName>Ozates</familyName>
      <affiliation>Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Koksal, Abdullatif</creatorName>
      <givenName>Abdullatif</givenName>
      <familyName>Koksal</familyName>
      <affiliation>Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Ozturk, Balkiz</creatorName>
      <givenName>Balkiz</givenName>
      <familyName>Ozturk</familyName>
      <affiliation>Bogazici Univ, Dept Linguist, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Gungor, Tunga</creatorName>
      <givenName>Tunga</givenName>
      <familyName>Gungor</familyName>
      <affiliation>Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Ozgur, Arzucan</creatorName>
      <givenName>Arzucan</givenName>
      <familyName>Ozgur</familyName>
      <affiliation>Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Turkish Treebanking: Unifying And Constructing Efforts</title>
  </titles>
  <publisher>Aperta</publisher>
  <publicationYear>2019</publicationYear>
  <dates>
    <date dateType="Issued">2019-01-01</date>
  </dates>
  <resourceType resourceTypeGeneral="Text">Conference paper</resourceType>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://aperta.ulakbim.gov.tr/record/74453</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.81043/aperta.74452</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.81043/aperta.74453</relatedIdentifier>
  </relatedIdentifiers>
  <rightsList>
    <rights rightsURI="http://www.opendefinition.org/licenses/cc-by">Creative Commons Attribution</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">In this paper, we present the re-annotation of the Turkish PUD Treebank and the first annotation of the Turkish National Corpus Universal Dependency (henceforth TNC-UD) Treebank as part of our efforts for unifying and extending the Turkish universal dependency treebanks. In accordance with the Universal Dependencies' guidelines and the necessities of Turkish grammar, both treebanks, the Turkish PUD Treebank and TNC-UD, were revised with regards to their syntactic relations. The TNC-UD is planned to have 10,000 sentences. In this paper, we present the first 500 sentences along with the re-annotation of the PUD Treebank. Moreover, this paper also offers the parsing results of a graph-based neural parser on the previous and re-annotated PUD, as well as the TNC-UD. In light of the comparisons, even though we observe a slight decrease in the attachment scores of the Turkish PUD treebank, we demonstrate that the annotation of the TNC-UD improves the parsing accuracy of Turkish. In addition to the treebanks, we have also constructed a custom annotation software with advanced filtering and morphological editing options. Both of the treebanks, including a full edit-history and the annotation guidelines, as well as the custom software are publicly available online under an open license.</description>
  </descriptions>
</resource>
30
4
görüntülenme
indirilme
Görüntülenme 30
İndirme 4
Veri hacmi 764 Bytes
Tekil görüntülenme 29
Tekil indirme 4

Alıntı yap