Konferans bildirisi Açık Erişim

Turkish language resources: Morphological parser, morphological disambiguator and web corpus

Sak, Hasim; Guengor, Tunga; Saraclar, Murat


JSON

{
  "conceptdoi": "10.81043/aperta.39998", 
  "conceptrecid": "39998", 
  "created": "2021-03-15T20:22:47.413206+00:00", 
  "doi": "10.81043/aperta.39999", 
  "files": [
    {
      "bucket": "67f3efe7-406b-4887-9a2f-f8163803e5ec", 
      "checksum": "md5:7d02e30113cae26e2b948abd8d530080", 
      "key": "bib-29014de1-4d5e-4c26-9725-4c7760af0139.txt", 
      "links": {
        "self": "https://aperta.ulakbim.gov.tr/api/files/67f3efe7-406b-4887-9a2f-f8163803e5ec/bib-29014de1-4d5e-4c26-9725-4c7760af0139.txt"
      }, 
      "size": 190, 
      "type": "txt"
    }
  ], 
  "id": 39999, 
  "links": {
    "badge": "https://aperta.ulakbim.gov.tr/badge/doi/10.81043/aperta.39999.svg", 
    "bucket": "https://aperta.ulakbim.gov.tr/api/files/67f3efe7-406b-4887-9a2f-f8163803e5ec", 
    "conceptbadge": "https://aperta.ulakbim.gov.tr/badge/doi/10.81043/aperta.39998.svg", 
    "conceptdoi": "https://doi.org/10.81043/aperta.39998", 
    "doi": "https://doi.org/10.81043/aperta.39999", 
    "html": "https://aperta.ulakbim.gov.tr/record/39999", 
    "latest": "https://aperta.ulakbim.gov.tr/api/records/39999", 
    "latest_html": "https://aperta.ulakbim.gov.tr/record/39999"
  }, 
  "metadata": {
    "access_right": "open", 
    "access_right_category": "success", 
    "communities": [
      {
        "id": "tubitak-destekli-proje-yayinlari"
      }
    ], 
    "creators": [
      {
        "affiliation": "Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey", 
        "name": "Sak, Hasim"
      }, 
      {
        "affiliation": "Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey", 
        "name": "Guengor, Tunga"
      }, 
      {
        "affiliation": "Bogazici Univ, Elect & Elect Engn Dept, TR-34342 Bebek, Turkey", 
        "name": "Saraclar, Murat"
      }
    ], 
    "description": "In this paper, we propose a set of language resources for building Turkish language processing applications. Specifically, we present a finite-state implementation of a morphological parser, an averaged perceptron-based morphological disambiguator, and compilation of a web corpus. Turkish is an agglutinative language with a highly productive inflectional and derivational morphology, We present an implementation of a morphological parser based on two-level morphology. This parser is one of the most complete parsers for Turkish and it runs independent of any other external system such as PC-KIMMO in contrast to existing parsers. Due to complex phonology and morphology of Turkish, parsing introduces some ambiguous parses. We developed a morphological disambiguator with accuracy of about 98% using averaged perceptron algorithm. We also present our efforts to build a Turkish web corpus of about 423 million words.", 
    "doi": "10.81043/aperta.39999", 
    "has_grant": false, 
    "license": {
      "id": "cc-by"
    }, 
    "meeting": {
      "title": "ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS"
    }, 
    "publication_date": "2008-01-01", 
    "related_identifiers": [
      {
        "identifier": "10.81043/aperta.39998", 
        "relation": "isVersionOf", 
        "scheme": "doi"
      }
    ], 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "39999"
          }, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "39998"
          }
        }
      ]
    }, 
    "resource_type": {
      "subtype": "conferencepaper", 
      "title": "Konferans bildirisi", 
      "type": "publication"
    }, 
    "title": "Turkish language resources: Morphological parser, morphological disambiguator and web corpus"
  }, 
  "owners": [
    1
  ], 
  "revision": 1, 
  "stats": {
    "downloads": 22.0, 
    "unique_downloads": 21.0, 
    "unique_views": 129.0, 
    "version_downloads": 22.0, 
    "version_unique_downloads": 21.0, 
    "version_unique_views": 123.0, 
    "version_views": 131.0, 
    "version_volume": 4180.0, 
    "views": 140.0, 
    "volume": 4180.0
  }, 
  "updated": "2021-03-15T20:22:47.465577+00:00"
}
140
22
görüntülenme
indirilme
Görüntülenme 140
İndirme 22
Veri hacmi 4.2 kB
Tekil görüntülenme 129
Tekil indirme 21

Alıntı yap