Dergi makalesi Açık Erişim

Named-entity recognition in Turkish legal texts

Cetindag, Can; Yazicioglu, Berkay; Koc, Aykut


JSON

{
  "conceptrecid": "254558", 
  "created": "2023-07-28T21:18:36.490864+00:00", 
  "doi": "10.1017/S1351324922000304", 
  "files": [
    {
      "bucket": "f647d021-0a17-40ff-82aa-ade493489741", 
      "checksum": "md5:30ee6c1a8a142b00509ce1dc0950c61b", 
      "key": "bib-8a4a0baa-462c-4ba8-8497-b4a5d8e38edc.txt", 
      "links": {
        "self": "https://aperta.ulakbim.gov.tr/api/files/f647d021-0a17-40ff-82aa-ade493489741/bib-8a4a0baa-462c-4ba8-8497-b4a5d8e38edc.txt"
      }, 
      "size": 141, 
      "type": "txt"
    }
  ], 
  "id": 254559, 
  "links": {
    "badge": "https://aperta.ulakbim.gov.tr/badge/doi/10.1017/S1351324922000304.svg", 
    "bucket": "https://aperta.ulakbim.gov.tr/api/files/f647d021-0a17-40ff-82aa-ade493489741", 
    "doi": "https://doi.org/10.1017/S1351324922000304", 
    "html": "https://aperta.ulakbim.gov.tr/record/254559", 
    "latest": "https://aperta.ulakbim.gov.tr/api/records/254559", 
    "latest_html": "https://aperta.ulakbim.gov.tr/record/254559"
  }, 
  "metadata": {
    "access_right": "open", 
    "access_right_category": "success", 
    "communities": [
      {
        "id": "tubitak-destekli-proje-yayinlari"
      }
    ], 
    "creators": [
      {
        "name": "Cetindag, Can"
      }, 
      {
        "name": "Yazicioglu, Berkay"
      }, 
      {
        "name": "Koc, Aykut"
      }
    ], 
    "description": "Natural language processing (NLP) technologies and applications in legal text processing are gaining momentum. Being one of the most prominent tasks in NLP, named-entity recognition (NER) can substantiate a great convenience for NLP in law due to the variety of named entities in the legal domain and their accentuated importance in legal documents. However, domain-specific NER models in the legal domain are not well studied. We present a NER model for Turkish legal texts with a custom-made corpus as well as several NER architectures based on conditional random fields and bidirectional long-short-term memories (BiLSTMs) to address the task. We also study several combinations of different word embeddings consisting of GloVe, Morph2Vec, and neural network-based character feature extraction techniques either with BiLSTM or convolutional neural networks. We report 92.27% F1 score with a hybrid word representation of GloVe and Morph2Vec with character-level features extracted with BiLSTM. Being an agglutinative language, the morphological structure of Turkish is also considered. To the best of our knowledge, our work is the first legal domain-specific NER study in Turkish and also the first study for an agglutinative language in the legal domain. Thus, our work can also have implications beyond the Turkish language.", 
    "doi": "10.1017/S1351324922000304", 
    "has_grant": false, 
    "journal": {
      "issue": "3", 
      "pages": "615-642", 
      "title": "NATURAL LANGUAGE ENGINEERING", 
      "volume": "29"
    }, 
    "license": {
      "id": "cc-by"
    }, 
    "publication_date": "2023-01-01", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "254559"
          }, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "254558"
          }
        }
      ]
    }, 
    "resource_type": {
      "subtype": "article", 
      "title": "Dergi makalesi", 
      "type": "publication"
    }, 
    "science_branches": [
      "Di\u011fer"
    ], 
    "title": "Named-entity recognition in Turkish legal texts"
  }, 
  "owners": [
    1
  ], 
  "revision": 1, 
  "stats": {
    "downloads": 8.0, 
    "unique_downloads": 8.0, 
    "unique_views": 27.0, 
    "version_downloads": 8.0, 
    "version_unique_downloads": 8.0, 
    "version_unique_views": 27.0, 
    "version_views": 28.0, 
    "version_volume": 1128.0, 
    "views": 28.0, 
    "volume": 1128.0
  }, 
  "updated": "2023-07-28T21:18:36.547431+00:00"
}
28
8
görüntülenme
indirilme
Görüntülenme 28
İndirme 8
Veri hacmi 1.1 kB
Tekil görüntülenme 27
Tekil indirme 8

Alıntı yap