Named-entity recognition in Turkish legal texts

Cetindag, Can; Yazicioglu, Berkay; Koc, Aykut

doi:10.1017/S1351324922000304

Yayınlanmış 1 Ocak 2023 | Sürüm v1

Dergi makalesi Açık

Named-entity recognition in Turkish legal texts

Natural language processing (NLP) technologies and applications in legal text processing are gaining momentum. Being one of the most prominent tasks in NLP, named-entity recognition (NER) can substantiate a great convenience for NLP in law due to the variety of named entities in the legal domain and their accentuated importance in legal documents. However, domain-specific NER models in the legal domain are not well studied. We present a NER model for Turkish legal texts with a custom-made corpus as well as several NER architectures based on conditional random fields and bidirectional long-short-term memories (BiLSTMs) to address the task. We also study several combinations of different word embeddings consisting of GloVe, Morph2Vec, and neural network-based character feature extraction techniques either with BiLSTM or convolutional neural networks. We report 92.27% F1 score with a hybrid word representation of GloVe and Morph2Vec with character-level features extracted with BiLSTM. Being an agglutinative language, the morphological structure of Turkish is also considered. To the best of our knowledge, our work is the first legal domain-specific NER study in Turkish and also the first study for an agglutinative language in the legal domain. Thus, our work can also have implications beyond the Turkish language.

Dosyalar

bib-8a4a0baa-462c-4ba8-8497-b4a5d8e38edc.txt

Dosyalar (141 Bytes)

Ad	Boyut	Hepisini indir
bib-8a4a0baa-462c-4ba8-8497-b4a5d8e38edc.txt md5:30ee6c1a8a142b00509ce1dc0950c61b	141 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	7	7
İndirilenler	3	3
Veri miktarı	423 Bytes	423 Bytes

Named-entity recognition in Turkish legal texts

Dosyalar

bib-8a4a0baa-462c-4ba8-8497-b4a5d8e38edc.txt

Dosyalar (141 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

Named-entity recognition in Turkish legal texts

Oluşturanlar

Açıklama

Dosyalar

bib-8a4a0baa-462c-4ba8-8497-b4a5d8e38edc.txt

Dosyalar (141 Bytes)