Published January 1, 2018 | Version v1
Conference paper Open

Supervised Author Recognition with Aggregated Word Embeddings

  • 1. TUBITAK Uzay Teknol Arastirma Enstitusu, Goruntu Isleme Grubu, Ankara, Turkey

Description

The number of texts has been remarkably increased with each passing day due to the rapid development of technology. This situation creates a need for the development of new techniques in the fields of text mining and natural language processing. Highly successful methods are developed by especially using word embedding based on artificial neural network. In this paper, an application is produced by using Word2vFisher based on word embedding and Fisher vector for the analysis of Turkish texts. A dataset containing 237 different columnist are created by collecting columns of last 20 years from the electronic archive of Hurriyet and Sabah newspapers. One of the important points of this study is that the experiments are conducted on the largest-ever dataset that contains Turkish newspaper columns. The effectiveness of the method on analysis of the Turkish texts is another important point of this study. It is believed that the method can be utilized in many other domains.

Files

bib-7ff289c6-9763-4828-9f41-22aa66f36566.txt

Files (180 Bytes)

Name Size Download all
md5:d345ec126e194e53793043d1c45d2d49
180 Bytes Preview Download