Compositional Neural Network Language Models for Agglutinative Languages

Arisoy, Ebru; Saraclar, Murat

doi:10.21437/Interspeech.2016-1239

Published January 1, 2016 | Version v1

Conference paper Open

Compositional Neural Network Language Models for Agglutinative Languages

1. MEF Univ, Istanbul, Turkey
2. Bogazici Univ, Istanbul, Turkey

Continuous space language models (CSLMs) have been proven to be successful in speech recognition. With proper training of the word embeddings, words that are semantically or syntactically related are expected to be mapped to nearby locations in the continuous space. In agglutinative languages, words are made up of concatenation of stems and suffixes and, as a result, compositional modeling is important. However, when trained on word tokens, CSLMs do not explicitly consider this structure. In this paper, we explore compositional modeling of stems and suffixes in a long short-term memory neural network language model. Our proposed models jointly learn distributed representations for stems and endings (concatenation of suffixes) and predict the probability for stem and ending sequences. Experiments on the Turkish Broadcast news transcription task show that further gains on top of a state-of-theart stem-ending-based n-gram language model can be obtained with the proposed models.

Files

bib-e597aba6-b79d-4889-a38c-39f6597539f3.txt

Files (213 Bytes)

Name	Size	Download all
bib-e597aba6-b79d-4889-a38c-39f6597539f3.txt md5:fca12286f8feb0fd589273e6f1060b9a	213 Bytes	Preview Download

	All versions	This version
Views	39	39
Downloads	14	14
Data volume	3.2 kB	3.2 kB

Compositional Neural Network Language Models for Agglutinative Languages

Files

bib-e597aba6-b79d-4889-a38c-39f6597539f3.txt

Files (213 Bytes)

TÜBİTAK ULAKBİM

CONTACT

Compositional Neural Network Language Models for Agglutinative Languages

Creators

Description

Files

bib-e597aba6-b79d-4889-a38c-39f6597539f3.txt

Files (213 Bytes)