Published January 1, 2019 | Version v1
Conference paper Open

Translating Between Morphologically Rich Languages: An Arabic-to-Turkish Machine Translation System

  • 1. Tubitak Bilgem, Kocaeli, Turkey

Description

This paper introduces the work on building a machine translation system for Arabic-to-Turkish in the news domain. Our work includes collecting parallel datasets in several ways for a new and low-resource language pair, building baseline systems with state-of-the-art architectures and developing language specific algorithms for better translation. Parallel datasets are mainly collected three different ways; i) translating Arabic texts into Turkish by professional translators, ii) exploiting the web for open-source Arabic-Turkish parallel texts, iii) using back-translation. We performed preliminary experiments for Arabicto-Turkish machine translation with neural (Marian) machine translation tools with a novel morphologically motivated vocabulary reduction method.

Files

bib-c3389b71-b27c-47e5-8156-4c8904ac48a8.txt

Files (221 Bytes)

Name Size Download all
md5:a6941f86e372aadeca7eccbe8b1e6c7a
221 Bytes Preview Download