Published January 1, 2007 | Version v1
Conference paper Open

Analysis and compensation of sparse packet losses in distributed Turkish continuous speech recognition system

  • 1. EKOM Iletisim Teknol, TUBITAK MAM Teknol Serbest Bolgesi, TR-41470 Gebze, Kocaeli, Turkey
  • 2. Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, TR-34342 Istanbul, Turkey
  • 3. Eskisehir Osmangazi Univ, Elekt Elekt Muhendisligi Bolumu, TR-26480 Eskisehir, Turkey

Description

In this study, we investigate the word error rate (WER) of a Turkish continuous speech recognition system under sparse packet losses in a distributed architecture. In this architecture, speech feature vectors consisting of MFCCs and logarithmic power are transmitted over UDP. A special UDP header is defined for use in the distributed system. Sparse packet losses are generated artificially under different scenarios. Two packet loss concealment methods, Lagrange and spline interpolation, are applied as a front-end process in the recognition system. In the experiments, speech feature vectors are extracted using HTK. The SRI Language Modelling toolkit is used to generate statistical language models. Acoustic modeling and recognition are performed using AT&T software. The WER of the baseline system is 32.1%. Sparse packet losses raise this error rate to as much as 34.2%. The packet loss concealment methods reduce the WER of the speech recognition system to 32.4%.
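The concealment step described above can be illustrated with a minimal sketch of Lagrange interpolation over lost feature frames. The record does not give the polynomial order, the number of neighbouring frames used, or the frame layout, so everything below is an assumption made for illustration: frames are lists of floats, a lost frame is marked `None`, and the names `lagrange_value`, `conceal_losses`, and `neighbours` are hypothetical. It is not the authors' implementation, and the spline variant is not shown.

```python
from typing import List, Optional, Sequence

Frame = Sequence[float]


def lagrange_value(xs: Sequence[int], ys: Sequence[float], x: float) -> float:
    """Evaluate the Lagrange polynomial through the points (xs[i], ys[i]) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        weight = 1.0
        for j, xj in enumerate(xs):
            if j != i:
                weight *= (x - xj) / (xi - xj)
        total += weight * yi
    return total


def conceal_losses(frames: List[Optional[Frame]], neighbours: int = 2) -> List[List[float]]:
    """Fill each lost frame (None) from nearby received frames.

    Assumption: up to `neighbours` received frames on each side of a gap
    are used as interpolation support, one polynomial per coefficient.
    """
    received = [t for t, frame in enumerate(frames) if frame is not None]
    if not received:
        raise ValueError("no received frames to interpolate from")
    dim = len(frames[received[0]])

    restored: List[List[float]] = []
    for t, frame in enumerate(frames):
        if frame is not None:
            restored.append(list(frame))
            continue
        before = [r for r in received if r < t][-neighbours:]
        after = [r for r in received if r > t][:neighbours]
        support = before + after
        restored.append([
            lagrange_value(support, [frames[r][d] for r in support], t)
            for d in range(dim)
        ])
    return restored


if __name__ == "__main__":
    # Toy two-coefficient frames with the third frame lost in transit.
    stream = [[0.0, 10.0], [1.0, 8.0], None, [3.0, 4.0], [4.0, 2.0]]
    print(conceal_losses(stream)[2])
```

Because each coefficient is interpolated independently along the time axis, the same routine applies to the MFCCs and the log-power term alike. With sparse losses the support frames sit close to the gap, which is the setting where a low-order polynomial is a plausible fit.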

Files

bib-a6b4ee2b-30ac-4dd0-88a2-b13e9a36b21c.txt

Files (242 Bytes)

Name Size
md5:fdab12703b18ce0d272cf7eec5d1d2fe 242 Bytes