Published January 1, 2019 | Version v1
Conference paper Open

Analysis of Detected Silent Segments in Call Center Recordings

  • 1. AdresGezgini Inc, R&D Dept, Izmir, Turkey

Description

Interpreting speech signals by making a speech to text translation is an active research area especially in current machine learning/deep learning literature. The speech to text translation of call center recordings is an important and specialized application for speech to text translation. Detecting silence in audio recordings can be a pre-processing step in order to optimize processing speed by not-considering audio parts not having significant information. In this work, such a pre-processing framework for detecting silence parts in an audio signal is considered. It is shown that further statistical analysis on the silence distributions results in detecting interesting audio features which can help in finding audio recordings which do not have actual speech sound but a fax machine tone sequence. This foundation can be directly implemented in a call center management software and makes it possible to discriminate between a normal conversation recording and a fax sound recording.

Files

bib-23f482be-213f-4809-96de-ded44227afe8.txt

Files (181 Bytes)

Name Size Download all
md5:c72b160c9e4859a99a44442edb3972d2
181 Bytes Preview Download