Published January 1, 2009 | Version v1
Conference paper Open

EFFECT OF PRONUNCIATIONS ON OOV QUERIES IN SPOKEN TERM DETECTION

  • 1. Bogazici Univ, TR-80815 Bebek, Turkey
  • 2. MIT, Cambridge, MA 02139 USA
  • 3. IBM Corp, Armonk, NY 10504 USA
  • 4. Johns Hopkins Univ, Baltimore, MD 21218 USA

Description

The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for Out-of-Vocabulary (OOV) terms which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using Weighted Finite State Transducers (WFST). We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well and too many variants for the OOV terms degrade performance if pronunciations are not weighted.

Files

bib-5131204a-19cb-4d7e-9116-c7460367336f.txt

Files (250 Bytes)

Name Size Download all
md5:ccb6cd685e2de62ca2003c8037b0a557
250 Bytes Preview Download