Published January 1, 2009
| Version v1
Conference paper
Open
EFFECT OF PRONUNCIATIONS ON OOV QUERIES IN SPOKEN TERM DETECTION
Creators
- 1. Bogazici Univ, TR-80815 Bebek, Turkey
- 2. MIT, Cambridge, MA 02139 USA
- 3. IBM Corp, Armonk, NY 10504 USA
- 4. Johns Hopkins Univ, Baltimore, MD 21218 USA
Description
The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for Out-of-Vocabulary (OOV) terms which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using Weighted Finite State Transducers (WFST). We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well and too many variants for the OOV terms degrade performance if pronunciations are not weighted.
Files
bib-5131204a-19cb-4d7e-9116-c7460367336f.txt
Files
(250 Bytes)
| Name | Size | Download all |
|---|---|---|
|
md5:ccb6cd685e2de62ca2003c8037b0a557
|
250 Bytes | Preview Download |