Published January 1, 2019
| Version v1
Journal article
Open
Generative RNNs for OOV Keyword Search
- 1. Natl Def Univ, Naval Acad, Dept Elect & Elect Engn, TR-34940 Istanbul, Turkey
- 2. Bogazici Univ, Dept Elect & Elect Engn, TR-34342 Istanbul, Turkey
Description
The modeling of text queries as sequences of embeddings for conducting similarity matching based search within speech features has been recently shown to improve keyword search (KWS) performance, especially for the out-of-vocabulary (OOV) terms. This technique uses a dynamic time warping based search methodology, converting the KWS problem into a pattern search problem by artificially modeling the text queries as pronunciation-based embedding sequences. This query modeling is done by concatenating and repeating frame representations for each phoneme in the keyword's pronunciation. In this letter, we propose a query model that incorporates temporal context information using recurrent neural networks (RNN) trained to generate realistic query representations. With experiments conducted on the IARPA Babel Program's Turkish and Zulu datasets, we show that the proposed RNN-based query generation yields significant improvements over the statistical query models of earlier work, and yields a comparable performance to the state-of-the-art techniques for OOV KWS.
Files
bib-0dbd3b31-fa7c-4cfa-88f4-ae7c4c64149f.txt
Files
(134 Bytes)
| Name | Size | Download all |
|---|---|---|
|
md5:eaf41f774e9fa1091f0dfd2576d72e18
|
134 Bytes | Preview Download |