Published January 1, 2011 | Version v1
Journal article | Open access

Lattice Indexing for Spoken Term Detection

  • 1. Univ So Calif, Los Angeles, CA 90089 USA
  • 2. Bogazici Univ, Dept Elect & Elect Engn, TR-34342 Istanbul, Turkey

Description

This paper considers the problem of constructing an efficient inverted index for the spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in the form of (utterance ID, start time, end time, posterior score) quadruplets. We propose a generalized factor transducer structure which retains the time information necessary for performing STD. The required information is embedded into the path weights of the factor transducer without disrupting the inherent optimality. We also describe how to index all substrings seen in a collection of raw automatic speech recognition lattices using the proposed structure. Our STD indexing/search implementation is built upon the OpenFst Library and is designed to scale well to large problems. Experiments on Turkish and English data sets corroborate our claims.
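As a rough illustration of what a soft-hit index returns, the sketch below builds a plain dictionary-based index over all word substrings (factors) of a few enumerated lattice paths. It is not the weighted finite-state transducer construction described in the abstract and does not use OpenFst. The input layout, the names `build_factor_index` and `search`, the `max_len` cutoff, and the toy data are all assumptions made for illustration. The score of a hit is taken to be the sum of the posteriors of the enumerated paths that contain the factor at the same start and end times.

```python
from collections import defaultdict


def build_factor_index(lattices, max_len=3):
    """Index every substring (factor) of up to max_len words.

    lattices: {utt_id: [(path_posterior, [(word, start, end), ...]), ...]}
    This explicit path list is a hypothetical stand-in for a real lattice.
    Returns {factor: [(utt_id, start, end, score), ...]}, best score first.
    """
    scores = defaultdict(float)
    for utt_id, paths in lattices.items():
        for posterior, arcs in paths:
            for i in range(len(arcs)):
                for j in range(i + 1, min(i + max_len, len(arcs)) + 1):
                    factor = tuple(word for word, _, _ in arcs[i:j])
                    start, end = arcs[i][1], arcs[j - 1][2]
                    # Paths sharing the factor at the same times pool their mass.
                    scores[(factor, utt_id, start, end)] += posterior

    index = defaultdict(list)
    for (factor, utt_id, start, end), score in scores.items():
        index[factor].append((utt_id, start, end, score))
    for hits in index.values():
        hits.sort(key=lambda hit: -hit[3])
    return index


def search(index, query):
    """Return the soft-hits for a whitespace-separated query term."""
    return index.get(tuple(query.split()), [])


# Toy data: three competing paths for one utterance (posteriors sum to 1).
example_lattices = {
    "utt1": [
        (0.5, [("spoken", 0.0, 0.5), ("term", 0.5, 0.75), ("detection", 0.75, 1.5)]),
        (0.25, [("spoken", 0.0, 0.5), ("term", 0.5, 0.75), ("direction", 0.75, 1.5)]),
        (0.25, [("broken", 0.0, 0.5), ("term", 0.5, 0.75), ("detection", 0.75, 1.5)]),
    ],
}
index = build_factor_index(example_lattices)
print(search(index, "spoken term"))  # [('utt1', 0.0, 0.75, 0.75)]
```

Enumerating paths and substrings explicitly grows quickly with lattice size, which is the cost that a compact, deterministic transducer representation is meant to avoid.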

Files

bib-7a13dfbb-32d9-4718-b18b-ea3b1f2e3f13.txt (151 bytes)
md5:a6d197264912730afd4c4fa11c58bf6e