Konferans bildirisi Açık Erişim

Database construction for speech to lip-readable animation conversion

   acs, Gyorgy Ta; Tihanyi, Atilla; Bardi, Tamas; Feldhoffer, Gergo; Srancsi, Balint

The training database was one of the critical element in our speech to facial animation conversion system. This system was developed as a communication aid for deaf people. The specific database was constructed from audio and visual records of professional lip-speakers. The standardized MPE -4 system was used to animate the talking head model. The trained neural net is able to calculate with acceptable error the principal component weights of feature points from the speech frames. The feature point coordinates are, calculated from PC weights. The whole system can be implemented in mobile phones. Deaf persons were able to recognize about 50 of words from the speech driven animation in the final test.

Dosyalar (163 Bytes)
Dosya adı Boyutu
bib-8e7c2af4-58dd-4cf7-ae21-a4a2332a4fed.txt
md5:f4bb16df541153b9128159fd51eb7fdf
163 Bytes İndir
42
9
görüntülenme
indirilme
Görüntülenme 42
İndirme 9
Veri hacmi 1.5 kB
Tekil görüntülenme 40
Tekil indirme 9

Alıntı yap