Yayınlanmış 1 Ocak 2006 | Sürüm v1
Konferans bildirisi Açık

Database construction for speech to lip-readable animation conversion

  • 1. Peter Pazmany Catholic Univ, Fac Informat Technol, Prater U 50-a, H-1083 Budapest, Hungary

Açıklama

The training database was one of the critical element in our speech to facial animation conversion system. This system was developed as a communication aid for deaf people. The specific database was constructed from audio and visual records of professional lip-speakers. The standardized MPE -4 system was used to animate the talking head model. The trained neural net is able to calculate with acceptable error the principal component weights of feature points from the speech frames. The feature point coordinates are, calculated from PC weights. The whole system can be implemented in mobile phones. Deaf persons were able to recognize about 50 of words from the speech driven animation in the final test.

Dosyalar

bib-8e7c2af4-58dd-4cf7-ae21-a4a2332a4fed.txt

Dosyalar (163 Bytes)

Ad Boyut Hepisini indir
md5:f4bb16df541153b9128159fd51eb7fdf
163 Bytes Ön İzleme İndir