Cross-lingual few-shot sign language recognition
- 1. HAVELSAN Inc, Image & Video Proc Grp, TR-06800 Ankara, Turkiye
- 2. Hacettepe Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye
- 3. Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye
Description
There are over 150 sign languages worldwide, each with numerous local variants and thousands of signs. However, collecting annotated data for each sign language to train a model is a laborious and expert-dependent task. To address this issue, this paper introduces the problem of few-shot sign language recognition (FSSLR) in a cross-lingual setting. The central motivation is to be able to recognize a novel sign, even if it belongs to a sign language unseen during training, based on a small set of examples. To tackle this problem, we propose a novel embedding-based framework that first extracts a spatio-temporal visual representation based on video and hand features, as well as hand landmark estimates. To establish a comprehensive test bed, we propose three meta-learning FSSLR benchmarks that span multiple languages, and extensively evaluate the proposed framework. The experimental results demonstrate the effectiveness and superiority of the proposed approach for few-shot sign language recognition in both monolingual and cross-lingual settings.
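To make the few-shot setting concrete, the following is a minimal sketch of how a single episode could be classified once every video has been mapped to an embedding. It uses nearest-prototype classification, a common few-shot baseline, which is assumed here for illustration only; the abstract does not state which classifier the paper uses. The function names `class_prototypes` and `classify`, the sign labels, and the three-dimensional vectors are all hypothetical stand-ins for the learned spatio-temporal representation.

```python
import math
from collections import defaultdict


def class_prototypes(support):
    """Average the support embeddings of each class into one prototype vector.

    `support` is a list of (label, embedding) pairs; in a real system the
    embeddings would come from a trained video encoder (assumed, not shown).
    """
    grouped = defaultdict(list)
    for label, vec in support:
        grouped[label].append(vec)
    return {
        label: [sum(dim) / len(vecs) for dim in zip(*vecs)]
        for label, vecs in grouped.items()
    }


def classify(query, prototypes):
    """Return the label whose prototype is closest (Euclidean) to the query."""
    return min(prototypes, key=lambda label: math.dist(query, prototypes[label]))


# Toy 2-way 2-shot episode with hand-written embeddings (illustrative values).
support = [
    ("HELLO", [0.9, 0.1, 0.0]),
    ("HELLO", [1.1, -0.1, 0.0]),
    ("THANKS", [0.0, 1.0, 0.2]),
    ("THANKS", [0.0, 0.8, -0.2]),
]
prototypes = class_prototypes(support)
predicted = classify([0.8, 0.2, 0.1], prototypes)
print(predicted)
```

Because the classifier only compares embeddings, nothing in it depends on which sign language the support examples come from. In this kind of scheme, that is what lets a novel sign from an unseen language be recognized from a handful of examples, provided the encoder generalizes.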
Files
| Name | Size |
|---|---|
| bib-a9d82d24-3623-4545-9abb-748ad17d579d.txt (md5:804f4df99ed0e3ce2a2163aaa7151fe0) | 142 Bytes |