Published January 1, 2024 | Version v1
Journal article | Open access

Cross-lingual few-shot sign language recognition

  • 1. HAVELSAN Inc, Image & Video Proc Grp, TR-06800 Ankara, Turkiye
  • 2. Hacettepe Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye
  • 3. Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye

Description

There are over 150 sign languages worldwide, each with numerous local variants and thousands of signs. However, collecting annotated data for each sign language to train a model is a laborious and expert-dependent task. To address this issue, this paper introduces the problem of few-shot sign language recognition (FSSLR) in a cross-lingual setting. The central motivation is to be able to recognize a novel sign, even if it belongs to a sign language unseen during training, based on a small set of examples. To tackle this problem, we propose a novel embedding-based framework that first extracts a spatio-temporal visual representation based on video and hand features, as well as hand landmark estimates. To establish a comprehensive test bed, we propose three meta-learning FSSLR benchmarks that span multiple languages, and extensively evaluate the proposed framework. The experimental results demonstrate the effectiveness and superiority of the proposed approach for few-shot sign language recognition in both monolingual and cross-lingual settings.
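To make the few-shot setting concrete, the sketch below shows one common way to classify a query inside an embedding space: average the few support examples of each sign into a prototype and pick the nearest one. This is a generic illustration, not the framework proposed in the paper. The function names `class_prototypes` and `classify`, the labels, and the 3-dimensional vectors are all hypothetical; in practice the embeddings would come from a trained video model.

```python
from math import dist  # Euclidean distance, available since Python 3.8


def class_prototypes(support):
    """Average the support embeddings of each class into one prototype.

    `support` maps a sign label to a list of equal-length embedding vectors.
    """
    prototypes = {}
    for label, vectors in support.items():
        count = len(vectors)
        # zip(*vectors) groups the values of each embedding dimension together.
        prototypes[label] = [sum(dim) / count for dim in zip(*vectors)]
    return prototypes


def classify(query, prototypes):
    """Return the label whose prototype is nearest to the query embedding."""
    return min(prototypes, key=lambda label: dist(query, prototypes[label]))


# Toy 2-way 2-shot episode with made-up embeddings (assumption, not real data).
support_set = {
    "sign_a": [[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]],
    "sign_b": [[0.0, 1.0, 0.0], [0.0, 0.8, 0.2]],
}
prototypes = class_prototypes(support_set)
predicted = classify([0.9, 0.1, 0.1], prototypes)
```

Because the classifier only compares distances to prototypes built from the support set, signs from a language unseen during training can be handled by supplying a new support set, with no retraining of the classifier itself.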

Files

bib-a9d82d24-3623-4545-9abb-748ad17d579d.txt

Size: 142 Bytes
md5:804f4df99ed0e3ce2a2163aaa7151fe0