Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake Expression Prediction

Ozkan, Savas; Akar, Gozde Bozdagi

doi:10.1109/ICCVW.2017.366

Published January 1, 2017 | Version v1

Conference paper Open

Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake Expression Prediction

1. Middle East Tech Univ, Multimedia Lab, Ankara, Turkey

Frame-level visual features are generally aggregated in time with the techniques such as LSTM, Fisher Vectors, NetVLAD etc. to produce a robust video-level representation. We here introduce a learnable aggregation technique whose primary objective is to retain short-time temporal structure between frame-level features and their spatial interdependencies in the representation. Also, it can be easily adapted to the cases where there have very scarce training samples. We evaluate the method on a real-fake expression prediction dataset to demonstrate its superiority. Our method obtains 65% score on the test dataset in the official MAP evaluation and there is only one misclassified decision with the best reported result in the Chalearn Challenge (i.e. 66.7%). Lastly, we believe that this method can be extended to different problems such as action/event recognition in future.

Files

bib-b4dc114a-24bc-42c1-95f5-d1db33749b4b.txt

Files (191 Bytes)

Name	Size	Download all
bib-b4dc114a-24bc-42c1-95f5-d1db33749b4b.txt md5:88bf7b454150430ecd05e01d30d03dea	191 Bytes	Preview Download

	All versions	This version
Views	57	57
Downloads	21	21
Data volume	4.0 kB	4.0 kB

Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake Expression Prediction

Files

bib-b4dc114a-24bc-42c1-95f5-d1db33749b4b.txt

Files (191 Bytes)

TÜBİTAK ULAKBİM

CONTACT

Relaxed Spatio-Temporal Deep Feature Aggregation for Real-Fake Expression Prediction

Creators

Description

Files

bib-b4dc114a-24bc-42c1-95f5-d1db33749b4b.txt

Files (191 Bytes)