Dergi makalesi Açık Erişim
Akay, Simge; Arica, Nafiz
In this study, we develop a deep learning-based stacking scheme to detect facial action units (AU) in video data. Given a sequence of video frames, it combines multiple cues extracted from the AU detectors employing in frame, segment, and transition levels. Frame-based detector takes a single frame to determine the existence of AU by employing static face features. Segment-based detector examines various length of subsequences in the neighborhood of a frame to detect whether that frame is an element of an AU segment. Transition-based detector attempts to find the transitions from neutral faces containing no AUs to emotional faces or vice versa, by analyzing fixed size subsequences. The frame subsequences in segment and transition detectors are represented by motion history image, which models the temporal changes in faces. Each detector employs a separate convolutional neural network and, then their results are fed into a meta-classifier to learn the combining method. Combining multiple cues in different levels with a framework containing entirely deep networks improves the detection performance by both locating subtle AUs and tracking small changes in the facial muscles' movements. In performance analysis, it is shown that the proposed approach significantly outperforms the state of the art methods, when compared on CK+, DISFA, and BP4D databases.
Dosya adı | Boyutu | |
---|---|---|
bib-2fa4f63b-fca9-4aae-a67a-f8bbc89e4722.txt
md5:c4090415ae37ad3e2385d2fa28605e7e |
102 Bytes | İndir |
Görüntülenme | 30 |
İndirme | 8 |
Veri hacmi | 816 Bytes |
Tekil görüntülenme | 26 |
Tekil indirme | 8 |