Cross-task weakly supervised learning from instructional videos

Zhukov, Dimitri; Alayrac, Jean-Baptiste; Cinbis, Ramazan Gokberk; Fouhey, David; Laptev, Ivan; Sivic, Josef

doi:10.1109/CVPR.2019.00365

Published January 1, 2019 | Version v1

Conference paper Open

Cross-task weakly supervised learning from instructional videos

1. Middle East Tech Univ, Ankara, Turkey
2. Univ Michigan, Ann Arbor, MI 48109 USA

In this paper we investigate learning visual models for the steps of ordinary tasks using weak supervision via instructional narrations and an ordered list of steps instead of strong supervision via temporal annotations. At the heart of our approach is the observation that weakly supervised learning may be easier if a model shares components while learning different steps: "pour egg" should be trained jointly with other tasks involving "pour" and "egg". We formalize this in a component model for recognizing steps and a weakly supervised learning framework that can learn this model under temporal constraints from narration and the list of steps. Past data does not permit systematic studying of sharing and so we also gather a new dataset, CrossTask, aimed at assessing cross-task sharing. Our experiments demonstrate that sharing across tasks improves performance, especially when done at the component level and that our component model can parse previously unseen tasks by virtue of its compositionality.

Files

bib-be5907f2-3fef-488c-bea0-d28de4fcca8a.txt

Files (224 Bytes)

Name	Size	Download all
bib-be5907f2-3fef-488c-bea0-d28de4fcca8a.txt md5:a3a5fca0d1b5771a6a220c90c57d7371	224 Bytes	Preview Download

	All versions	This version
Views	129	129
Downloads	10	10
Data volume	2.2 kB	2.2 kB

Cross-task weakly supervised learning from instructional videos

Files

bib-be5907f2-3fef-488c-bea0-d28de4fcca8a.txt

Files (224 Bytes)

TÜBİTAK ULAKBİM

CONTACT

Cross-task weakly supervised learning from instructional videos

Creators

Description

Files

bib-be5907f2-3fef-488c-bea0-d28de4fcca8a.txt

Files (224 Bytes)