Published January 1, 2023 | Version v1
Journal article Open

Learning actionness from action/background discrimination

  • 1. Princeton Univ, Comp Sci, Princeton, NJ 08544 USA
  • 2. Hacettepe Univ, Comp Engn, Ankara, Turkey

Description

Localizing actions in instructional web videos is difficult because many background scenes are unrelated to the task described in the video. Separating backgrounds from actions could reduce incorrect predictions of action step labels. Yet discriminating actions from backgrounds is challenging because the same activity can be performed in various styles. In this study, we aim to improve action localization results by learning the actionness of video clips, that is, the likelihood that a clip contains an action. We present a method that learns an actionness score for each video clip, which is then used to post-process the baseline clip-to-step-label assignment scores. We further propose an auxiliary representation, formed from the baseline video-to-step-label assignment scores, to reinforce the discrimination of video clips. Experiments on the CrossTask and COIN datasets show that our actionness score improves the performance of both action step localization and action segmentation.
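The description does not state how the actionness score is combined with the baseline assignment scores. As a rough illustration only, the sketch below assumes the simplest possible post-processing: each clip's baseline step scores are multiplied by that clip's actionness, and one clip is then selected per step. The function name `localize_steps`, the multiplicative weighting, and the one-clip-per-step selection are all assumptions made for this example, not the authors' method.

```python
from typing import List


def localize_steps(
    step_scores: List[List[float]],
    actionness: List[float],
) -> List[int]:
    """Pick one clip index per step after actionness re-weighting.

    step_scores[t][k]: baseline score of clip t for step k (hypothetical layout).
    actionness[t]: assumed to lie in [0, 1]; higher means "more likely an action".
    """
    if not step_scores:
        raise ValueError("at least one clip is required")
    if len(step_scores) != len(actionness):
        raise ValueError("one actionness value is needed per clip")

    num_steps = len(step_scores[0])
    picks: List[int] = []
    for k in range(num_steps):
        # Assumed post-processing: down-weight clips that look like background.
        weighted = [a * row[k] for row, a in zip(step_scores, actionness)]
        # Select the clip with the highest re-weighted score for step k.
        picks.append(max(range(len(weighted)), key=weighted.__getitem__))
    return picks


# Toy input: clip 0 scores highly for step 0 but has low actionness.
baseline = [[0.9, 0.1], [0.6, 0.2], [0.1, 0.7]]
without_actionness = localize_steps(baseline, [1.0, 1.0, 1.0])
with_actionness = localize_steps(baseline, [0.2, 0.9, 0.8])
```

In this toy input, a uniform actionness of 1.0 leaves the baseline choice unchanged (clip 0 for step 0), while the low actionness of clip 0 moves the step 0 prediction to clip 1. That is the kind of background suppression the description refers to, under the assumptions stated above.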
