How Shall We Evaluate Egocentric Action Recognition?

Antonino Furnari, Sebastiano Battiato, Giovanni Maria Farinella; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 2373-2382

Abstract


Egocentric action analysis methods often assume that input videos are trimmed and hence they tend to focus on action classification rather than recognition. Consequently, adopted evaluation schemes are often unable to assess important properties of the desired action video segmentation output, which are deemed to be meaningful in real scenarios (e.g., oversegmentation and boundary localization precision). To overcome the limits of current evaluation methodologies, we propose a set of measures aimed to quantitatively and qualitatively assess the performance of egocentric action recognition methods. To improve exploitability of current action classification methods in the recognition scenario, we investigate how frame-wise predictions can be turned into action-based temporal video segmentations. Experiments on both synthetic and real data show that the proposed set of measures can help to improve evaluation and to drive the design of egocentric action recognition methods.

Related Material


[pdf]
[bibtex]
@InProceedings{Furnari_2017_ICCV,
author = {Furnari, Antonino and Battiato, Sebastiano and Maria Farinella, Giovanni},
title = {How Shall We Evaluate Egocentric Action Recognition?},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {Oct},
year = {2017}
}