Feature Weighting via Optimal Thresholding for Video Analysis

Zhongwen Xu, Yi Yang, Ivor Tsang, Nicu Sebe, Alexander G. Hauptmann; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013, pp. 3440-3447

Abstract


Fusion of multiple features can boost the performance of large-scale visual classification and detection tasks like TRECVID Multimedia Event Detection (MED) competition [1]. In this paper, we propose a novel feature fusion approach, namely Feature Weighting via Optimal Thresholding (FWOT) to effectively fuse various features. FWOT learns the weights, thresholding and smoothing parameters in a joint framework to combine the decision values obtained from all the individual features and the early fusion. To the best of our knowledge, this is the first work to consider the weight and threshold factors of fusion problem simultaneously. Compared to state-of-the-art fusion algorithms, our approach achieves promising improvements on HMDB [8] action recognition dataset and CCV [5] video classification dataset. In addition, experiments on two TRECVID MED 2011 collections show that our approach outperforms the state-of-the-art fusion methods for complex event detection.

Related Material


[pdf]
[bibtex]
@InProceedings{Xu_2013_ICCV,
author = {Xu, Zhongwen and Yang, Yi and Tsang, Ivor and Sebe, Nicu and Hauptmann, Alexander G.},
title = {Feature Weighting via Optimal Thresholding for Video Analysis},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}
}