Sentence-Based Image Description with Scalable, Explicit Models

Micah Hodosh, Julia Hockenmaier; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2013, pp. 294-300

Abstract


Associating photographs with complete sentences that describe what they depict is a challenging problem. This paper examines how an approach inspired by image tagging techniques, which can scale to very large data sets, performs on this much harder task, and discusses some of the linguistic difficulties that this bag-of-words model faces.

Related Material


[bibtex]
@InProceedings{Hodosh_2013_CVPR_Workshops,
author = {Hodosh, Micah and Hockenmaier, Julia},
title = {Sentence-Based Image Description with Scalable, Explicit Models},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2013}
}