Handwritten Word Spotting with Corrected Attributes

Jon Almazan, Albert Gordo, Alicia Fornes, Ernest Valveny; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013, pp. 1017-1024


We propose an approach to multi-writer word spotting, where the goal is to find a query word in a dataset of document images. The approach is based on attributes and leads to a low-dimensional, fixed-length representation of word images that is fast to compute and, especially, fast to compare. It naturally yields a unified representation of word images and strings, so that query-by-example, where the query is an image, and query-by-string, where the query is a string, can be performed interchangeably. We also propose a calibration scheme based on Canonical Correlation Analysis that corrects the attribute scores and greatly improves the results on a challenging dataset. We test our approach on two public datasets, showing state-of-the-art results.
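The idea of a shared, fixed-length representation calibrated with Canonical Correlation Analysis can be illustrated with a small sketch. This is not the authors' implementation, and everything in it is an assumption made for illustration: the attribute definition (whether a character occurs in a given region of the word), the helper names `string_attributes`, `fit_cca` and `embed`, the regularization value, and the simulated image-side attribute scores, which are simply noisy copies of the string attributes and stand in for the outputs of real attribute classifiers.

```python
import numpy as np

ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789"  # assumed character set


def string_attributes(word, levels=(1, 2, 3)):
    """Binary vector answering: does character c occur in region r of the word?

    Hypothetical attribute definition; a character is assigned to the region
    that contains its center (a simplification chosen for this sketch).
    """
    word = word.lower()
    n = len(word)
    blocks = []
    for level in levels:
        block = np.zeros((level, len(ALPHABET)))
        for i, ch in enumerate(word):
            j = ALPHABET.find(ch)
            if j < 0:
                continue  # skip characters outside the assumed alphabet
            center = (i + 0.5) / n
            region = min(int(center * level), level - 1)
            block[region, j] = 1.0
        blocks.append(block.ravel())
    return np.concatenate(blocks)


def _inv_sqrt(C):
    """Inverse square root of a symmetric positive-definite matrix."""
    w, V = np.linalg.eigh(C)
    return (V / np.sqrt(w)) @ V.T


def fit_cca(X, Y, dim, reg=1e-3):
    """Regularized CCA between image scores X and string attributes Y.

    Returns (mean, projection) pairs for the image side and the string side.
    """
    mx, my = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - mx, Y - my
    n = X.shape[0]
    Cxx = Xc.T @ Xc / n + reg * np.eye(X.shape[1])
    Cyy = Yc.T @ Yc / n + reg * np.eye(Y.shape[1])
    Cxy = Xc.T @ Yc / n
    Wx, Wy = _inv_sqrt(Cxx), _inv_sqrt(Cyy)
    U, _, Vt = np.linalg.svd(Wx @ Cxy @ Wy)
    return (mx, Wx @ U[:, :dim]), (my, Wy @ Vt.T[:, :dim])


def embed(Z, mean, proj):
    """Project into the common subspace and L2-normalize each row."""
    E = (Z - mean) @ proj
    return E / (np.linalg.norm(E, axis=1, keepdims=True) + 1e-12)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vocab = ["the", "word", "spotting", "query", "image", "string"]
    labels = np.repeat(np.arange(len(vocab)), 50)
    Y = np.stack([string_attributes(vocab[k]) for k in labels])
    # Simulated attribute scores for word images (assumption, see lead-in).
    X = Y + 0.1 * rng.standard_normal(Y.shape)

    img_model, str_model = fit_cca(X, Y, dim=4)
    database = embed(X, *img_model)

    # Query-by-string: embed the text and rank images by cosine similarity.
    query = embed(string_attributes("word")[None, :], *str_model)
    best = int(np.argmax(database @ query.ravel()))
    print("top match for 'word':", vocab[labels[best]])
```

Because both modalities end up as unit-length vectors in the same space, query-by-example uses the same ranking step: embed the query image's scores with `img_model` instead of `str_model` and take the dot product against the database.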

Related Material

author = {Almazan, Jon and Gordo, Albert and Fornes, Alicia and Valveny, Ernest},
title = {Handwritten Word Spotting with Corrected Attributes},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}