Publications

Abstract

Mauricio Villegas, Joan Puigcerver, Alejandro H. Toselli, Joan-Andreu Sánchez, Enrique Vidal. Overview of the ImageCLEF 2016 Handwritten Scanned Document Retrieval Task. CLEF2016 Working Notes, 2016. pp. 233-253. CEUR-WS.org.

The ImageCLEF 2016 Handwritten Scanned Document Retrieval Task was the first edition of a challenge aimed at developing retrieval systems for handwritten documents. It introduced several novelties relative to other recent related evaluations: multiple-word queries, retrieval of local blocks of text, results that span the transition between consecutive pages, words broken between lines, words unseen in training, and queries with zero relevant results. Systems were evaluated on a dataset of manuscripts written by Jeremy Bentham, which has been left publicly available after the evaluation. Participation was lower than expected, with results received from four groups. Even so, the results were very interesting: one group obtained very good performance, handling relatively well both queries containing words not observed in the training data and the location of words broken between two lines.