Publications

Advanced search

Abstract

Joan-Andreu Sánchez, Alejandro H. Toselli, Verónica Romero, Enrique Vidal. ICDAR 2015 Competition HTRtS: Handwritten Text Recognition on the tranScriptorium Dataset. 13th International Conference on Document Analysis and Recognition (ICDAR), 2017. pp. 1166-1170. Zenodo.

This dataset comprises the dataset used for the ICDAR 2015 Competition on Handwritten Text Recognition on the tranScriptorium Dataset. The handwritten images for this contest were drawn from the English “Bentham collection” dataset used in the TRAN SCRIPTORIUM project. The selected data has been written by several hands and entails significant variabilities and difficulties regarding the quality of text images, writing styles and crossed-out text. This contest is clearly more difficult than the the first edition both for training and for testing. A portion of the training dataset and the full test dataset were provided in the form of carefully segmented line images, along with the corresponding transcripts. Another portion of the training dataset was provided as raw images and their corresponding transcripts at region level.