Publications

Advanced search

Abstract

Vicente Bosch, Alejandro H. Toselli, Enrique Vidal. Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents. Language Technology for Cultural Heritage, Social Sciences, and Humanities (2012), 2012. pp. 107-111.

Document layout analysis is an important task needed for handwritten text recognition among other applications. Text layout commonly found in handwritten legacy documents is in the form of one or more paragraphs composed of parallel text lines. An approach for handwritten text line detection is presented which uses machine-learning techniques and methods widely used in natural language processing. It is shown that text line detection can be accurately solved using a formal methodology, as opposed to most of the proposed heuristic approaches found in the literature. Experimental results show the impact of using increasingly constrained ”vertical layout language models” in text line detection accuracy.