Publications

Advanced search

Abstract

Carlos D. Martínez-Hinarejos. Automatic Annotation of Dialogues Using n-grams. Proceedings of the Ninth International Conference on Text, Speech and Dialogue---TSD 2006, 2006. Petr Sojka, Ivan Kopeček, Karel Pala (Editors). pp. 653-660. Springer-Verlag.

The development of a dialogue system for any task implies the acquisition of a dialogue corpus in order to study the structure of the dialogues used in that task. This structure is reflected in the dialogue system behaviour, which can be rule-based or corpus-based. In the case of corpus-based dialogue systems, the behaviour is defined by statistical models which are inferred from an annotated corpus of dialogues. This annotation task is usually difficult and expensive, and therefore, automatic dialogue annotation tools are necessary to reduce the annotation effort. An automatic dialogue labeller technique that is based on $n$-grams is presented in this work. Its different variants are evaluated with respect to manual human annotations of a dialogue corpus devoted to train queries.