Publications

Advanced search

Abstract

Vicent Alabau, Carlos D. Martínez-Hinarejos. Bilingual Speech Recognition in Two Phonetically Similar Languages. IV Jornadas en Tecnologia del Habla, 2006. pp. 197-202.

As Speech Recognition Systems improve, they become suitable for facing new problems. Multilingual speech recognition is one of such problems. In the present work, the case of the Comunitat Valenciana multilingual environment is studied. The official languages in the Comunitat Valenciana (Spanish and Valencian) share most of their acoustic units, and their vocabularies and syntax are quite similar. They have influenced each other for many years. A small corpus on an Information System task was developed for experimentation purposes. This choice will make it possible to develop a working prototype in the future, and it is simple enough to build semi-automatic language models. The design of the acoustic corpus is discussed, showing that all combinations of accents have been studied (native, non-native speakers, male, female, etc.). In addition, some experiments have been conducted with this corpus that show promising results for a Spanish-Valencian multilingual speech recognizer.