Publications

Advanced search

Abstract

Daniel Ortiz-Martínez, Ismael García-Varea, Francisco Casacuberta, Jorge González, Antonio L. Lagarda. On the use of statistical machine translation techniques within a memory-based translation system (AMETRA). Ninth Machine Translation Summit, 2003. pp. 299-306. Association for Machine Translation in the Americas.

The goal of the AMETRA project is to make a computer-assisted translation tool from the Spanish language to the Basque language under the memory-based translation framework. The system is based on a large collection of bilingual word-segments. These segments are obtained using linguistic or statistical techniques from a Spanish-Basque bilingual corpus consisting of sentences extracted from the Basque Country's official government record. One of the tasks within the global information document of the AMETRA project is to study the combination of well-known statistical techniques for the translation of short sequences and techniques for memory-based translation. In this paper, we address the problem of constructing a statistical module to deal with the task of translating segments. The task undertaken in the AMETRA project is compared with other existing translation tasks, This study includes the results of some preliminary experiments we have carried out using well-known statistical machine translation tools and techniques.