Publicaciones

Advanced search

Abstract

Ismael García-Varea, Francisco Casacuberta. Maximum Entropy Modeling: A Suitable Framework to Learn Context-Dependent Lexicon Models for Statistical Machine Translation. Machine Learning, 2005. Vol. 60 pp. 135-158.

Current statistical machine translation systems are mainly based on statistical word lexicons. However, these models are usually context-independent, therefore, the disambiguation of the translation of a source word must be carried aout using other prababilistic distributions. One efficient way to add contextual information to the statistical lexicon is based on maximum entropy modeling. In that framework, the context is introduced through feature functions that allow us to automatically learn context-dependent lexicon models. In a first approach, maximum entropy modeling is carried out after a process pf learning standard statistical models. In a second approach, the maximum entropy modeling is integrated in the expectation-maximization process of learning standard statistical models. Experimetnal results were obtained for two well-kown tasks, the French-English Canadian Parliament HANSARDS task and the German-English VERBMOBIL task. These results proved that the use of maximum entropy models in both approaches can help to improve the performance of the statistical machine translation systems.