
Journal of Artificial Intelligence Research 18 (2003), pp. 117-147. Submitted 7/02; published 2/03.
© 2001 AI Access Foundation and Morgan Kaufmann Publishers. All rights reserved.

Translation of Pronominal Anaphora between English and Spanish: Discrepancies and Evaluation

Jesús Peral
Dept. Lenguajes y Sistemas Informáticos
University of Alicante
Alicante, SPAIN
Antonio Ferrández
Dept. Lenguajes y Sistemas Informáticos
University of Alicante
Alicante, SPAIN
jperal@dlsi.ua.es  antonio@dlsi.ua.es

Abstract:

This paper evaluates the different tasks carried out in the translation of pronominal anaphora in a machine translation (MT) system. The interlingua-based MT approach, named AGIR (Anaphora Generation with an Interlingua Representation), improves upon other proposals presented to date because it can translate intersentential anaphors, detect co-reference chains, and translate Spanish zero pronouns into English, issues hardly considered by other systems. The paper presents the resolution and evaluation of these anaphora problems in AGIR using different kinds of knowledge (lexical, morphological, syntactic, and semantic). The translation of English and Spanish anaphoric third-person personal pronouns (including Spanish zero pronouns) into the target language has been evaluated on unrestricted corpora. We obtained precisions of 80.4% and 84.8% in the translation of Spanish and English pronouns, respectively. Although we have studied only Spanish and English, our approach can be easily extended to other languages such as Portuguese, Italian, or Japanese.




Jesus Peral 2002-12-13