The METEOR Automatic Machine
Translation Evaluation System

Alon Lavie
Abhaya Agarwal
Michael Denkowski

Carnegie Mellon University
Pittsburgh, PA, USA

Download METEOR

Please send any questions and bug reports to Michael Denkowski at mdenkows (at) cs (dot) cmu (dot) edu.

News


About METEOR

METEOR is a system that automatically evaluates the output of machine translation engines by comparing to them to one or more reference translations. For a given pair of a hypothesis and reference strings, the evaluation proceeds in a sequence of stages, with different criteria being used at each stage to find and score unigram matches. By default, in the first stage all exact matches are detected between the two strings. In the second stage, all stem matches are detected using the Snowball stemmers. In the third stage, all synonym matches are detected using data extracted from the WordNet 3 database. As of version 1.0, an optional fourth stage detects single word matches according to a paraphrase database.

The latest version of the system is written in pure Java with a full API to allow easy incorporation of METEOR scoring into existing systems. The sentence aligner can function independently of the scorer and thus be used in other systems that require monolingual sentence alignment. As of version 1.0, METEOR also includes a trainer which can be used to retune the metric's parameters to new data.

METEOR supports the SGML input file format is used by Bleu and NIST's Machine Translation Evaluation system. Thus all translation data that can be evaluated using Bleu (such as the TIDES data) can also be directly evaluated using METEOR. METEOR also supports a much faster plaintext mode.

References


This page last modified on: October 5, 2009