Rule Refinement end-to-end process: see avenue:/usr0/aria/eng2spa/required-refinements for potential tests of the RR module 1. Run Xfer engine on SL sentences that need to be evaluated/corrected in avenue:/usr0/aria/eng2spa transfer -if init-tctool.txt [aria@avenue eng2spa]$ transfer -if init-tctool.txt Turning on Latin-1 mode Setting normalizecase to UPPER-CASE Loading rule file /usr0/aria/eng2spa/grammars/simulation-grammar.trf with 13 rules added Loading rule file /usr0/aria/eng2spa/lexicons/simulation-lexicon.trf with 27 rules added Setting find all translations to ON Setting output source text to ON Setting showtrace to full trace with src indices Translating file: /usr0/aria/eng2spa/corpus/error-typology-simulation 1: Translating "I see the red car". 2: Translating "I see the red unicorn". 3: Translating "Mary plays the guitar". 4: Translating "John and Mary fell". 5: Translating "you saw the woman". 6: Translating "I see the red car". 7: Translating "they see water ". 8: Translating "Gaudi was a great artist". TOT 35 LEX 0 GRA 0 UNK 2 MORPH 0 COMP 0 COV 0.942857 UNKNOWNS UNICORN 2 2. Convert output into the input format expected by the TCTool: /usr0/aria/bin/postprocess-xfer.out.debug.pl (extracts alignment information) ./postprocess-xfer.out.debug.pl ../corpus/error-typology-simulation.out.debug > ! ../corpus/error-typology-simulation.tctoolIN 1/13/06: modified to also include parse trace, i.e. tree sl: I see the red car tl: veo el auto roja al: ((2,1),(3,2),(5,3),(4,4)) tree: tree: <((S,0 (VP,2 (VP,1 (V,1:2 "VEO") ) (NP,8 (DET,1:3 "EL") (N,1:5 "AUTO ") (ADJ,1:4 "ROJA") ) ) ) )> tl: veo la auto roja al: ((2,1),(3,2),(5,3),(4,4)) tree: tree: <((S,0 (VP,2 (VP,1 (V,1:2 "VEO") ) (NP,8 (DET,2:3 "LA") (N,1:5 "AUTO ") (ADJ,1:4 "ROJA") ) ) ) )> ... sl: I see the red unicorn tl: veo el unicorn roja al: ((2,1),(3,2),(5,3),(4,4)) tree: tree: <((S,0 (VP,2 (VP,1 (V,1:2 "VEO") ) (NP,8 (DET,1:3 "EL") (N,9:5 "UNIC ORN") (ADJ,1:4 "ROJA") ) ) ) )> ... -> Need to make sure it passes all the TL sentences and its parses to the TCTool! !!!!!!! Currently intro-test.cgi picks the first 5 tl as stored in %$sl (intro(-test).cgi), which means that we need to apply the Spanish LM to input-tct before it's given to the TCTool, namely have the LM choose the 5 best translations form input-tct and pass only those on to the TCTool: $c = 0; $sl .= "|"; foreach $tl (keys(%$sl)){ if ($tl ne "con") { if ($c == 5) { last; } # displays 5 first translations only $c++; $al = $$sl{$tl}; $tlname = "tl" . "$c"; $tlnameal = $tlname . "-al"; $html .= "
$tl