Greg Hanneman

General Contact:
Language Technologies Institute
Carnegie Mellon University
Newell-Simon Hall 4502
5000 Forbes Avenue
Pittsburgh, PA 15213

Personal Contact:
Office: Newell-Simon Hall 4622
Phone: (412) 268-6748
E-mail: ghannema [at] cs [dot] cmu [dot] edu

Current Work

I came to Carnegie Mellon in the fall of 2005, got my master's degree in the spring of 2007, and am currently a third-year graduate student (first-year Ph.D. student) at the LTI. My advisor is Dr. Alon Lavie.

Multi-Engine Machine Translation (MEMT)

Given translations of a common input produced by independent machine translation systems, the MEMT engine produces a new synthetic translation by combining elements of the individual outputs. The original translations are first aligned to each other in a word-matching phase; after that, synthetic combinations of words are produced by a decoder and evaluated against a language model.

The MEMT algorithm is explained in more detail in Jayaraman and Lavie, "Multi-Engine Machine Translation Guided by Explicit Word Matching," Proceedings of the 10th Annual Conference of the European Association for Machine Translation, 2005.

I gave a talk called "When Does Multi-Engine Machine Translation Work and Why?" at the LTI's Student Research Symposium in October 2006.

French–English MT

I'm also working on a French-to-English translation system that grew out of a semester project for the LTI's machine translation class. The system makes use of a large parallel corpus, statistical word alignment, and part-of-speech tagging to automatically build a large translation lexicon. The lexicon is supplemented with a small number of hand-developed entries for closed-class words, as well as a hand-written transfer grammar. Translation is carried out using the lexicon and grammar by the AVENUE project's transfer framework, which constructs a lattice of possible translations for a given input and selects a final output hypothesis after a decoding process.

More details are available in my final report for the class project.

Teaching

I was the Fall 2007 TA for Algorithms for Natural Language Processing (11-711). You can visit the class website or the detailed schedule.

Courses

Fall 2007:
     None.
Spring 2008:
     11-734: Advanced MT Seminar
     76-857: Historical Linguistics
Fall 2006:
     11-762: Language and Statistics II
     11-791: Software Engineering
     82-303: French Culture
Spring 2007:
     11-731: Machine Translation
     11-792: Software Engineering II
Fall 2005:
     11-711: Algorithms for Natural Language Processing
     11-721: Grammars and Lexicons
Spring 2006:
     11-712: Self-Directed NLP Lab
     11-722: Grammar Formalisms
     11-761: Language and Statistics

Past Life

In May 2005 I received a B.S. degree in computer science with a minor in French from Case Western Reserve University in Cleveland, Ohio. I spent all eight semesters at Case as a staff member of The Observer student newspaper, where I was a reporter, layout editor, copy editor, and news editor. I'm originally from the Cleveland suburbs.

Outside Interests

I work on the copy staff of The Tartan, CMU's student newspaper, where I also write occasional articles for the news section. Aside from a certain neurotic obsession with writing and editing that I picked up from my newspaper work, I first got interested in language by studying French in high school and as an undergraduate. I also enjoy picking out speech variations and "accents" from different parts of the U.S. When I'm not thinking about things academic, I try to make time for photography (and developing my own black-and-white prints), reading, camping, cooking with friends, and indoor rock climbing. I've also recently started learning Thai.


Page created 26 August 2005; last updated 11 March 2008.