Stephan Vogel

I am a Research Scientist in the

Language Technologies Institute, School of Computer Science, Carnegie Mellon University,

especially affiliated with the InterACT Lab.

 

Office:

407 South Craig Avenue

Pittsburgh, PA 15213

Tel: 412-268-4526

Email: vogel at cs dot cmu another dot and finally edu ;-)

 

Before coming to CMU, I worked from 1995 to 2000 as a research assistant at the Lehrstuhl IV (Prof. Hermann Ney), Technical University (RWTH) of Aachen.  I also spent a couple of months in the InterACT Lab (Prof. Alex Waibel) at the University of Karlsruhe.

In a previous academic life, I studied physics at the Philipps University Marburg, Germany, and earned an MPhil in History and Philosophy of Science from the University of Cambridge, England.


Research Interests:

My research interests focus on machine translation, i.e., developing techniques to automatically translate between different languages.  This includes all aspects of alignment models, word reordering models, language models, decoding algorithms, and system combination.  Applications include both text-to-text and speech-to-speech translation, and range from MT on hand-held devices to very large-scale data settings, where we use parallel processing under the Hadoop framework.


Research Projects:

Current:

Old:


Publications:

Recent Publications:

 

Nguyen Bach, Qin Gao, & Stephan Vogel: Improving word alignment with language model based confidence scores. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.151-154.

 

Matthias Eck, Stephan Vogel, & Alex Waibel: Communicating unknown words in machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp.

 

Sanjika Hewavitharana and Stephan Vogel:  Enhancing a Statistical Machine Translation System by using an Automatically Extracted Parallel Corpus from Comparable Sources, LREC 2008 Workshop on Comparable Corpora, Marrakech, Morocco, May 2008.

 

Almut Silja Hildebrand & Stephan Vogel: Combination of machine translation systems via hypothesis selection from combined n-best lists. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.254-261.

 

Almut Silja Hildebrand, Kay Rottmann, Mohamed Noamany, Qin Gao, Sanjika Hewavitharana, Nguyen Bach, & Stephan Vogel: Recent improvements in the CMU large scale Chinese-English SMT system. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 77-80.

 

Jan Niehues & Stephan Vogel: Discriminative word alignment via alignment matrix modelling. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.18-25.

 

Muntsin Kolss, Stephan Vogel, & Alex Waibel: Stream Decoding for Simultaneous Spoken Language Translation.  Proc. of Interspeech 2008, Brisbane, Australia, 22-26 September 2008.

 

Matthias Paulik, Sharath Rao, Ian Lane, Stephan Vogel and Tanja Schultz.  Sentence Segmentation and Punctuation Recovery for SLT.  ICASSP 2008, Las Vegas, April 2008.

 

Tim Schlippe, ThuyLinh Nguyen, & Stephan Vogel: Diacritization as a machine translation problem and as a sequence labeling problem. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.270-278.

 

Ashish Venugopal, Andreas Zollmann, Noah A. Smith, & Stephan Vogel: Wider pipelines: n-best alignments and parses in MT training. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.192-201.

 

Andreas Zollmann, Ashish Venugopal, & Stephan Vogel: The CMU syntax-augmented machine translation system: SAMT on Hadoop with n-best alignments.  IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 18-25.

 

 

See also Publications page.


Teaching:


Students:

Current PhD and Master's students at LTI, CMU:

 

Former LTI PhD and Master's students I advised or co-advised:

 

I regularly advise and co-advise students from the University of Karlsruhe.  Some of them have come to CMU as InterACT exchange students to work on a Studienarbeit (3-month research project) or Diplomarbeit (Master's thesis):

 


Links:


 

