Long Qin

Office

GHC 6225

Carnegie Mellon University

5000 Forbes Avenue

Pittsburgh, PA 15213

lqin (at) cs (dot) cmu (dot) edu

long.qin (at) mmodal (dot) com

I’m currently a Software Engineer at Duolingo working on speech tasks in the Duolingo learning app and test center. Before that, I worked at M*Modal as a Research Scientist on improving speech recognition for medical transcription. I received my PhD and MS degrees from the Language Technologies Institute of Carnegie Mellon University under the supervision of Prof. Alex Rudnicky. I also received a MS and a BS degree from the University of Science and Technology of China.

CV [pdf]

Research

•Deep learning (DNN) in speech recognition
•Automatic Speech Assessment
•Voice Activity Detection (VAD)
•Out-of-vocabulary (OOV) word learning
•Discriminative acoustic modeling
•Speaker adaptive training (SAT)
•Unsupervised / semi-supervised lexicon learning
•Statistical parametric speech synthesis

Selected PublicationS

•PhD Dissertation: Learning out-of-vocabulary words in automatic speech recognition, Carnegie Mellon University. [document] [presentation]
•Building a vocabulary self-learning speech recognition system, Interspeech-2014. [pdf]
•Learning better lexical properties for recurrent OOV words, ASRU-2013. [pdf]
•Using web text to improve keyword spotting in speech, ASRU-2013. [pdf]
•Finding recurrent OOV words, Interspeech-2013. [pdf]
•OOV word detection using hybrid models with mixed types of fragments, Interspeech-2012. [pdf]
•System combination for out-of-vocabulary word detection, ICASSP-2012. [pdf]
•OOV detection and recovery using hybrid models with different fragments, Interspeech-2011. [pdf]
•The effect of lattice pruning on MMIE training, ICASSP-2010. [pdf]
•Implementing and improving MMIE training in SphinxTrain, CMU Sphinx Workshop 2010. [pdf]