Alexander I. Rudnicky
I'm a Research Professor in the Computer Science Department in the School of Computer Science at Carnegie Mellon University. I'm a part of the Carnegie Mellon Speech Group, one of the oldest such groups in the country, founded by Raj Reddy. I'm also the Director of the Carnegie Mellon Speech Consortium, a partnership between Carnegie Mellon University and industry. I am part of the faculty in the Language Technologies Institute and, actually, most of my students are based there.
My current interests center on language-based communication between humans and robots and on aspects of core speech recognition, such as out-of-vocabulary (OOV) word processing. I am also interested in approaches to learning based on implicit supervision and on improvement of speech system knowledge through dialog. Check this list of publications.
Over time my research interests have revolved around speech perception and recognition, speech interfaces, spoken dialog systems and language in general. I headed the SpeechWear project, which produced an early mobile speech system; I'm still interested in the topic of mobile speech. I also headed the Communicator project, which dealt with spoken dialog system architectures. Olympus, a successor system, is available as open source. The RavenClaw dialog manager came out of the Communicator work and is the current foundation for many of the systems that we create. RavenClaw incorporates dialog management ideas that were first introduced in the Office Manager (OM) system and further developed in successor systems, including Scheduler and AGENDA. Papers listed on the publications page describe this work. Some videos from our group.
In 1996 I implemented a set of web-based tools that are used to generate a knowledge base for the open source Sphinx recognition system, including a language model and a pronouncing dictionary. It's proved to be quite popular, and I've continued to maintain the tool and expand its capabilities. Do try it out if you are building a recognition system (the model formats work for any ARPA-compliant system). We continue to maintain RavenClaw and the Olympus dialog system toolkit in open source.
If you'd like to know more about me, you can read either a short or a long biography. You can also find me at LinkedIn and on Plaxo; these sites will have somewhat different information about me. I'm involved in several organizations focused on spoken language interaction (SIGdial) and speech technologies (AVIOS).
Papers covering our work in the past year or so; what we're doing right now. Check the publications page for a complete list.
Chen, YN & Rudnicky, AI Dynamically Supporting Unexplored Domains in Conversational Interactions by Enriching Semantics with Neural Word Embeddings Proceedings of SLT, December 2014, Lake Tahoe, NV.
Chen, YN, Wang, WY & Rudnicky, AI Leveraging Frame Semantics and Distributional Semantics for Unsupervised Semantic Slot Induction in Spoken Dialogue Systems Proceedings of SLT, December 2014, Lake Tahoe, NV.
Pappu, A. & Rudnicky, A.I. Learning Situated Knowledge Bases through Dialog. Proceedings of Interspeech, September 2014, Singapore.
Chiu, J., Wang, Y., Trmal, J., Povey, D., Chen, G., Rudnicky, A. Combination of FST and CN Search in Spoken Term Detection Proceedings of Interspeech, September 2014, Singapore.
Qin, L. & Rudnicky, AI Building a vocabulary self-learning speech recognition system, Proceedings of Interspeech, September 2014, Singapore.
Pappu, A. & Rudnicky, A.I. Knowledge Acquisition Strategies for Goal-Oriented Dialog Systems. Proceedings of SIGDIAL, June 2014, Philadelphia, PA.
Chen, YN & Rudnicky, AI Two-Stage Stochastic Natural Language Generation for Email Synthesis by Modeling Sender Style and Topic Structure, Proceedings of the 8th Int'l Natural Language Generation Conference (INLG), June 2014, Philadelphia, PA.
Pappu, A., Sun, M., Sridharan, S. & Rudnicky, A.I. Conversational Strategies for Robustly Managing Dialog in Public Spaces Proceedings of EACL Dialog in Motion Workshop, 2014, Gothenburg, Sweden.
Smailagic, A., D. Siewiorek, A. Rudnicky, S. N. Chakravarthula, A. Kar, N. Jagdale, S. Gautam, R. Vijayaraghavan, S. Jagtap: Emotion Recognition Modulating the Behavior of Intelligent Systems. Int'l Symp on Multimedia, 2013: 378-383, Anaheim, CA.
Gandhe, A., L. Qin, F. Metze, A. Rudnicky, I. Lane, M. Eck Using Web Text to Improve Keyword Spotting in Speech, Proceedings of ASRU, 2013, Olomouc, CZ.
Qin, L. & A. Rudnicky Learning Better Lexical Properties for Recurrent OOV Words, Proceedings of ASRU, 2013, Olomouc, CZ.
Y-N Chen, W. Y. Wang, A. I. Rudnicky Unsupervised Induction and Filling of Semantic Slots for Spoken Dialogue Systems Using Frame-Semantic Parsing, Proceedings of ASRU 2013, Olomouc, CZ.
Pappu, A. & Rudnicky, A. Predicting Tasks in Goal-Oriented Spoken Dialog Systems using Semantic Knowledge Bases, In Proceedings of the SIGDIAL 2013 Conference, 2013, Metz, France.
Chiu, J. & A.I. Rudnicky Using Conversational Word Bursts in Spoken Term Detection, Proc. of Interspeech, 2013, Lyon, France.
Qin, L. & A Rudnicky Finding Recurrent Out-of-Vocabulary Words. Proc. of Interspeech, 2013, Lyon, France.
Teodoro, G., Martin, N., Keshner, E., Shi, J. Y. and Rudnicky, A. Virtual clinicians for the treatment of aphasia and speech disorders Proceedings of ICVR, 2013, Philadelphia, PA US, 158-159.
If you need to reach me, E-mail is usually a good bet, particularly when I'm away on travel, since I do check my mail regularly.
My office is room 7711 in the Gates-Hillman Center (10a on the map). Or you can reach me in one of the following ways:
Prof. Alexander Rudnicky
School of Computer Science
Carnegie Mellon University
5000 Forbes Avenue
Pittsburgh, PA 15213