A partial list of our theses and papers
- Robust Speech Group, Carnegie Mellon University
Ph.D Theses
- S.-J. Doh, Enhancements to Transformation-Based Speaker Adaptation: Principal Component and Inter-Class Maximum Likelihood Linear Regression, Ph.D Thesis, ECE Department, CMU, July, 2000.
- J. Huerta, Robust Speech Recognition in GSM Codec Environments, Ph.D Thesis, ECE Department, CMU, April, 2000.
- B. Raj, Reconstruction of Incomplete Spectrograms for Robust Speech Recognition, Ph.D Thesis, ECE Department, CMU, April, 2000.
- M. Siegler, Integration of Continuous Speech Recognition and Information Retrieval for Mutually Optimal Performance, Ph.D Thesis, ECE Department, CMU, December, 1999.
- E. Gouvea, Acoustic-Feature-Based Frequency Warping for Speaker Normalization, Ph.D Thesis, ECE Department, CMU, February, 1999.
- Thomas M. Sullivan, Multi-Microphone Correlation-Based Processing for Robust Automatic Speech Recognition (2.2MB), Ph.D Thesis, ECE Department, CMU, August 1996. (Compressed, 0.7MB) (Abstract)
- Pedro J. Moreno, Speech Recognition in Noisy Environments (1.3MB), Ph.D Thesis, ECE Department, CMU, May, 1996. (Compressed, 0.5MB) (Abstract)
- Fu-Hua Liu, Environmental Adaptation for Robust Speech Recognition (2.3MB), Ph.D Thesis, ECE Department, CMU, June, 1994. (Compressed, 0.6MB) (abstract)
- Y. Ohshima, Environmental Robustness in Speech Recognition using Physiologically-Motivated Signal Processing (2.1MB), Ph.D Thesis, ECE Department, CMU, December, 1993. (Compressed, 0.7MB) (abstract)
- W. A. Rozzi, Speaker Adaptation in Automatic Speech Recognition via Estimation of Correlated Mean Vectors (2MB), Ph.D Thesis, ECE Department, CMU, May, 1991. (Compressed, 0.6MB) (abstract)
- A. Acero, Acoustical and Environmental Robustness for Automatic Speech Recognition (.pdf, 1.4MB), Ph.D Thesis, ECE Department, CMU, September, 1990. (abstract)
MS Reports
- M.Seltzer, Automatic Detection of Corrupted Speech Features for Robust Speech Recognition, ECE Department, CMU, May, 2000.
- U. Jain, Connected Digit Recognition over Long Distance Telephone Linesusing the SPHINX-II System, Master's Report, ECE Department, CMU, May, 1995. (abstract)
- M. Siegler, Effects of Speech Rate on Speech Recognition Accuracy, Master's Report, ECE Department, CMU, December, 1995. (compress ps file) (abstract)
- P. J. Moreno, Speech Recognition in Telephone Environments, Master's Report, ECE Department, CMU, January, 1993. (abstract)
Papers
2000 (currently being revised and updated)
- S.-J. Doh and R. M. Stern, "Inter-class MLLR for speaker adaptation," Proc. of ICASSP 2000. (Poster)
- S.-J. Doh and R. M. Stern, Inter-Class MLLR for Speaker Adaptation, Proc. IEEE Conf. on Acoustics, Speech, and Sig. Proc., June, 2000, Istanbul, Turkey.
- R. Singh, B. Raj, and R. M. Stern, Automatic Generation of Phone Sets and Lexical Transcriptions, Proc. IEEE Conf. on Acoustics, Speech, and Sig. Proc., June, 2000, Istanbul, Turkey.
1999
- Sam-Joo Doh and Richard M. Stern, "Weighted principal component MLLR for speeaker adaptation," Proc. of Automatic Speech Recognition and Understanding Workshop (ASRU 99), Colorado, USA, 1999. (Poster)
- R. Singh, B. Raj and R.M. Stern, "Automatic Clustering And Generation of Contextual Questions For Tied States In Hidden Markov Models," Proc. of the ICASSP., Phoenix, Arizona, March, 1999.
- J. M. Huerta and R. M. Stern, "Distortion-Class Weighted Acoustic Modeling for Robust Recognition under GSM RPE-LTP Coding", Proc. of the International Symposium on Robust Speech Recognition, Tampere, Finland, June, 1999.
- R. Singh, B. Raj, and R. M. Stern, Domain Adduced State Tying for Cross-domain Acoustic Modelling, Proc. Eurospeech-99, September, 1999, Budapest, Hungary.
1998
- P.J. Moreno, B. Raj, and R. M. Stern. Data-Driven Environmental Compensation for Speech Recognition: A Unified Approach, Speech Communication , 24: 267-85, 1998.
- J.M. Huerta and R.M. Stern, "Speech Recognition From GSM Codec Parameters," Proc. of the International Conference on Spoken Language Processing, Sydney, Australia, November, 1998.
- B. Raj, R. Singh, and R. M. Stern, "Inference of Missing Spectrographic Features for Robust Speech Recognition," Proc. of the International Conference on Spoken Language Processing, Sydney, Australia, November, 1998.
1997
- R. M. Stern, B. Raj, and P. J. Moreno, (1997). Compensation for Environmental Degradation in Automatic Speech Recognition, Proc. of the ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, April, 1997, Pont-au-Mousson, France, pp. 33-42.
- M.A. Siegler, U. jain, B. Raj, and R. M. Stern, "Automatic Segmentation, Classification and Clustering of Broadcast News Audio," Proc. of the Speech Recognition Workshop (DARPA), Chantilly, VA, Feb. 1997.
- R. M. Stern, B. Raj, and P. J. Moreno, "Compensation for Environmental Degradation in Automatic Speech Recognition," Proc. of the ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, pp. 33-42., Pont-au-Mousson, France, April, 1997.
- E. Gouvêa, and R. M. Stern, "Speaker Normalization Through Formant-Based Warping Of The Frequency Scale," Proc. of the EUROSPEECH, 1997. (PDF)
- B. Raj, E. Gouvêa, and R. M. Stern, "Vector Polynomial Approximations For Robust Speech Recognition," Proc. of the ETRW, 1997.
- B. Raj, E. Gouvea, and R. M. Stern, Cepstral Compensation using Statistical Linearization, Proc. of the ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-au-Mousson, France, April, 1997.
- B. Raj, V. N. Parikh, and R. M. Stern, "The Effects Of Background Music On Speech Recognition Accuracy," Proc. of the ICASSP, Munich, Germany, April 1997.
- J. M. Huerta and R. M. Stern, Compensation for Environmental and Speaker Variability by Normalization of Pole Locations, Proc. Eurospeech-97, September, 1997, Rhodes, Greece.
1996
- R. M. Stern, A. Acero, F.-H. Liu, and Y. Ohshima, Signal Processing for Robust Speech Recognition, Chapter in Speech Recognition, pp. 351-378, C.-H. Lee and F. Soong, Eds., Boston: Kluwer Academic Publishers, 1996.
- P. J. Moreno, B. Raj, and R. M. Stern, "A Vector Taylor Series Approach For Environment-Independent Speech Recognition," Proc. of the ICASSP, Atlanta, GA, May 1996.
- B. Raj, E. Gouvêa, P. J. Moreno, and R. M. Stern, "Cepstral Compensation By Polynomial Approximation For Environment-Independent Speech Recognition," Proc. of the ICSLP, Philadelphia, PA, Oct. 1996.
- R. M. Stern, A. Acero, F.-H. Liu, and Y. Ohshima, Signal Processing for Robust Speech Recognition, Invited chapter in Speech Recognition, pp. 351-378, C.-H. Lee and F. Soong, Eds., Boston: Kluwer Academic Publishers, 1996.
1995
- P. J. Moreno, B. Raj, E. Gouvêa, and R. M. Stern, "Multivariate-Gaussian-Based Cepstral Normalization for Robust Speech Recognition," Proc. of the ICASSP, Detroit, Michigan, 1995.
- M. A. Siegler, and R. M. Stern, "On the Effects of Speech Rate in Large Vocabulary Speech Recgonition Systems," Proc. of the ICASSP, Detroit, Michigan, 1995.
- MORENO, P. J., RAJ, B., and STERN, R. M. (1995). A Unified Approach to Robust Speech Recognition, Proc. of Eurospeech-95, Madrid, Spain, September, 1995.
- P. J. Moreno, M. A. Siegler, U. Jain, and R. M. Stern, "Continuous Speech Recognition of Large Vocabulary Telephone Quality Speech," Proc. of the Eighth Spoken Language Systems Technology Workshop, 1995.
- P. J. Moreno, U. Jain, B. Raj, and R. M. Stern, "Approaches to Microphone Independence in Automatic Speech Recognition," Proc. of the Eigth Spoken Language Systems Technology Workshop, 1995.
- P. J. Moreno, B. Raj, and R. M. Stern, "Approaches to Environment Compensation in Automatic Speech Recognition," Proc. 15th International Conference on Acoustics, Trondheim, Norway, Vol. III, pp. 109-112, June, 1995.
- Stern, R. M. and Sullivan, T. M. Robust Speech Recognition Based on Human Binaural Perception, Proc. of the ATR workshop on A Biological Framework for Speech Perception and Production, Kansai Science City, September, 1994, Reprinted as ATR Technical Report TR-H-121, (1995).
1994
- F.-H. Liu, R. M. Stern, A. Acero, and P. J. Moreno, "Environment Normalization for Robust Speech Recognition using Direct Cepstral Comparison," Proc. of the ICASSP, Adelaide, Australia, 1994.
- P. J. Moreno, and R. M. Stern, "Sources of Degradation of Speech Recognition in the Telephone Network," Proc. of the ICASSP, Adelaide, Australia, 1994.
- F.-H. Liu, P. J. Moreno, R. M. Stern, and A. Acero, "Signal Processing For Robust Speech Recognition," Proc. of the Spoken Language Technology Workshop, March, 1994.
- N. Hanai, and R. M. Stern, "Robust Speech Recognition in the Automobile," Proc. of the International Conference on Spoken Language Processing, Yokohama, Japan, September, 1994.
- OHSHIMA, Y., and STERN, R. M. (1994). Environmental Robustness in Automatic Speech Recognition Using Physiologically-Motivated Signal Processing, Proc. of the International Conference on Spoken Language Processing, Yokohama, Japan, September, 1994.
- LIU, F.-H., MORENO, P. J., STERN, R. M., and ACERO, A. (1994). Signal Processing For Robust Speech Recognition, Proceedings of the Seventh ARPA Workshop on Human Language Technology, Princeton, New Jersey, Morgan Kaufmann, C. J. Weinstein, Ed.
- LIU, F.-H., MORENO, P. J., STERN, R. M., and ACERO, A. (1994). Signal Processing For Robust Speech Recognition, Proceedings of the ARPA Workshop on Spoken Language Technology, Princeton, New Jersey, R. M. Stern, Ed.
1993
- T. M. Sullivan and R. M. Stern, "Multi-Microphone Correlation-Based Processing for Robust Speech Recognition," Proc. of the ICASSP, Minneapolis, Minnesota, April, 1993.
- F.-H. Liu, R. M. Stern, X. Huang, and A. Acero, "Efficient Cepstral Normalization For Robust Speech Recognition," Proc. of the Sixth ARPA Workshop on Human Language Technology, Princeton, NJ, Morgan Kaufmann, March, 1993.
1992
- R. M. Stern, F.-H. Liu, Y. Ohshima, T. M. Sullivan, and A. Acero, "Multiple Approaches to Robust Speech Recognition," Proc. of the Fifth DARPA Speech and Natural Language Workshop, Harriman, New York, February, 1992.
- F.-H. Liu, A. Acero, and R. M. Stern, "Efficient Joint Compensation of Speech for the Effects of Additive Noise and Linear Filtering," Proc. of the ICASSP, San Francisco, CA, March, 1992.
- R. M. Stern, F.-H. Liu, Y. Ohshima, T. M. Sullivan, and A. Acero, "Multiple Approaches to Robust Speech Recognition," Proc. of the ICSLP, 1992.
1991
- A. Acero, and R. M. Stern, "Robust Speech Recognition by Normalization of the Acoustic Space," Proc. of the ICASSP, Toronto, Ontario, 1991.
- ROZZI, W. A. and STERN, R. M. (1991). Fast Estimation of Mean Vectors using Adaptive Filtering, Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Toronto, Ontario, pp. 865-868.
1990
- A. Acero, and R. M. Stern, "Environmental Robustness in Automatic Speech Recognition," Proc. of the ICASSP, Albuquerque, New Mexico, 1990.
- ACERO, A. and STERN, R. M. (1990b). Toward Microphone-Independent Spoken Language Systems, Proceedings of the DARPA Speech and Natural Language Workshop , Hidden Valley, PA, R. M. Stern , Ed., Morgan Kaufmann Publishers, Inc., San Mateo, CA,
- ACERO, A. and STERN, R. M. (1990c). Acoustical Pre-Processing for Robust Spoken Language Systems, Proc. First International Conference on Spoken Language Processing, pp. 1121-1124, Kobe, Japan, November, 1990.
Ph.D Theses
- S.-J. Doh, Enhancements to Transformation-Based Speaker Adaptation: Principal Component and Inter-Class Maximum Likelihood Linear Regression, Ph.D Thesis, ECE Department, CMU, July, 2000.
- J. Huerta, Robust Speech Recognition in GSM Codec Environments, Ph.D Thesis, ECE Department, CMU, April, 2000.
- B. Raj, Reconstruction of Incomplete Spectrograms for Robust Speech Recognition, Ph.D Thesis, ECE Department, CMU, April, 2000.
- M. Siegler, Integration of Continuous Speech Recognition and Information Retrieval for Mutually Optimal Performance, Ph.D Thesis, ECE Department, CMU, December, 1999.
- E. Gouvea, Acoustic-Feature-Based Frequency Warping for Speaker Normalization, Ph.D Thesis, ECE Department, CMU, February, 1999.
- Thomas M. Sullivan, Multi-Microphone Correlation-Based Processing for Robust Automatic Speech Recognition (2.2MB), Ph.D Thesis, ECE Department, CMU, August 1996. (Compressed, 0.7MB) (Abstract)
- Pedro J. Moreno, Speech Recognition in Noisy Environments (1.3MB), Ph.D Thesis, ECE Department, CMU, May, 1996. (Compressed, 0.5MB) (Abstract)
- Fu-Hua Liu, Environmental Adaptation for Robust Speech Recognition (2.3MB), Ph.D Thesis, ECE Department, CMU, June, 1994. (Compressed, 0.6MB) (abstract)
- Y. Ohshima, Environmental Robustness in Speech Recognition using Physiologically-Motivated Signal Processing (2.1MB), Ph.D Thesis, ECE Department, CMU, December, 1993. (Compressed, 0.7MB) (abstract)
- W. A. Rozzi, Speaker Adaptation in Automatic Speech Recognition via Estimation of Correlated Mean Vectors (2MB), Ph.D Thesis, ECE Department, CMU, May, 1991. (Compressed, 0.6MB) (abstract)
- A. Acero, Acoustical and Environmental Robustness for Automatic Speech Recognition (.pdf, 1.4MB), Ph.D Thesis, ECE Department, CMU, September, 1990. (abstract)
MS Reports
- M.Seltzer, Automatic Detection of Corrupted Speech Features for Robust Speech Recognition, ECE Department, CMU, May, 2000.
- U. Jain, Connected Digit Recognition over Long Distance Telephone Linesusing the SPHINX-II System, Master's Report, ECE Department, CMU, May, 1995. (abstract)
- M. Siegler, Effects of Speech Rate on Speech Recognition Accuracy, Master's Report, ECE Department, CMU, December, 1995. (compress ps file) (abstract)
- P. J. Moreno, Speech Recognition in Telephone Environments, Master's Report, ECE Department, CMU, January, 1993. (abstract)
ICASSP : IEEE International Conference on Acoustics, Speech, and Signal Processing ICSLP : International Conference on Spoken Language Processing EUROSPEECH : European Conference on Speech Communication And Technology