Publication List

Roni Rosenfeld, School of Computer Science, Carnegie Mellon University

Note: Many papers that appear in .pdf format below are also available, in the same directory, in the following formats: pdf.gz, ps, ps.gz.   Files that appear in a .doc.gz format below are also available, in the same directory, in a .doc format.

Spoken Language Technologies for Development:

Human-Machine Speech Communication:

Computational Biology:

Statistical/Machine Learning Methods in Speech and Language Processing:

  • Mark Johnson, Sanjeev Khudanpur, Mari Ostendorf and Roni Rosenfeld (eds.), Mathematical Foundations of Speech and Language Processing, the IMA volumes in Mathematics and Its Applications, 138, Springer, 2004.
  • Xiaojin Zhu and Roni Rosenfeld, Improving Trigram Language Modeling with the World Wide Web. In Proc ICASSP 2001, longer version published as Technical Report CMU-CS-00-171.
  • Roni Rosenfeld, Stanley F. Chen and Xiaojin Zhu.  Whole-Sentence Exponential Language Models: a Vehicle for Linguistic-Statistical Integration.  Computers Speech and Language, 15(1), 2001. 
  • Can Cai, Roni Rosenfeld and Larry Wasserman.  Exponential Language Models, Logistic Regression, and Semantic Coherence.  In Proc. NIST/DARPA Speech Transcription Workshop, May 2000.
  • Chris Paciorek and Roni RosenfeldMinimum Classification Error Training in Exponential Language Models.  In Proc. NIST/DARPA Speech Transcription Workshop, May 2000.
  • Ronald Rosenfeld.  Two decades of Statistical Language Modeling: Where Do We Go From Here?  Proceedings of the IEEE88(8), 2000.
  • Ronald Rosenfeld, Incorporating Linguistic Structure into Statistical Language Models, Philosophical Transactions of the Royal Society, Series A, 358 (1769), pp. 1311--1324, April 2000.
  • Ronald Rosenfeld, Larry Wasserman, Can Cai, Xiaojin Zhu. Interactive Feature Induction and Logistic Regression for Whole Sentence Exponential Language Models. In Proc. IEEE workshop on Automatic Speech Recognition and Understanding, Keystone, Colorado, December 1999.
  • Xiaojin Zhu, Stanley Chen and Ronald Rosenfeld. Linguistic Features for Whole Sentence Maximum Entropy Language Models. In Proc. Eurospeech '99, Hungary, September 1999.
  • Kristie Seymore, Andrew McCallum and Ronald Rosenfeld. Learning Hidden Markov Model Structure for Information Extraction.  AAAI'99 Workshop on Machine Learning for Information Extraction.
  • Stanley Chen and Ronald Rosenfeld.  Efficient Sampling and Feature Selection in Whole Sentence Maximum Entropy Language Models.  In Proc. ICASSP '99, Phoenix, Arizona, March 1999.
  • Adam Kalai, Stanley Chen, Avrim Blum and Ronald Rosenfeld. On-Line Algorithms for Combining Language Models. In Proc. ICASSP '99, Phoenix, Arizona, March 1999.
  • Stanley Chen and Ronald Rosenfeld. A Survey of Smoothing Techniques for ME Models.  IEEE Trans. Speech and Audio Processing,8(1), pp. 37--50. January 2000.  Also published as A Gaussian Prior for Smoothing Maximum Entropy Models, Technical Report CMU-CS-99-108, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, February 1999.
  • Kristie Seymore, Stan Chen and Ronald Rosenfeld.  Nonlinear Interpolation of Topic Models for Language Model Adaptation.  In Proc. ICSLP98, Sydney, Australia.
  • Andrew McCallum, Ronald Rosenfeld, Tom Mitchell and Andrew Ng.  Improving Text Classification by Shrinkage in a Hierarchy of Classes.  Intl. Conference on Machine Learning, ICML-98, July 1998.
  • Stanley Chen, Kristie Seymore and Ronald Rosenfeld. Topic Adaptation for Language Modeling using Unnormalized Exponential Models.  In Proc. Int'l Conf. on Acoustics, Speech and Signal Processing, Seattle, Washington, May 1998.
  • Stanley Chen, Douglas Beeferman and Ronald Rosenfeld. Evaluation Metrics for Language Models.  In Proc. DARPA Broadcast News Transcription and Understanding Workshop (BNTUW), Lansdowne, Virginia, February 1998.
  • K. Seymore, S. Chen, S.J. Doh, M. Eskenazi, E. Gouvea, B. Raj, M. Ravishankar, R. Rosenfeld, M. Siegler, R. Stern and E. Thayer.  The 1997 CMU Sphinx-3 English Broadcast News Transcription System.  In Proc. DARPA Broadcast News Transcription and Understanding Workshop (BNTUW), Lansdowne, Virginia, February 1998.
  • Ronald Rosenfeld. A Whole Sentence Maximum Entropy Language Model.  In Proc. IEEE workshop on Automatic Speech Recognition and Understanding, Santa Barbara, California, December 1997.
  • Pierre Dupont and Ronald Rosenfeld. Lattice Based Language Models. Technical Report CMU-CS-97-173, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, September 1997.
  • Kristie Seymore and Ronald Rosenfeld. Using Story Topics for Language Model Adaptation.  In Proc. Eurospeech '97, September 1997.  Longer version published as Large-Scale Topic Detection and Language Model Adaptation, Technical Report CMU-CS-97-152, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, June 1997.
  • Philip Clarkson and Ronald Rosenfeld. Statistical Language Modeling using the CMU-Cambridge toolkit.  In Proc. Eurospeech '97, September 1997 (ELRA Best Student Paper Prize).
  • Andreas Stolcke, Ciprian Chelba, David Engle, Victor Jimenez, Lidia Mangu, Harry Printz, Eric Ristad, Ronald Rosenfeld, Dekai Wu. Structure and Performance of a dependency language model.  In Proc. Eurospeech'97, September 1997.
  • Kristie Seymore, Stanley Chen, Maxine Eskenazi and Ronald Rosenfeld. Language and Pronunciation Modeling in the CMU 1996 Hub 4 Evaluation.  In Proc. ARPA Spoken Langauge Technology Workshop, Chantilly, VA, February 1997.
  • P. Placeway, S. Chen, M. Eskenazi, U. Jain, V. Parikh, B. Raj, M. Ravishankar, R. Rosenfeld, K. Seymore, M. Siegler, R. Stern and E. Thayer.  The 1996 Hub-4 Sphinx-3 System.  In Proc. ARPA Spoken Langauge Technology Workshop,Chantilly, VA, February 1997.
  • Ronald Rosenfeld. A Maximum Entropy Approach to Adaptive Statistical Language Modeling. Computer, Speech and Language 10, 187--228, 1996 (2001 award for “Most Influential Paper in CSL in the Last 5 Years").  Longer version published as Adaptive Statistical Language Modeling: A Maximum Entropy Approach, Ph.D. thesis, Computer Science Department, Carnegie Mellon University,TR CMU-CS-94-138, April 1994.
  • Kristie Seymore and Ronald Rosenfeld.  Scalable Backoff Language Models.  In Proc. ICSLP'96, Philadelphia, October 1996.  Longer version published as Scalable Trigram Backoff Language Models, Technical Report CMU-CS-96-139, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, May 1996.
  • Lin Chase, Kristie Seymore and Roni RosenfeldLanguage Modeling for Large Vocabulary Conversational Speech Recognition. Workshop on Large Vocabulary Conversational Speech Recognition, Maritime Institue of Technology, Linthicum Heights, Maryland, April 29--May 1, 1996.
  • Ronald Rosenfeld, Rajeev Agarwal, Bill Byrne, Rukmini Iyer, MarkLiberman, Liz Shriberg, Jack Unverferth, Dimitra Vergyri, EnriqueVidalError Analysis and Language Modeling for Conversational Speech: TeamReport.  In Proceedings of the 1995 Language Modeling Workshop, Johns Hopkins University, July--August 1995.
  • Ronald Rosenfeld. Optimizing Lexical and Ngram Coverage Via Judicious Use of Linguistic Data.  In Proc. Eurospeech'95, Madrid, Spain, September 1995.
  • Ronald Rosenfeld. An Impact Matrix for the 1994 CSR Hub Evaluation.  In Proc. ARPA Spoken Language Technology Workshop, Austin, TX, January 1995.
  • Ronald Rosenfeld. The CMU Statistical Language Modeling Toolkit, and its use in the 1994 ARPA CSR Evaluation.  In Proc. ARPA Spoken Language Technology Workshop, Austin, TX, January 1995.
  • L. Chase, R. Rosenfeld, A. Hauptmann, M. Ravishankar, E. Thayer, P.Placeway, R. Weide, C. Lu.  Improvements in Language, Lexical, and Phonetic Modeling in Sphinx-II.  In Proc. ARPA Spoken Langauge Technology Workshop, Austin, TX, January 1995.
  • Lin Chase, Ron Rosenfeld, and Wayne Ward.  Error-Responsive Modifications to Speech Recognizers: Negative N-grams.  In Proc. International Conference on Spoken Language Processing, Yokohama, Japan, September 1994.
  • M. Hwang, R. Rosenfeld, E. Thayer, R. Mosur, L. Chase, R. Weide, X. Huang, and F. Alleva.  Improving Speech-Recognition Performance via Phone-Dependent VQ Codebooks and Adaptive Language Models in SPHINX-II.  In Proc. Int'l Conf. on Acoustics, Speech and Signal Processing, Australia, April 1994.
  • Ronald Rosenfeld. A Hybrid Approach to Adaptive Statistical Language Modeling.  In Proc. ARPA Human Language Technology Workshop, Plainsboro, NJ, March 1994.
  • R. Rosenfeld, E. Thayer, R. Mosur, L. Chase, R. Weide, M. Hwang,X. Huang and F. Alleva. Improved Acoustic and Adaptive Language Models for Continuous Speech Recognition.  In Proc. ARPA Spoken Language Systems Workshop, March 1994.
  • Francis Kubala, Jerome Bellegarda, Jordan Cohen, Dave Pallett, Doug Paul, Mike Phillips, Raja Rajasekaran, Fred Richardson, Mike Riley, Roni Rosenfeld, Bob Roth, MitchWeintraub. The Hub and Spoke Paradigm for CSR Evaluation.  In Proc. ARPA Human Language Technology Workshop, Plainsboro, NJ, March 1994.
  • Chase, L., Mosur, R., and Rosenfeld, R. Language Model Adaptation in the CSR EvaluationARPA Spoken Language Systems Workshop, Plainsboro, NJ, March 1994.
  • Ronald Rosenfeld. Modeling Long-Distance Linguistic Phenomena Within the Maximum Entropy Framework.  Invited speaker at IEEE Automatic Speech Recognition workshop, Snowbird, UT, December 1993.
  • Raymond Lau, Ronald Rosenfeld, and Salim Roukos. Trigger-based Language Models Using Maximum Likelihood Estimation of Exponential Distributions.  In Proc. Int'l Conf. on Acoustics, Speech and Signal Processing, Minneapolis, MN, April 1993.
  • Raymond Lau, Ronald Rosenfeld, and Salim Roukos. Adaptive Language Modeling Using the Maximum Entropy Principle.  In Proc. ARPA Human Language Technology Workshop, March 1993.
  • Raymond Lau, Ronald Rosenfeld, and Salim Roukos. Building Scalable N-gram Language Models Using Maximum Likelihood Maximum Entropy N-gram models. U.S. Patent 5,467,425, February 1993.
  • Xuedong Huang, Fil Alleva, Mei-Yuh Hwang, and Ronald Rosenfeld. An Overview of the SPHINX-II Speech Recognition System.  In Proc. ARPA Human Language Technology Workshop, March 1993.
  • Xuedong Huang, Fil Alleva, Mei-Yuh Hwang, Ronald Rosenfeld, and Rich Stern. The SPHINX-II system used in the DARPA 1992 evaluationDARPA Spoken Language Technology Workshop, Boston, MA, January 1993.
  • Ronald Rosenfeld. Adaptive Statistical Language Modeling: A Maximum Entropy ApproachPh.D. thesis proposal, Carnegie Mellon University, October 1992.
  • Ronald Rosenfeld, Xuedong Huang and Merrick Furst. Exploiting Correlations Among Competing Models with Application to Large Vocabulary Speech Recognition.  In Proc. Int'l Conf. on Acoustics, Speech and Signal Processing, San Francisco, CA, March 1992.
  • X.D. Huang, F. Alleva, H.W. Hon, M.Y. Hwang, K.F. Lee, and R. Rosenfeld. The SPHINX-II Speech Recognition System: An Overview.  Computer, Speech and Language, 2, pages 137--148, 1993.  Also published as Technical Report CMU-CS-92-112, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, February, 1992.
  • F. Alleva, H. Hon, X. Huang, M. Hwang, R. Rosenfeld, and R. Weide. Applying SPHINX-II to the DARPA Wall Street Journal CSR Task.  In Proc. DARPA Speech and Language Workshop, Morgan Kaufmann Publishers,San Mateo, CA, February 1992.
  • Ronald Rosenfeld and Xuedong Huang. Improvements in Stochastic Language Modeling.  In Proc. DARPA Speech and Language Workshop, Morgan Kaufmann Publishers, San Mateo, CA, February 1992.
  • Ronald Rosenfeld, Xuedong Huang and Merrick Furst. Exploiting Correlations Among Models with Application to Large Vocabulary Speech Recognition. Technical Report CMU-CS-91-148, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, May, 1991.
  • Barak Pearlmutter and Ronald Rosenfeld. Chaitin-Kolmogorov Complexity and Generalization in Neural Networks.  In D. Touretzky, J. Moody and R. Lippmann (eds.), Advances in Neural Information Processing Systems 3. San Mateo, CA: Morgan Kaufmann, 1991.
  • Ronald Rosenfeld and David S. Touretzky. Coarse-Coded Symbol Memories and Their PropertiesJournal of Complex Systems, 2(4), pp. 463-484, August 1988.
  • Ronald Rosenfeld and David S. Touretzky. A Survey of Coarse-Coded Symbol Memories.  In Proceedings of the 1988 Connectionist Models Summer School, Carnegie Mellon, June 17-26, 1988. Morgan Kaufmann, 1989.
  • Ronald Rosenfeld and David S. Touretzky. Four Capacity Models of Coarse-Coded Symbol Memories.  Technical Report CMU-CS-87-182, Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, December, 1987.
  • Ronald Rosenfeld and David S. Touretzky. Scaling Properties of Coarse-Coded Symbol Memories.  In Dana Z. Anderson (Ed.), Neural Information Processing Systems 1, pp.652--661, AIP, New York, 1988.
  • Ronald Rosenfeld, David S. Touretzky and the Boltzmann Research Group.  Connectionist Models as Neural Abstractions: commentary on ``Brains Make Chaos to Make Sense of the World'' by C.A. Skarda and W.J. Freeman.  In Behavioral and Brain Sciences, 10(2), June 1987, pp.181-183.