Readings on Spoken Dialog Systems (11-716 Graduate Seminar)

Next Meeting: Wednesday, September 15, 1999
Location: Scaife Hall 206
Papers: No papers.

Below is the current list of topics and papers. We welcome any suggestions, as well as any other comments and questions.

To join this reading group on spoken dialog systems, please send email to aliceo@cs.cmu.edu.

This is a class (11-716, Graduate Seminar in Dialog Systems) in Fall 1999 (starting September 15). Details, including a syllabus, will be passed around to LTI students and anyone on the email list. Interested students/faculty/others should email Alex Rudnicky (air@cs.cmu.edu).
See http://www.lti.cs.cmu.edu/Courses/fall99-course-desc.html for time/place of the class.

Click on the date of discussion for comments/questions from the reading group meeting.

Here is a list of links where you can find on-line copies of publications on spoken dialog systems.

Acknowledgements: We would like to thank Diane Litman at AT&T and Nancy Green at CMU for their help.
Note: Some of the papers may be available only within CMU. Email me for help.
 
Discussion Date(s)/
Presenter
Topic/Bibliography
3/19/99 ELVIS (EmaiL Voice Interactive System)
3/19/99 Candace Kamm, Diane J. Litman, and Marilyn A. Walker. From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), pp. 1211-1214, December 1998, Sydney, Australia. 
3/19/99 Marilyn Walker, Jeanne Fromer, Shrikanth Narayanan. Learning optimal dialogue strategies: a case study of a spoken dialogue agent for email. In Proceedings of ACL/COLING 98 , 1998.
3/29/99 Repair and clarification dialogs
3/29/99
air
Ronnie W. Smith. An evaluation of strategies for selectively verifying utterance meanings in spoken natural language dialog. International Journal of Human-Computer Studies 48:627-647, 1998.
3/29/99
aliceo
Diane J. Litman, Marilyn A. Walker, and Michael S. Kearns. Automatic Detection of Poor Speech Recognition at the Dialogue Level. Manuscript submitted for publication. 
3/29/99
xw
Matthias Denecke and Alex Waibel. Dialogue strategies guiding users to their communicative goals. Proceedings of Eurospeech97, September 1997, Rhodes, Greece.
4/7/99 PARADISE, evaluating dialog systems
3/12/99
4/7/99
air
Marilyn Walker, Diane J. Litman, Candace A. Kamm, and Alicia Abella. PARADISE: A framework for evaluating spoken dialogue agents. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL-97), pp. 271-280, Madrid, Spain, July, 1997.
3/15/99
4/7/99
aliceo
Diane J. Litman and Shimei Pan. Empirically evaluating an adaptable spoken dialogue system. In Proceedings of the 7th International Conference on User Modeling (UM), to appear June 1999.
(add'l
ref)
Diane J. Litman, Shimei Pan, and Marilyn A. Walker. Evaluating Response Strategies in a Web-Based Spoken Dialogue Agent. In Proceedings of COLING-ACL'98, pp. 780-786, Montreal, Canada, August 1998. 
4/12/99 Unfinished discussions
4/12/99
denecke
Matthias Denecke and Alex Waibel. Dialogue strategies guiding users to their communicative goals. Proceedings of Eurospeech97, September 1997, Rhodes, Greece.
3/19/99
4/12/99
max
Candace Kamm, Diane J. Litman, and Marilyn A. Walker. From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), pp. 1211-1214, December 1998, Sydney, Australia. 
4/19/99 Confidence
4/19/99
rkm
Dhananjay Bansal and Mosur Ravishankar. New features for confidence annotation. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), December 1998, Sydney, Australia.
4/19/99
rkm
C. Pao, P. Schmid, and J. Glass. Confidence Scoring for Speech Understanding Systems. Proc. ICSLP 98, Sydney, Australia, November 1998.
4/26/99 Evaluation of spoken MT and dialog systems
4/26/99
kavita
Kavita Thomas. Designing a task-based evaluation methodology for a spoken machine translation system. To be published.
4/26/99
air
J. Polifroni, S. Seneff, J. Glass, and T.J. Hazen. Evaluation Methodology for a Telephone-based Conversational System (postscript).PDF on their website. Proc. First International Conference on language Resources and Evaluation, pp. 42-50, Granada, Spain, May 1998. 
5/11/99 Empirical discourse analysis (DAMSL)
5/11/99
air
Mark G. Core. Analyzing and Predicting Patterns of DAMSL Utterance Tags. Working Notes of AAAI Spring Symposium on Applying Machine Learning to Discourse Processing to be held in Stanford, CA, March 1998. 
5/11/99
klaus ries
Stolcke, et al. Dialog act modeling for conversational speech. Applying Machine Learning to Discourse Processing (1998). Papers from the 1998 AAAI Spring Symposium, Technical Report SS-98-01, pp. 98-105. AAAI Press, Menlo Park, CA. 
(add'l ref) Mark G. Core and James F. Allen. Coding Dialogs with the DAMSL Annotation Scheme. Working Notes of AAAI Fall Symposium on Communicative Action in Humans and Machines held in Boston, MA, November 1997. 
5/17/99 NLP for spoken dialog systems
5/17/99 Marsal Gavalda. Growing Semantic Grammars.Proceedings of COLING/ACL-98, 1998.
5/17/99 Jerry Wright, Allen Gorin and Alicia Abella. Spoken language understanding within dialogs using a graphical model of task structure. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), December 1998, Sydney, Australia.
6/7/99 Modeling dialog
6/7/99 Esther Levin and Roberto Pieraccini. A Stochastic Model of Computer-Human Interaction for Learning Dialogue Strategies. Proceedings of Eurospeech97, September 1997, Rhodes, Greece.
6/7/99 Esther Levin, Roberto Pieraccini, and Wieland Eckert. Using Markov Decision Process for Learning Dialogue Strategies. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98), May 1998, Seattle, WA.
7/7/99 Cooperative response generation
7/7/99
Yan Qu
Yan Qu and Steve Beale. A constraint-based model for cooperative response generation in information dialogues. To appear in AAAI-99.
7/7/99
Nancy Green
J. Chu-Carroll and S. Carberry. A plan-based model for response generation in collaborative task-oriented dialogues. Proceedings of the 12th AAAI, pages 799-805, 1994.
7/12/99 Reactive planning in tutorial system
7/12/99
Reva Freedman
Reva Freedman. Atlas: A plan manager for mixed-initiative, multimodal dialogue. To appear in AAAI-99 Workshop on Mixed-Initiative Intelligence, Orlando.
7/12/99
Reva Freedman
Reva Freedman. Degrees of mixed-initiative interaction in an intelligent tutoring system. Proceedings of AAAI-97 Spring Symposium on Computational Models for Mixed-Initiative Interaction, 1997.
8/4/99 Discourse
8/4/99 Barbara J. Grosz and Candy Sidner. Attention, Intentions, and the Sturcture of Discourse. Computational Linguistics, Volume 12, Number 3. 1986.
8/18/99 Intonational characteristics of discourse
8/18/99 Christine Nakatani, Julia Hirschberg, and Barbara Grosz. Discourse structure in spoken language: studies on speech corpora. Working Notes of the AAAI-95 Spring Symposium on Empirical Methods in Discourse Interpretation, Palo Alto, 1995.
8/18/99 Barbara J. Grosz and Julia Hirschberg. Some intonational characteristics of discourse structure. Proceedings of ICSLP, 1992.
TBD  MIT systems
TBD
eht
S. Seneff, E. Hurley, R. Lau, C. Pao, P. Schmid, and V. Zue. Galaxy-II: A reference architecture for conversational system development. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), December 1998, Sydney, Australia. 
TBD
pcc
V. Zue, S. Seneff, J. Glass, L. Hetherington, E. Hurley, H. Meng, C. Pao, J. Polifroni, R. Schloming, and P. Schmid. From interface to content: Translingual access and delivery of on-line information. Proceedings of Eurospeech97, pp. 2227-2230, September 1997, Rhodes, Greece.
TBD
rkm
H. Meng, S. Busayapongchai, J. Glass, D. Goddeau, L. Hetherington, E. Hurley, C. Pao, J. Polifroni, S. Seneff, and V. Zue. Wheels: A Conversational System in the Automobile Classifieds Domain. Proc. ICSLP 96, pp. 542-545, Philadelphia, PA, October 1996.
TBD Integrated approaches and systems
TBD Diane J. Litman and J. Allen. A plan recognition model for subdialogues in conversation. Cognitive Science, Vol. 11, No. 2, p. 163-200, 1987.
TBD R.W. Smith, D.R. Hipp, and A.W. Biermann. An Architecture for Voice Dialog Systems Based on Prolog-Style Theorem Proving. Computational Linguistics, vol. 21, no. 3, pages 281-320, September 1995.
TBD Interface design
TBD N. Yankelovich, G. A. Levow, and M. Marx, Designing SpeechActs: Issues in Speech User Interfaces (html version), CHI '95 Proceedings, ACM Conference on Human Factors in Computing Systems, Denver, CO, May 7-11, 1995.
TBD Others (just haven't grouped these into the right topics yet)
TBD Shimei Pan and Kathleen McKeown. Integrating language generation with speech synthesis in a concept to speech system. In the Proceedings of ACL/EACL '97 Concept to Speech Workshop, Madrid, Spain, 1997.
TBD Barbara J. Grosz and Candy Sidner. Attention, Intentions, and the Sturcture of Discourse. Computational Linguistics, Volume 12, Number 3. 1986.
TBD M. Eskenazi, A. Rudnicky, K. Gregory, P. Constantinides, R. Brennan, C. Bennett, and J. Allen. Data collection and processing in the Carnegie Mellon communicator. To appear in Eurospeech99.
TBD Ronnie W. Smith and Steven A. Gordon. Effects of variable initiative on linguistic behavior in human-computer spoken natural language dialog. Computational Linguistics, 23(1), 1997.
TBD Hone and Baber. Modeling the effects of constraint upon speech-based human-computer interaction. International Journal of Human-Computer Studies, 50, 1999.
TBD M. Danieli and E. Gerbino. Metrics for evaluating dialogue strategies in a spoken language system. In Proceedings of the 1995 AAAI Spring Symposium on empirical methods in discourse interpretation and generation. 1995.
TBD Ronnie W. Smith and Richard Hipp. Spoken natural language dialog systems: a practical approach (selected chapter). Oxford University Press, New York, 1994.
TBD Eric K. Ringger and James F. Allen. Error Correction via a Post-Processor for Continuous Speech Recognition. Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96). Atlanta, GA. May 1996. 
TBD Eric K. Ringger and James F. Allen. A Fertility Channel Model for Post-Correction of Continuous Speech Recognition.Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP'96). Philadelphia, PA. October 1996. 
TBD Eckert, W., Levin, E. and Pieraccini, R.User modeling for spoken dialogue system. 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, 1997.
TBD Jeanne C. FromerLearning Optimal Discourse Strategies in a Spoken Dialogue System.  Master's Thesis, Massachusetts Institute of Technology, September 1998. 
TBD Marilyn Walker, Jeanne Fromer, Giuseppe Di Fabbrizio, Craig Mestel and Don Hindle. What Can I Say? . In Proceedings of the Conference on Human Factors in Computing Systems , CHI98, 1997.
TBD Candace Kamm, Shrikanth Narayanan, Dawn Dutton, and Russell Ritenour. Evaluating spoken dialog systems for telecommunication services. In Eurospeech '97, Rhodes Greece, 1997.
TBD Irene Langkilde and Kevin Knight. The practical value of n-grams in generation. Proceedings of the International Natural Language Generation Workshop, 1998.
TBD Irene Langkilde and Kevin Knight. Generation that exploits corpus-based statistical knowledge. Proceedings of the Conference of the Association for Computational Linguistics (COLING-ACL), 1998.
TBD Christine Nakatani, Julia Hirschberg, and Barbara Grosz. Discourse structure in spoken language: studies on speech corpora. In Working Notes of the AAAI-95 Spring Symposium on Empirical Methods of Discourse Interpretation, 1995.


Suggested Topics (please also suggest specific papers)



Last Updated: September 15, 1999 by Alice Oh