Below is the current list of topics and papers. We welcome any suggestions, as well as any other comments and questions.
To join this reading group on spoken dialog systems, please send email to aliceo@cs.cmu.edu.
This is offered as a class (11-716, Graduate Seminar in Dialog Systems) in Fall 1999, starting September 15. Details, including a syllabus, will be circulated to LTI students and anyone on the email list. Interested students, faculty, and others should email Alex Rudnicky (air@cs.cmu.edu).
See http://www.lti.cs.cmu.edu/Courses/fall99-course-desc.html
for the time and place of the class.
Click on the date of discussion for comments/questions from the reading group meeting.
Here is a list of links where you can find on-line copies of publications on spoken dialog systems.
Acknowledgements: We would like to thank Diane Litman at AT&T and Nancy Green at CMU for their help.
Note: Some of the papers may be available only within CMU. Email me for help.
| Discussion Date(s)/Presenter | Topic/Bibliography |
| 3/19/99 | ELVIS (EmaiL Voice Interactive System) |
| 3/19/99 | Candace Kamm, Diane J. Litman, and Marilyn A. Walker. From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), pp. 1211-1214, December 1998, Sydney, Australia. |
| 3/19/99 | Marilyn Walker, Jeanne Fromer, Shrikanth Narayanan. Learning optimal dialogue strategies: a case study of a spoken dialogue agent for email. In Proceedings of ACL/COLING 98, 1998. |
| 3/29/99 | Repair and clarification dialogs |
| 3/29/99 air | Ronnie W. Smith. An evaluation of strategies for selectively verifying utterance meanings in spoken natural language dialog. International Journal of Human-Computer Studies 48:627-647, 1998. |
| 3/29/99 aliceo | Diane J. Litman, Marilyn A. Walker, and Michael S. Kearns. Automatic Detection of Poor Speech Recognition at the Dialogue Level. Manuscript submitted for publication. |
| 3/29/99 xw | Matthias Denecke and Alex Waibel. Dialogue strategies guiding users to their communicative goals. Proceedings of Eurospeech97, September 1997, Rhodes, Greece. |
| 4/7/99 | PARADISE, evaluating dialog systems |
| 3/12/99, 4/7/99 air | Marilyn Walker, Diane J. Litman, Candace A. Kamm, and Alicia Abella. PARADISE: A framework for evaluating spoken dialogue agents. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL-97), pp. 271-280, Madrid, Spain, July 1997. |
| 3/15/99, 4/7/99 aliceo | Diane J. Litman and Shimei Pan. Empirically evaluating an adaptable spoken dialogue system. In Proceedings of the 7th International Conference on User Modeling (UM), to appear June 1999. |
| (add'l ref) | Diane J. Litman, Shimei Pan, and Marilyn A. Walker. Evaluating Response Strategies in a Web-Based Spoken Dialogue Agent. In Proceedings of COLING-ACL'98, pp. 780-786, Montreal, Canada, August 1998. |
| 4/12/99 | Unfinished discussions |
| 4/12/99 denecke | Matthias Denecke and Alex Waibel. Dialogue strategies guiding users to their communicative goals. Proceedings of Eurospeech97, September 1997, Rhodes, Greece. |
| 3/19/99, 4/12/99 max | Candace Kamm, Diane J. Litman, and Marilyn A. Walker. From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), pp. 1211-1214, December 1998, Sydney, Australia. |
| 4/19/99 | Confidence |
| 4/19/99 rkm | Dhananjay Bansal and Mosur Ravishankar. New features for confidence annotation. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), December 1998, Sydney, Australia. |
| 4/19/99 rkm | C. Pao, P. Schmid, and J. Glass. Confidence Scoring for Speech Understanding Systems. Proc. ICSLP 98, Sydney, Australia, November 1998. |
| 4/26/99 | Evaluation of spoken MT and dialog systems |
| 4/26/99 kavita | Kavita Thomas. Designing a task-based evaluation methodology for a spoken machine translation system. To be published. |
| 4/26/99 air | J. Polifroni, S. Seneff, J. Glass, and T.J. Hazen. Evaluation Methodology for a Telephone-based Conversational System (PostScript; PDF on their website). Proc. First International Conference on Language Resources and Evaluation, pp. 42-50, Granada, Spain, May 1998. |
| 5/11/99 | Empirical discourse analysis (DAMSL) |
| 5/11/99 air | Mark G. Core. Analyzing and Predicting Patterns of DAMSL Utterance Tags. Working Notes of AAAI Spring Symposium on Applying Machine Learning to Discourse Processing, held in Stanford, CA, March 1998. |
| 5/11/99 klaus ries | Stolcke, et al. Dialog act modeling for conversational speech. Applying Machine Learning to Discourse Processing (1998). Papers from the 1998 AAAI Spring Symposium, Technical Report SS-98-01, pp. 98-105. AAAI Press, Menlo Park, CA. |
| (add'l ref) | Mark G. Core and James F. Allen. Coding Dialogs with the DAMSL Annotation Scheme. Working Notes of AAAI Fall Symposium on Communicative Action in Humans and Machines held in Boston, MA, November 1997. |
| 5/17/99 | NLP for spoken dialog systems |
| 5/17/99 | Marsal Gavalda. Growing Semantic Grammars. Proceedings of COLING/ACL-98, 1998. |
| 5/17/99 | Jerry Wright, Allen Gorin and Alicia Abella. Spoken language understanding within dialogs using a graphical model of task structure. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), December 1998, Sydney, Australia. |
| 6/7/99 | Modeling dialog |
| 6/7/99 | Esther Levin and Roberto Pieraccini. A Stochastic Model of Computer-Human Interaction for Learning Dialogue Strategies. Proceedings of Eurospeech97, September 1997, Rhodes, Greece. |
| 6/7/99 | Esther Levin, Roberto Pieraccini, and Wieland Eckert. Using Markov Decision Process for Learning Dialogue Strategies. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98), May 1998, Seattle, WA. |
| 7/7/99 | Cooperative response generation |
| 7/7/99 Yan Qu | Yan Qu and Steve Beale. A constraint-based model for cooperative response generation in information dialogues. To appear in AAAI-99. |
| 7/7/99 Nancy Green | J. Chu-Carroll and S. Carberry. A plan-based model for response generation in collaborative task-oriented dialogues. Proceedings of the 12th AAAI, pages 799-805, 1994. |
| 7/12/99 | Reactive planning in tutorial system |
| 7/12/99 Reva Freedman | Reva Freedman. Atlas: A plan manager for mixed-initiative, multimodal dialogue. To appear in AAAI-99 Workshop on Mixed-Initiative Intelligence, Orlando. |
| 7/12/99 Reva Freedman | Reva Freedman. Degrees of mixed-initiative interaction in an intelligent tutoring system. Proceedings of AAAI-97 Spring Symposium on Computational Models for Mixed-Initiative Interaction, 1997. |
| 8/4/99 | Discourse |
| 8/4/99 | Barbara J. Grosz and Candy Sidner. Attention, Intentions, and the Structure of Discourse. Computational Linguistics, Volume 12, Number 3. 1986. |
| 8/18/99 | Intonational characteristics of discourse |
| 8/18/99 | Christine Nakatani, Julia Hirschberg, and Barbara Grosz. Discourse structure in spoken language: studies on speech corpora. Working Notes of the AAAI-95 Spring Symposium on Empirical Methods in Discourse Interpretation, Palo Alto, 1995. |
| 8/18/99 | Barbara J. Grosz and Julia Hirschberg. Some intonational characteristics of discourse structure. Proceedings of ICSLP, 1992. |
| TBD | MIT systems |
| TBD eht | S. Seneff, E. Hurley, R. Lau, C. Pao, P. Schmid, and V. Zue. Galaxy-II: A reference architecture for conversational system development. In Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP), December 1998, Sydney, Australia. |
| TBD pcc | V. Zue, S. Seneff, J. Glass, L. Hetherington, E. Hurley, H. Meng, C. Pao, J. Polifroni, R. Schloming, and P. Schmid. From interface to content: Translingual access and delivery of on-line information. Proceedings of Eurospeech97, pp. 2227-2230, September 1997, Rhodes, Greece. |
| TBD rkm | H. Meng, S. Busayapongchai, J. Glass, D. Goddeau, L. Hetherington, E. Hurley, C. Pao, J. Polifroni, S. Seneff, and V. Zue. Wheels: A Conversational System in the Automobile Classifieds Domain. Proc. ICSLP 96, pp. 542-545, Philadelphia, PA, October 1996. |
| TBD | Integrated approaches and systems |
| TBD | Diane J. Litman and J. Allen. A plan recognition model for subdialogues in conversation. Cognitive Science, Vol. 11, No. 2, pp. 163-200, 1987. |
| TBD | R.W. Smith, D.R. Hipp, and A.W. Biermann. An Architecture for Voice Dialog Systems Based on Prolog-Style Theorem Proving. Computational Linguistics, vol. 21, no. 3, pages 281-320, September 1995. |
| TBD | Interface design |
| TBD | N. Yankelovich, G. A. Levow, and M. Marx, Designing SpeechActs: Issues in Speech User Interfaces (html version), CHI '95 Proceedings, ACM Conference on Human Factors in Computing Systems, Denver, CO, May 7-11, 1995. |
| TBD | Others (just haven't grouped these into the right topics yet) |
| TBD | Shimei Pan and Kathleen McKeown. Integrating language generation with speech synthesis in a concept to speech system. In the Proceedings of ACL/EACL '97 Concept to Speech Workshop, Madrid, Spain, 1997. |
| TBD | Barbara J. Grosz and Candy Sidner. Attention, Intentions, and the Structure of Discourse. Computational Linguistics, Volume 12, Number 3. 1986. |
| TBD | M. Eskenazi, A. Rudnicky, K. Gregory, P. Constantinides, R. Brennan, C. Bennett, and J. Allen. Data collection and processing in the Carnegie Mellon communicator. To appear in Eurospeech99. |
| TBD | Ronnie W. Smith and Steven A. Gordon. Effects of variable initiative on linguistic behavior in human-computer spoken natural language dialog. Computational Linguistics, 23(1), 1997. |
| TBD | Hone and Baber. Modeling the effects of constraint upon speech-based human-computer interaction. International Journal of Human-Computer Studies, 50, 1999. |
| TBD | M. Danieli and E. Gerbino. Metrics for evaluating dialogue strategies in a spoken language system. In Proceedings of the 1995 AAAI Spring Symposium on empirical methods in discourse interpretation and generation. 1995. |
| TBD | Ronnie W. Smith and Richard Hipp. Spoken natural language dialog systems: a practical approach (selected chapter). Oxford University Press, New York, 1994. |
| TBD | Eric K. Ringger and James F. Allen. Error Correction via a Post-Processor for Continuous Speech Recognition. Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96). Atlanta, GA. May 1996. |
| TBD | Eric K. Ringger and James F. Allen. A Fertility Channel Model for Post-Correction of Continuous Speech Recognition. Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP'96). Philadelphia, PA. October 1996. |
| TBD | Eckert, W., Levin, E. and Pieraccini, R. User modeling for spoken dialogue system. 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, 1997. |
| TBD | Jeanne C. Fromer. Learning Optimal Discourse Strategies in a Spoken Dialogue System. Master's Thesis, Massachusetts Institute of Technology, September 1998. |
| TBD | Marilyn Walker, Jeanne Fromer, Giuseppe Di Fabbrizio, Craig Mestel and Don Hindle. What Can I Say? In Proceedings of the Conference on Human Factors in Computing Systems, CHI98, 1997. |
| TBD | Candace Kamm, Shrikanth Narayanan, Dawn Dutton, and Russell Ritenour. Evaluating spoken dialog systems for telecommunication services. In Eurospeech '97, Rhodes, Greece, 1997. |
| TBD | Irene Langkilde and Kevin Knight. The practical value of n-grams in generation. Proceedings of the International Natural Language Generation Workshop, 1998. |
| TBD | Irene Langkilde and Kevin Knight. Generation that exploits corpus-based statistical knowledge. Proceedings of the Conference of the Association for Computational Linguistics (COLING-ACL), 1998. |
| TBD | Christine Nakatani, Julia Hirschberg, and Barbara Grosz. Discourse structure in spoken language: studies on speech corpora. In Working Notes of the AAAI-95 Spring Symposium on Empirical Methods of Discourse Interpretation, 1995. |
Suggested Topics (please also suggest specific papers)