Postscript Version

Robust, Incremental Parsing and Disambiguation for a Dialog Agent

Lenhart K. Schubert

Department of Computer Science
University of Rochester
Rochester, NY 14627-0226

CONTACT INFORMATION

Email: schubert@cs.rochester.edu
Phone: (716) 275-8845
Fax : (716) 461-2018

WWW PAGE

http://www.cs.rochester.edu/u/schubert/

PROGRAM AREA

Speech and Natural Language Understanding.

KEYWORDS

dialog parsing, robust parsing, incremental parsing, integrated disambiguation

PROJECT SUMMARY

The goal of this project is to develop general techniques for robust parsing and disambiguation of task-oriented spoken dialogs. The primary test bed is the TRAINS transportation planning assistant at U. Rochester. This system assists a user to solve transportation planning problems of varying complexity, using spoken interaction and a map display with use of pointing, menus, etc.

While in the current system turn-taking is controlled by the user through a button, our eventual goal is to have the system participate as a full partner in a free-flowing dialog. A primary challenge in such dialogs lies in the disruption of ordinary grammatical structure by repairs, interjected acknowledgements and corrections, etc. In human-machine dialogs, the problem is compounded by the speech recognition errors introduced by the speech recognizer. The techniques we are developing for parsing such dialogs include the use of fragmentary parses (in addition to complete utterance parses where possible), domain-specific rules to hypothesize speech acts based on surface constituents, the use of cue words and prosody, and a novel dialog chart parser with one track for each speaker and with metarules for forming constituents that straddle acknowledgements, repairs, etc., and even "jump" from track to track.

An essential part of our project is the development of disambiguation methods for resolving both structural and word sense ambiguities. These methods must be usable in an incremental fashion, both to keep in check the growth in alternative analyses and interpretations, and (eventually) to allow mid-sentence acknowledgements, corrections, etc., by the system while the user is speaking. Our current approach involves a notion of head patterns associated with phrase structure rules. The patterns are based on word-senses of head words of syntactic constituents, or classes of these word-senses. The frequencies of these patterns will be extracted from various text and dialog corpora, and used in a probabilistic parser. This approach is expected to combine the benefits of word n-gram based disambiguation, probabilistic grammars, and semantic preference-based approaches.

To further the goal of integrating syntactic analysis and discourse analysis, we are also participating in a major effort to develop a general dialog annotation scheme that will capture the pragmatically most important properties and relationships of dialog segments, and to produce annotated dialog corpora based on this scheme.

The following points summarize the progress to date on the various facets of our research.

PROJECT REFERENCES

J.F. Allen and M.G. Core, "Draft of DAMSL: Dialog Act Markup in Several Layers", draft manual, Dept. of Computer Science, Univ. of Rochester, Rochester, NY, March 1997.

J.F. Allen, B. Miller, E.K. Ringger, and T. Sikorski, "A robust system for natural spoken dialogue", Proc. of the 34th Ann. Meet. of the Assoc. for Computational Linguistics (ACL'96), pp. 62-70, 1996.

G.N. Carlson, "Thematic roles and the individuation of events," to appear in S. Rothstein (ed.), Events and Grammar.

G.N. Carlson and B. Spejewski, "Generic passages", Natural Language Semantics 5 1997, pp. 1-65 (to appear).

M.G. Core, "Using parsed corpora for structural disambiguation in the TRAINS domain", in Proc. of the 34th Ann. Meet. of the Assoc. for Computational Linguistics (ACL'96), pp. 345-7, 1996. Expanded version available as Tech. Rep. 608, Dept. of Computer Science, Univ. of Rochester, Rochester NY 14627-0226.

M.G. Core and L. K. Schubert, "Handling speech repairs and other disruptions through parser metarules", in Working Notes, AAAI Spring Symposium on Computational Models for Mixed Initiative Interaction, Stanford, CA, March 24-26, 1997.

M.G. Core and L.K. Schubert, "Dialog parsing in the TRAINS system", Tech. Rep. 612, Dept. of Computer Science, Univ. of Rochester, Rochester, NY 14627-0226, March 1996.

A.N. Kaplan and L.K. Schubert. "Simulative inference in a computational model of belief." In Second International Workshop on Computational Semantics, pp. 107-121, Tilburg, The Netherlands, January 8-10, 1997.

AREA BACKGROUND

Our project falls into the area of "conversational problem-solving assistants". This relatively new area still lacks standard surveys or reference works, beyond basic background on NL processing, such as (Allen 1994, Pereira & Grosz 1994). But traditional NLP techniques cannot deal with intertwined discourse, with the problem-solving and plan reasoning aspects, or with multiple modalities of communication. So the area naturally remains eclectic, drawing not only on the NLP literature but also on the literature concerning the intentions and plans that drive communication (e.g., Cohen et al. 1990), knowledge representation, reasoning and planning (e.g., Genesereth & Nilsson 1987, Allen et al. 1990), and miscellaneous work on speech processing, psycholinguistics (e.g, Clifton et al. 1994), formal semantics, discourse processing, multimodal communication, hybrid reasoning, and architectures for intelligent systems.

AREA REFERENCES

J.F. Allen, L.K. Schubert, G. Ferguson, P. Heeman, C.H. Hwang, T. Kato, M. Light, N. Martin, B. Miller, M. Poesio, and D. Traum, "The TRAINS project: a case study in building a conversational planning agent", J. of Expt. and Theor. Artif. Intell. 7, 1995, pp.7-48.

J.F. Allen, Natural Language Understanding (2nd ed.), Benjamin Cummings, Menlo Park, CA, 1994.

J.F. Allen, J. Hendler, and A. Tate, (eds.), Readings in Planning, Morgan Kaufmann, San Mateo, CA, 1990.

C. Clifton, L. Frazier, and K. Rayner, (eds.), Perspectives on Sentence Processing, Erlbaum, Hillsboro, NJ, 1994.

P. Cohen, J. Morgan, and M. Pollack, (eds.), Intentions in Communication, MIT Press, Cambridge, MA, 1990.

M. Genesereth and N. Nilsson, Logical Foundations of Artificial Intelligence, Morgan Kaufmann, San Mareo, CA, 1987.

F. Pereira and B. Grosz, (eds.), Natural Language Processing, MIT Press, Cambridge, MA, 1994.

RELATED PROGRAM AREAS

Other Communication Modalities, Adaptive Human Interfaces.