next up previous contents index
Next: 3.5 Semantics Up: 3 Language Analysis and Previous: 3.3 Grammar Formalisms

3.4 Lexicons for Constraint-Based Grammars

Antonio Sanfilippo
Sharp Industries of Europe, Oxford, UK

The intelligent processing of natural language for real world applications requires lexicons which provide rich information about morphological, syntactic and semantic properties of words, are well structured and can be efficiently implemented [Bri92]. These objectives can be achieved by developing tools which facilitate the acquisition of lexical information from machine readable dictionaries and text corpora, as well as database technologies and theories of word knowledge offering an encoding of the information acquired which is desirable for NLP purposes. In the last decade, there has been a growing tendency to use unification-based grammar formalisms [Kay79,KB82,PS87,PS94,ZKC87]

to carry out the task of building such lexicons. These grammar formalisms encode lexical descriptions as feature structures, with inheritance and unification as the two basic operations relating these structures to one another. The use of inheritance and unification is appealing from both engineering and linguistic points of view as these operations can be formalized in terms of lattice-theoretic notions [Car92b] which are amenable to efficient implementation and are suitable to express the hierarchical nature of lexical structure. Likewise, feature structures have a clear mathematical and computational interpretation and provide an ideal data structure to encode complex word knowledge information.

Informally, a feature structure is a set of attribute-value pairs, where values can be atomic or feature structures themselves, providing a partial specification of words, affixes and phrases. Inheritance makes it possible to arrange feature structures into a subsumption hierarchy so that information which is repeated across sets of word entries needs only specifying once [Fli87,PS87,San93]. For example, properties which are common to all verbs (e.g., part of speech, presence of a subject) or subsets of the verb class (presence of a direct object for verbs such as amuse and put; presence of an indirect object for verbs such as go and put) can be defined as templates. Unification provides the means for integrating inherent and inherited specifications of feature structure descriptions.

In general, unification is monotonic: all information, whether inherently specified or inherited, is preserved. Consequently, a valid lexical entry can never contain conflicting values. Unification thus provides a way to perform a consistency check on lexical descriptions. For example, the danger of inadvertently assigning distinct orthographies or parts of speech to the same word entry is easily avoided as the unification of incompatible information leads to failure. An even more stringent regime of grammar checking has recently been made available through the introduction of typed feature structures [Car92b]. Through typing, feature structures can be arranged into a closed hierarchy so that two feature structures unify only if their types have a common subtype. Typing is also used to specify exactly which attributes are appropriate for a given feature structure so that arbitrary extensions of feature structures are easily eschewed.

A relaxation of monotonicity, however, is sometimes useful in order to capture regularities across the lexicon. For example, most irregular verbs in English follow the same inflectional patterns as regular verbs with respect to present and gerundive forms, while differing in the simple past and/or past participle. It would therefore be convenient to state that all verbs inherit the same regular morphological paradigm by default and then let the idiosyncratic specifications of irregular verbs override inherited information which is incompatible.

Default inheritance in the lexicon is desirable to achieve compactness and simplicity in expressing generalizations about various aspects of word knowledge [Fli87,Gaz87], but it can be problematic if used in an unconstrained manner. For example, it is well known that multiple default inheritance can lead to situations which can only be solved ad hoc or nondeterministically when conflicting values are inherited from the parent nodes [THT87]. Although a number of proposals have been made to solve these problems, a general solution is still not available so that the use of default inheritance must be tailored to specific applications.

Another difficult task in lexicon implementation, perhaps the most important with regard to grammar processing, concerns the treatment of lexical ambiguity. Lexical ambiguity can be largely related to our ability to generate appropriate uses of words in context by manipulation of semantic and/or syntactic properties of words. For example, accord is synonymous with either agree or give/grant depending on its valency, move can also be interpreted as a psychological predicate when used transitively with a sentient direct object, and enjoy can take either a noun or verb phrase complement when used in the experience sense:

Although the precise mechanisms which govern lexical knowledge are still largely unknown, there is strong evidence that word sense extensibility is not arbitrary [AL92,Pus91,Pus94,OA92]. For example, the amenability of a verb such as move to yield either a movement or psychological interpretation can be generalized to most predicates of caused motion (e.g., agitate, crash, cross, lift, strike, sweep, unwind). Moreover, the metonymical and metaphoric processes which are responsible for polysemy appear to be subject to crosslinguistic variation. For example, the ``meat vs. animal'' alternation that is found in English---viz.\ feed the lamb vs. eat lamb---is absent in Eskimo [NZ92] and is less productive in Dutch where nominal compounding is often used instead, e.g., lam vs. lamvlees.

Examples of this sort show that our ability to extend word use in context is often systematic or conventionalized. Traditional approaches to lexical representation assume that word use extensibility can be modeled by exhaustively describing the meaning of a word through closed enumeration of its senses. Word sense enumeration provides highly specialized lexical entries, but

Consequently, several attempts have been made to develop a more dynamic approach to lexical specification which provides a principled treatment of polysemy and can be used to model creative aspects of word use. For example, [Pus91,Pus94] and [PB93] propose an integrated multilayered representation of word meaning which incorporates salient aspects of world knowledge, e.g., purpose, origin, form and constituency properties are specified for object-denoting nominals. This makes it possible to conflate different uses of the same word into a single meta-entry which can be extended to achieve contextual congruity using lexical rules [CB92]. Equivalent results can be obtained using abductive reasoning to generate different word senses from polysemic lexical representations [HSAM93]. The use of lexical rules or abductive reasoning provide a principled alternative to word sense enumeration in the treatment of polysemy and can be made to cater for novel uses of words. However, it is not clear whether these practices can address the question of lexical ambiguity efficiently as there is no known general control regime on lexical rules or abductive reasoning which would deterministically restricts polysemic expansion without preempting the generation of possible word uses. A promising alternative is to use contextual information to guide sense extension. For example [SBD94,San95] propose that polysemy be expressed as lexical polymorphism within a Typed Feature Structure formalism by assigning to an ambiguous word entry a lexical type with subtype extensions describing all admissible uses of the word. Lexical ambiguities can then be solved deterministically by using syntactic and semantic contextual information during language processing to ground underspecified word entries.



next up previous contents
Next: 3.5 Semantics Up: 3 Language Analysis and Previous: 3.3 Grammar Formalisms