Postscript Version

Language Learning and its Applications

Jerome Feldman
George Lakoff
International Computer Science Institute

CONTACT INFORMATION

1947 Center St.
Berkeley, CA 94704
Phone: (510) 643-9153
Fax :    (510) 643-7684
Email: jfeldman@icsi.berkeley.edu

WWW PAGE

NTL Web Page - http://www.icsi.berkeley.edu/LZERO

PROGRAM AREA

Speech and Natural Language Understanding.

KEYWORDS

Learning, semantics, structured, connectionist, neural, metaphor, language.

PROJECT SUMMARY

The question of how human beings learn natural languages is one of abiding scientific interest, and computational models have begun to play a fruitful role in understanding how language might be acquired. Computational studies of grammar and language learning have also resulted in a wide range of applications. Both of the PIs (JF and GL) have worked on the problem for some time. The collaboration arises from two articulating insights. GL has been involved for 20 years in the study of how human conceptual systems are structured in terms of the details of the functioning of our bodies and brains and how language reflects that bodily structure (Feldman 89, Lakoff 87). Over roughly the same period, JF came to believe that structured connectionism provides the only known computational formalism adequate for the fine-grained modeling of human intelligence (Shastri 93). The joint effort began in 1988 when JF came to UC Berkeley and ICSI and with GL founded the L-zero project. The group seeks to develop structured connectionist models that can learn and use both natural conceptual systems and the languages that express them. A current overview of our efforts found on the NTL web page.

After some preliminary explorations, we were able to formulate a version of the language acquisition problem that was small enough to be tractable, but seemed to address most of the important issues. We repeat the initial manifesto below. Pursuing this path, our early efforts were quite productive and well received, but there are a number of limitations that we found no way to surmount. Over the last year, we have developed an extended version of the task specification and made significant changes in our representation and learning methods. For comparison, the 1990 challenge (Feldman et al 1990) was:

The system is given examples of pictures paired with true statements about those pictures in an arbitrary natural language.
The system is to learn the relevant portion of the language well enough so that given a novel sentence of that language, it can determine whether or not the sentence is true of the accompanying picture.
The task was extraordinarily difficult, since the conceptual systems for spatial relations concepts differ markedly from language to language. The problem was solved by making use of results from cognitive linguistics, neuroscience, and psychophysics in the design of a structured connectionist acquisition model that worked for a wide range of languages. As cognitive science, the result is a theory of how spatial relations concepts arise from the structure of the visual system. The result from the perspective of connectionist computation is that structured connectionism can be employed in learning a very important segment of human conceptual systems and the language that expresses them. A detailed description of the system can be found in Terry Regier's recently published book entitled "The Human Semantic Potential" from MIT Press, 1996.

Although both the grammar learning and concept learning projects were quite successful in their own terms, each exhibits limitations and this has led us to reformulate the original problem (Feldman et al 1996). In requiring only recognition, the original task fell far short of the human capabilities, particularly in concept acquisition and use. Much of our proposed work involves reformulating the language learning task and developing representations adequate for the extended problem.

Of the many shortcomings of our concept learning system we believe that the most important are three: invertibility, inference, and abstract concepts. The connectionst networks used in Regier's work were feedforward networks trained by back-propagation. This is fine for most applied tasks, but is inadequate as a model of human concept or word learning or inference. For example, feedforward networks have no structure that could produce an example of a concept that it recognizes perfectly. Its inferential inadequacies arise in similar ways. The network has no way to infer that A above B and B above C suggest that A is above C. After exploring a wide range of modifications to the back-prop structure, we conclude that a radically different representation was called for.

The problem of learning words for higher level and abstract concepts is not, like invertibility and inference, primarily computational. The very idea of learning labels for direct bodily experience breaks down for concepts like ``sell'', and even more so for ``inflation'', etc. GL (along with many others) has been working for decades on how such abstract target concepts map to more concrete source domains (Lakoff 1987). Such ``metaphoric'' mappings allow the inference structures of concrete source domains to be used to reason about abstract target domains. In fact, one of the main attractions of spatial relations as the subject of our first effort was that space is known to be the source domain for a wide range of metaphorical mappings. A critical part of our next phase of research is to explicitly model how the mapping of abstract to more concrete source domains plays a key role in language understanding.

The three requirements of invertibility, inference, and abstract concepts have dominated our investigations of task domains and problem formulation for the next round of research. In the rest of this section we describe our plans and their current state of development. As in the first phase, the overall task has been restricted and then divided into projects of approximately doctoral thesis size.

One new project extends our results to the motor system by modeling the acquistion of verbs of hand action in a wide variety of languages. This requires the introduction of what we call ``executing schemas'' - or x-schemas - that can not only represent the structure of the action in terms of the body-model, but can actually be used to control the performance of such an action. The executing schemas that control actions are interfaced with linking feature-structures that correlate distinctive primitives features of hand-action systems with (1) the schemas that execute those features and (2) the linguistic system that encodes those features in natural languages.

A further project extends the paradigm to abstract concepts. It will study short discourses from the domain of international economics, where metaphors based on bodily actions are commonly used in discussing economics. An example would be ``France fell into a recession. Germany pulled it out.'' In the domain of bodily actions, executable schemas for body action are used to conceptualize, reason about, and understand sentences about bodies. The theory of conceptual metaphor, studied in great detail by GL and others (Lakoff and Johnson 1980), shows how abstract domains such as economics are both thought about and talked about in terms of concrete domains such as bodily action. By employing the same computational structures in both the concrete and the abstract task we hope to gain further insights into the complex relationship among the body, concrete and abstract concepts, and the learning of natural languages.

Research Plan

The focus of our research over the next two years will be on refining and testing the x-schema/f-struct representation and its use in modeling lexical acquisition and metaphor. As always, this will combine the construction of performance systems with theoretical work in connectionist modeling and cognitive linguistics. For the initial period, we will concentrate on two ambitious demonstrations, each of which requires the completion of a number of subtasks.

Acquisition and use of motion terms

The general outline of this project and its goals were presented above, we now present a more detailed plan of how we are proceeding.
Problem Formulation
We have produced a classification of English verbs of hand action into ten categories of increasing difficulty. Our initial effort will focus on the first three of these: verbs of possession (get, place, ...), verbs of translation(slide, throw, ...) and rigid-object manipulation (tap, rotate, ...). The next two categories involve deformable objects and simple tools and we hope to tackle these as well.
Cross Linguistic Sampling
As in other semantic domains, there is considerable variation among the world's languages in the representation of hand actions. This is a continuing effort used both in initial design and in later testing of the system. Besides the European languages, we have gathered data on Arabic, Farsi, Japanese, Korean, Mandarin, and Tamil.
Representation and Model Design
This is also an ongoing effort and the current state has been described in detail above. We already know of problems that will require extensions to our current formalism and will work these out in parallel with the continued testing of the current design on the current tasks. As mentioned, it is important that this subtask is common to the various applications.
Environment
There is a significant technical challenge in constructing an environment for studying the acquisition of verbs of motion. We need a robot or simulated robot to carry out the motions so they can be labelled by native speakers of various languages. We have decided to use the Jack simulator from U. Penn (Badler et al 1993) and have made considerable progress on getting it to run in our environment and to communicate with the learning code.
Program Construction and Testing
Even after all the ideas are understood, it is a difficult task to build a system that will appropriately interface with the robot simulation, the user and the learning code. Because x-schemas are asynchronous active representations, there are additional technical problems in their implementation. We are using Sather for the implementation.
Training, Test and Evaluation
When a version of the system is complete, we need to run it through the test cycle. Native speakers of various languages label the robot action sequence. We replay the labelled robot motions and have the learning system develop word meanings. The system is then tested and, usually, changes are made to correct errors in design or coding.

Metaphoric mapping in story interpretation

The general problem of interpreting news stories or other real language is clearly beyond the scope of our efforts. We believe we have isolated an important but tractable sub-problem and are working on that in the broader context of our work on conceptual metaphor, connectionist modeling and language learning.
Problem Formulation
The central idea is to demonstrate how inferences from concrete source domains aid in the interpretation of news stories about abstract target domains, here involving international economics and politics. The performance criterion will be that the system deal correctly with novel stories within its domain. Given a pre-parsed story, it should be able to infer the bindings for appropriate unspecified features, using a combination of target and source domain inference.
Data Gathering
We have examined hundreds of stories from the wire services and on-line periodicals in order to better understand what constructions to model. The story interpretation system is currently not adaptive so it is important to have sound intuitions.
Representation and Model Design
As mentioned above, the same basic x-schema/f-struct formalism is used in all current work. In addition, we are employing a form of belief nets (Pearl 1988) for target domain inference.
Aspect
It turns out that linguistic aspect plays a key role in this domain and we have been developing an active model of aspect, both for its own sake and to aid in the inference project.
Program Building
Again for this task domain, we will need to construct and test a running program. Besides the theoretical problems, the main technical issues involve combining active x-schemas, source-target mappings and belief propagation.
Test and Evaluation
Since there is a continuous stream of news stories, we anticipate no difficulty finding novel inputs for testing the system. There will be the standard problem of evaluating a system that only solves part of a larger problem, but we believe that the criterion of inferring bindings is sharp enough for the quality of the results to be determined.

References

Badler, N., Phillips, C. and Webber, B.: Simulating Humans , Oxford University Press, NY, 1993.

Feldman, J.A.: ``Neural Representation Of Conceptual Knowledge,'' Neural Connections, Mental Computation, eds. Lynn Nadel and others, MIT Press, Cambridge, MA, pgs. 68-103, 1989.

Feldman, J., et. al. Miniature Language Acquisition: A Touchstone for Cognitive Science. Proceedings of the 12th Annual Conference of the Cognitive Science Society, 686-693. Cambridge, Mass. MIT Press.

Lakoff, G. and Johnson, M.: Metaphors We Live By, University of Chicago Press, 1980.

Lakoff, G. Women, Fire, And Dangerous Things, U. Chicago Press. 1987

Pearl, J.: Probabilistic Reasoning in Intelligent Systems, Morgan Kaufmann, CA, 1988.

Regier, Terry (1996). The Human Semantic Potential: Spatial Language and Constrained Connectionism, Cambridge, MA: MIT Press.

Shastri, L. and V. Ajjanagadde.  From simple associations to systematic reasoning, Behavioral and Brain Sciences Vol. 16, No. 3, 417--494, 1993.

PROGRESS

Since this project officially began in June 1997, there are no new results to report. The project web page , NTL , will always be the best source of the current status.

PROJECT REFERENCES

Feldman, J., et. al. Miniature Language Acquisition: A Touchstone for Cognitive Science. Proceedings of the 12th Annual Conference of the Cognitive Science Society, 686-693. Cambridge, Mass. MIT Press.

For an overview of the NTL project, see "Lzero: The First Five Years" Artificial Intelligence Review, v10 pp103-129, April 1996.

Regier, Terry (1996). The Human Semantic Potential: Spatial Language and Constrained Connectionism, Cambridge, MA: MIT Press.

A short paper describing our model of acquisition of hand-action verbs postscript version from Proceedings of the Nineteenth Annual Meeting of the Cognitive Science Society COGSCI-97

A short paper describing our motor control model of Verbal Aspect postscript version from Proceedings of the Nineteenth Annual Meeting of the Cognitive Science Society COGSCI-97

AREA BACKGROUND

Our project is inherently interdisciplinary, involving structured connectionist modeling and cognitive lingiustics. Cognitive linguistics has a bienneial conference of the International Cognitive Linguistics Association and the main journal is Cognitive Linguistics .  The structured approach to connectionist modeling (aka neural networks) is not organized as a separate discipline with papers appearing in a wide range of conferences and journals; the annual Cognitive Science Conference always contains several contributions.

AREA REFERENCES

Ballard,  Dana. Natural Computation . MIT Press. 1997

Lakoff, George. Women, Fire, And Dangerous Things. U. Chicago Press. 1987

Fauconnier, Gilles.  Mappings in Thought & Language. Cambridge  University Press. 1997.

Shastri , L. and V. Ajjanagadde.  From simple associations to systematic reasoning, Behavioral and Brain Sciences Vol. 16, No. 3, 417--494, 1993.

RELATED PROGRAM AREAS

3 .Other Communication Modalities

4. Adaptive Human Interfaces

5. Usability and User-Centered Design

POTENTIAL RELATED PROJECTS

We have ongoing collaboration with the other ICSI groups including Fillmore's Framenet project and a speech project ICSI Speech Research soon to join ISP. We are also working with Dan Jurafsky at Boulder. The work of Lynn Stein and the Cog project at MIT is similar in spirit and the two groups have maintained good contacts.  We believe that some of the more applied efforts could benefit from our work on active semantics, metaphor, etc. and look forward to exploring these possibilities.