Abstract

Research in vision and language has traditionally remained separate, in part because the classic task of generating a representation of a given image or sentence has led to an emphasis on low-level structural aspects of these media. In this paper we argue that image and language understanding should be approached with the intent of facilitating the performance of a task. Under this view, research in image and language understanding must confront common issues that arise as a task is pursued. Language and images are both inputs that can be used to maintain a model of a task. We argue that such a model may be maintained by incorporating changes in the scene that can be characterized at a high level of abstraction yet manifest themselves at relatively low levels of analysis. Existing task-relevant models and the associated domain knowledge are used to anticipate specific changes and disambiguate their interpretation, thereby allowing them to modify the existing model. From this perspective, understanding input is largely independent of the modality of the input.
