Abstract

AbstractWe present OntoScene, a framework aimed at understanding the semantics of visual scenes starting from the semantics of their elements and the spatial relations holding between them. OntoScene exploits ontologies for representing knowledge and Prolog for specifying the interpretation rules that domain experts may adopt, and for implementing the SceneInterpreter engine. Ontologies allow the designer to formalize the domain in a reusable way and make the system modular and interoperable with existing multiagent systems, while Prolog provides a solid basis to define complex rules of interpretation in a way that can be affordable even for people with no background in Computational Logics. The domain selected for experimenting OntoScene is that of prehistoric rock art, which provides us with a fascinating and challenging testbed.

Highlights

  • Human perception of complex visual scenes has been studied for a long time in psychology and neuroscience (Kondo et al . 2017): according to the seminal work on “high-level scene perception” (Henderson and Hollingworth 1999), besides low-level or early vision, concerned with extraction of physical properties such as depth, color, and texture from an image (Marr 1982), and intermediate-level vision, concerned with extraction of shapeDownloaded from https://www.cambridge.org/core

  • To understand the semantics of a scene starting from the semantics of its elements and the relations holding among them we developed OntoScene, which exploits a powerful combination of ontologies and Prolog: ontologies are used for representing knowledge, and Prolog for specifying the rules that domain experts use to interpret visual scenes and for implementing the SceneInterpreter engine

  • The power of Prolog for specifying scene interpretation rules is properly exemplified by the rule in Section 6.3 that exploits the findall all-solutions predicate for collecting all the images interpreted as corniforms into one set, generates one partition of the set in a nondeterministic way, and tests whether this partition enjoys the definition of being a group

Read more

Summary

Introduction

Human perception of complex visual scenes has been studied for a long time in psychology and neuroscience (Kondo et al . 2017): according to the seminal work on “high-level scene perception” (Henderson and Hollingworth 1999), besides low-level or early vision, concerned with extraction of physical properties such as depth, color, and texture from an image (Marr 1982), and intermediate-level vision, concerned with extraction of shape. OntoScene: A Logic-based Scene Interpreter and spatial relations that can be determined without regard to meaning (Ullman 1996), a further level of vision is required to perceive and understand a scene: high-level vision concerns the mapping from visual representations to meaning and includes [...] the identification of objects and scenes. In their recent studies, Kveraga and Bar (2014) and Baldassano (2015) demonstrate that the brain has regions related to higher-order properties like overall geometry, interactions between objects, esthetic beauty, or memorability of a scene. As discussed by Mascardi et al . (2014), the three key services offered by IndianaMAS (sketch recognition, image recognition, and multilingual access to digital libraries) are provided by systems that may be MASs themselves and that are seen as black boxes by the IndianaMAS agents

Methods
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.