Logic Programming System Research Articles

This paper is concerned with problems that arise when submitting large quantities of data to analysis by an Inductive Logic Programming (ILP) system. Complexity arguments usually make it prohibitive to analyse such datasets in their entirety. We examine two schemes that allow an ILP system to construct theories by sampling from this large pool of data. The first, “subsampling”, is a single-sample design in which the utility of a potential rule is evaluated on a randomly selected sub-sample of the data. The second, “logical windowing”, is multiple-sample design that tests and sequentially includes errors made by a partially correct theory. Both schemes are derived from techniques developed to enable propositional learning methods (like decision trees) to cope with large datasets. The ILP system CProgol, equipped with each of these methods, is used to construct theories for two datasets—one artificial (a chess endgame) and the other naturally occurring (a language tagging problem). In each case, we ask the following questions of CProgol equipped with sampling: (1) Is its theory comparable in predictive accuracy to that obtained if all the data were used (that is, no sampling was employed)?s and (2) Is its theory constructed in less time than the one obtained with all the data? For the problems considered, the answers to these questions is “yes”. This suggests that an ILP program equipped with an appropriate sampling method could begin to address problems satisfactorily that have hitherto been inaccessible simply due to data extent.

When comparing inductive logic programming (ILP) and attribute-value learning techniques, there is a trade-off between expressive power and efficiency. Inductive logic programming techniques are typically more expressive but also less efficient. Therefore, the data sets handled by current inductive logic programming systems are small according to general standards within the data mining community. The main source of inefficiency lies in the assumption that several examples may be related to each other, so they cannot be handled independently. Within the learning from interpretations framework for inductive logic programming this assumption is unnecessary, which allows to scale up existing ILP algorithms. In this paper we explain this learning setting in the context of relational databases. We relate the setting to propositional data mining and to the classical ILP setting, and show that learning from interpretations corresponds to learning from multiple relations and thus extends the expressiveness of propositional learning, while maintaining its efficiency to a large extent (which is not the case in the classical ILP setting). As a case study, we present two alternative implementations of the ILP system TILDE (Top-down Induction of Logical DEcision trees): TILDEclassic, which loads all data in main memory, and TILDELDS, which loads the examples one by one. We experimentally compare the implementations, showing TILDELDS can handle large data sets (in the order of 100,000 examples or 100 MB) and indeed scales up linearly in the number of examples.

Logic Programming System Research Articles

Related Topics

Articles published on Logic Programming System

A Study of Two Sampling Methods for Analyzing Large Datasets with ILP

Return value placement and tail call optimization in high level languages

Tabling for non-monotonic programming

Scaling Up Inductive Logic Programming by Learning from Interpretations

Memory management for Prolog with tabling

CLP( χ) for automatically proving program properties

Exploiting and-or parallelism in Prolog: The OASys computational model and abstract architecture

Advantages of decision lists and implicit negatives in Inductive Logic Programming

Parallel execution of Prolog with granularity control

Knowledge base for finite-element mesh design learned by inductive logic programming

Modelling Discrete Optimisation Problems in Constraint Logic Programming

A new term representation method for prolog

Pharmacophore Discovery Using the Inductive Logic Programming System PROGOL

Towards a closer integration of finite domainpropagation and simplex-based algorithms

A parallel prolog system for distributed memory

Tools for mapping, load balancing and monitoring in the LOGFLOW parallel Prolog project

A machine learning approach for acquiring descriptive classification rules of shape contours

Cuts and side-effects in distributed memory OR-parallel prolog

Implementation of tabled evaluation with delaying in Prolog

Algorithmic Debugging and Hypothetical Reasoning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Logic Programming System Research Articles

Related Topics

Articles published on Logic Programming System

A Study of Two Sampling Methods for Analyzing Large Datasets with ILP

Return value placement and tail call optimization in high level languages

Tabling for non-monotonic programming

Scaling Up Inductive Logic Programming by Learning from Interpretations

Memory management for Prolog with tabling

CLP( χ) for automatically proving program properties

Exploiting and-or parallelism in Prolog: The OASys computational model and abstract architecture

Advantages of decision lists and implicit negatives in Inductive Logic Programming

Parallel execution of Prolog with granularity control

Knowledge base for finite-element mesh design learned by inductive logic programming

Modelling Discrete Optimisation Problems in Constraint Logic Programming

A new term representation method for prolog

Pharmacophore Discovery Using the Inductive Logic Programming System PROGOL

Towards a closer integration of finite domainpropagation and simplex-based algorithms

A parallel prolog system for distributed memory

Tools for mapping, load balancing and monitoring in the LOGFLOW parallel Prolog project

A machine learning approach for acquiring descriptive classification rules of shape contours

Cuts and side-effects in distributed memory OR-parallel prolog

Implementation of tabled evaluation with delaying in Prolog

Algorithmic Debugging and Hypothetical Reasoning