Abstract

This paper is concerned with the problem of finding a hypothesis in TP 2 consistent with given positive and negative examples. The hypothesis class TP 2 consists of all the sets of at most two tree patterns and represents the class of unions of at most two tree pattern languages. Especially, we consider the problem from the point of view of the consistency problem for TP 2. The consistency problem is a problem to decide whether there exists a consistent hypothesis with given positive and negative data within some fixed hypothesis space. Efficient solvability of that problem is closely related to the possibility of efficient machine learning or machine discovery. Unfortunately, however, the consistency problem is known to be NP-complete for many hypothesis spaces, including the class TP 2. In order to overcome this computational hardness, in this paper, we try to use additional information obtained by making queries. First, we give an algorithm that, using restricted subset queries, solves the consistency problem for TP 2 in time polynomial in the total size of given positive and negative examples. Next, we show that each subset query made by the algorithm can be replaced by several membership queries under some condition on a set of function symbols. As a result, we have that the consistency problem for TP 2 is solved in polynomial time using membership queries.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call