Learning in the presence of inaccurate information

Mark Fulk,Sanjay Jain

doi:10.1016/0304-3975(95)00135-2

Mark Fulk, Sanjay Jain

Open Access

https://doi.org/10.1016/0304-3975(95)00135-2

Copy DOI

Journal: Theoretical Computer Science	Publication Date: Jul 1, 1996
Citations: 9	License type: elsevier-specific: oa user license

Affiliation: National University of Singapore

Abstract

The present paper considers the effects of introducing inaccuracies in a learner's environment in Gold's learning model of identification in the limit. Three kinds of inaccuracies are considered: presence of spurious data is modeled as learning from a noisy environment, missing data is modeled as learning from incomplete environment, and the presence of a mixture of both spurious and missing data is modeled as learning from imperfect environment. Two learning domains are considered, namely, identification of programs from graphs of computable functions and identification of grammars from positive data about recursively enumerable languages. Many hierarchies and tradeoffs resulting from the interplay between the number of errors allowed in the final hypotheses, the number of inaccuracies in the data, the types of inaccuracies, and the type of success criteria are derived. An interesting result is that in the context of function learning, incomplete data is strictly worse for learning than noisy data.

Full Text