This article introduces a new neural network architecture, called ARTMAP, that autonomously learns to classify arbitrarily many, arbitrarily ordered vectors into recognition categories based on predictive success. This supervised learning system is built up from a pair of Adaptive Resonance Theory modules (ART a and ART b ) that are capable of self-organizing stable recognition categories in response to arbitrary sequences of input patterns. During training trials, the ART a module receives a stream [ a ( p) ] of input patterns, and ART b receives a stream [ b ( p) ] of input patterns, where b ( p) is the correct prediction given a ( p) . These ART modules are linked by an associative learning network and an internal controller that ensures autonomous system operation in real time. During test trials, the remaining patterns a ( p) are presented without b ( p) , and their predictions at ART b are compared with b ( p) . Tested on a benchmark machine learning database in both on-line and off-line simulations, the ARTMAP system learns orders of magnitude more quickly, efficiently, and accurately than alternative algorithms, and achieves 100% accuracy after training on less than half the input patterns in the database. It achieves these properties by using an internal controller that conjointly maximizes predictive generalization and minimizes predictive error by linking predictive success to category size on a trial-by-trial basis, using only local operations. This computation increases the vigilance parameter ϱ a of ART a by the minimal amount needed to correct a predictive error at ART b . Parameter ϱ a calibrates the minimum confidence that ART a must have in a category, or hypothesis, activated by an input a ( p) in order for ART a to accept that category, rather than search for a better one through an automatically controlled process of hypothesis testing. Parameter ϱ a is compared with the degree of match between a ( p) and the top-down learned expectation, or prototype, that is read-out subsequent to activation of an ART a category. Search occurs if the degree of match is less than ϱ a . ARTMAP is hereby a type of self-organizing expert system that calibrates the selectivity of its hypotheses based upon predictive success. As a result, rare but important events can be quickly and sharply distinguished even if they are similar to frequent events with different consequences. Between input trials ϱ a relaxes to a baseline vigilance ϱ a . When ϱ a is large, the system runs in a conservative mode, wherein predictions are made only if the system is confident of the outcome. Very few false-alarm errors then occur at any stage of learning, yet the system reaches asymptote with no loss of speed. Because ARTMAP learning is self-stabilizing, it can continue learning one or more databases, without degrading its corpus of memories, until its full memory capacity is utilized.