Abstract

We describe here a comprehensive framework for intelligent information management (IIM) of data collection and decision-making actions for reliable and robust event processing and recognition. This is driven by algorithmic information theory (AIT), in general, and algorithmic randomness and Kolmogorov complexity (KC), in particular. The processing and recognition tasks addressed include data discrimination and multilayer open set data categorization, change detection, data aggregation, clustering and data segmentation, data selection and link analysis, data cleaning and data revision, and prediction and identification of critical states. The unifying theme throughout the paper is that of “compression entails comprehension”, which is realized using the interrelated concepts of randomness vs. regularity and Kolmogorov complexity. The constructive and all encompassing active learning (AL) methodology, which mediates and supports the above theme, is context-driven and takes advantage of statistical learning, in general, and semi-supervised learning and transduction, in particular. Active learning employs explore and exploit actions characteristic of closed-loop control for evidence accumulation in order to revise its prediction models and to reduce uncertainty. The set-based similarity scores, driven by algorithmic randomness and Kolmogorov complexity, employ strangeness / typicality and p-values. We propose the application of the IIM framework to critical states prediction for complex physical systems; in particular, the prediction of cyclone genesis and intensification.

Highlights

  • Information loaded with meaning and in context is an asset and is referred throughout this paper as evidence

  • We propose the application of the information management (IIM) framework to critical states prediction for complex physical systems; in particular, the prediction of cyclone genesis and intensification

  • Additional indirect similarity scores suitable for evidence-based management (EBM) and DKA, which are driven by Kolmogorov complexity, are introduced throughout the paper based on the specific functionalities addressed

Read more

Summary

Introduction

Information loaded with meaning and in context is an asset and is referred throughout this paper as evidence. The MS report highlights that an immediate and important challenge is that of end-to-end scientific data management, from data acquisition and data integration, to data treatment, provenance, and persistence” and including “the acquisition of a set of widely applicable complex problem solving capabilities, based on the use of a generic computational environment.” This has been advocated by the Computing Community Consortium http://cra.org/ccc/do cs/init/From_Data_to_Knowledge_to_Action.pdf to enable 21st century discovery in science and engineering. Active learning [11] is first and foremost about the choices made during data collection It employs explore and exploit actions characteristic of closed-loop control for evidence accumulation in order to reduce uncertainty and to revise the prediction models. Strangeness/typicality and p-values; transduction and semi-supervised learning; and multi-layer and multi-set open set categorization are discussed in Sections 8 - Active learning (see Section 3) using the explore and exploit paradigm motivates and supports functionalities related to data collection and evidence accumulation. New dimensions and requirements on EBM and DKA, e.g., for life sciences, are discussed in Section The paper concludes in Section with a brief summary and venues for future research

Evidence-Based Management
Active Learning
Algorithmic Information Theory and Closed-Loop Control
Discriminative Methods and Practical
Kolmogorov Complexity
Algorithmic Randomness
Strangeness and P-Values
Transduction
10. Multi-Layer and Multi-Set Open Set Data Categorization
11. Data Collection and Evidence Accumulation
12. Change Detection Using Martingale
13. Data Aggregation
14. Data Selection and Link Analysis
15. Data Cleaning and Data Revision
16. Criticality Identification and Prediction
18. Conclusions
19. References
H H1U UU
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call