The reliability of modern computing control systems in a heterogeneous distributed computing environment, together with their efficiency, survivability, security, and control efficiency, is an important component of their quality. Increasingly, these systems fall into the "critical" category and have a decisive impact on the activities of the organizations and enterprises in which they operate. The loss of such a system, even for a short time, leads to serious problems: lost income, unforeseen costs, downtime of production and personnel, lost time, and sometimes man-made disasters. It is well known that the reliability of control systems depends most strongly on the reliability and fault tolerance of their software and hardware. Improving the reliability of the software part of these systems is therefore the most urgent task. Significant results have already been obtained in evaluating and forecasting the reliability indicators of elements and typical software packages at the design stage; a large number of methods, algorithms, and programs are known; and a number of normative documents on project reliability assessment have been developed. However, the task of real-time reliability assessment, which requires accurate and prompt accounting of a number of factors, has not been sufficiently solved. To address the problem of a multi-agent approach to computing control in a heterogeneous distributed computing environment, the following methods were used: systems analysis and set theory, to develop models of task distribution, tasks, and computing resources; general systems theory, to study and develop methods of task distribution; and logic theory, to model computational processes. The article considers a multi-agent approach to computing control in a heterogeneous distributed computing environment.
The algorithm is based on the use of economic mechanisms to regulate the supply and demand of resources in the computing environment. The architecture of the multi-agent approach and the functions of the agents are described. Particular attention is paid to calculating the reliability of the task plan based on the logical-probabilistic method.
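As an illustration of the logical-probabilistic idea mentioned above, the following is a minimal sketch and not the article's actual algorithm. It assumes that a task plan maps each task to one or more replica resources, that a task succeeds if at least one of its resources is up, that the plan succeeds only if every task succeeds, and that resources fail independently. The names `plan_reliability`, `plan`, `p_up`, and the node labels are hypothetical. The Boolean success function is evaluated over every combination of resource states, which is exact but practical only for a small number of resources.

```python
from itertools import product


def plan_reliability(plan, p_up):
    """Probability that every task in the plan has at least one working resource.

    plan: dict mapping a task name to a list of resource names (replicas).
    p_up: dict mapping a resource name to the probability that it is working.
    Assumption: resources fail independently of one another.
    """
    resources = sorted({r for replicas in plan.values() for r in replicas})
    total = 0.0
    # Enumerate every up/down state of the resources used by the plan.
    for states in product((True, False), repeat=len(resources)):
        up = dict(zip(resources, states))
        # Boolean success function: AND over tasks of (OR over replicas).
        if all(any(up[r] for r in replicas) for replicas in plan.values()):
            # Probability of this particular state under independence.
            weight = 1.0
            for r in resources:
                weight *= p_up[r] if up[r] else 1.0 - p_up[r]
            total += weight
    return total


# Hypothetical plan: task t1 is replicated on two nodes, t2 and t3 are not.
plan = {"t1": ["node_a", "node_b"], "t2": ["node_b"], "t3": ["node_c"]}
p_up = {"node_a": 0.9, "node_b": 0.95, "node_c": 0.99}
reliability = plan_reliability(plan, p_up)
```

In this example `node_b` appears in two tasks, so the success function reduces to "node_b and node_c are up" and the result is 0.95 × 0.99 ≈ 0.9405. Enumerating states handles such shared resources correctly, whereas simply multiplying per-task probabilities would count `node_b` twice.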