Agent Plans Research Articles

This paper introduces Dyna-PINN, a novel physics-informed Deep Dyna-Q (DDQ) reinforcement learning (RL) approach, designed to address the data-intensive training requirements and model-agnostic nature of the conventional model-free RL methods. The DDQ approach blends model-based and model-free elements to enhance both learning and decision-making processes. By utilizing a physics-informed neural network (PINN) based model, our method enriches the learning process with physical information, enhancing the agent's planning capabilities and leading to faster learning compared to conventional model-free RL methods like Deep Q-Network (DQN) in scenarios with low-diversity training data availability. Our results demonstrate that Dyna-PINN has 50% greater sample efficiency than DQN and outperforms rule-based control in terms of thermal discomfort. Due to physics incorporation, the Dyna-PINN implements a more logical and interpretable control policy. It shows consistently good performance compared to all control variants across low-diversity data scenarios, i.e., 6 weeks of building data, and in higher-diversity data regimes, i.e., 6 months of building energy data, demonstrating the value of physics incorporation into the RL training. Additionally, we present two other DDQ-based techniques, RC-DDQ and NN-DDQ, exploring the synergy between neural networks and physical data in intelligent control designs for building energy systems. Rigorous controller testing is performed using the Building Optimization and Testing Framework (BOPTEST), a high-fidelity simulator that closely represents a real building's operation. Through comprehensive comparisons and realistic simulations, our study underscores the effectiveness of incorporating physics-informed approaches into RL-based control strategies, paving the way for more efficient and robust building energy management systems.

We study the problem of self-interested planning under uncertainty in settings shared with more than a thousand other agents, each of which plans at its own individual level. We refer to such large numbers of agents as an agent population. The decision-theoretic formalism of interactive partially observable Markov decision process (I-POMDP) is used to model the agent's self-interested planning. The first contribution of this article is a method for drastically scaling the finitely-nested I-POMDP to certain agent populations for the first time. Our method exploits two types of structure that is often exhibited by agent populations -- anonymity and context-specific independence. We present a variant called the many-agent I-POMDP that models both these types of structure to plan efficiently under uncertainty in multiagent settings. In particular, the complexity of the belief update and solution in the many-agent I-POMDP is polynomial in the number of agents compared with the exponential growth that challenges the original framework. While exploiting structure helps mitigate the curse of many agents, the well-known curse of history that afflicts I-POMDPs continues to challenge scalability in terms of the planning horizon. The second contribution of this article is an application of the branch-and-bound scheme to reduce the exponential growth of the search tree for look ahead. For this, we introduce new fast-computing upper and lower bounds for the exact value function of the many-agent I-POMDP. This speeds up the look-ahead computations without trading off optimality, and reduces both memory and run time complexity. The third contribution is a comprehensive empirical evaluation of the methods on three new problems domains -- policing large protests, controlling traffic congestion at a busy intersection, and improving the AI for the popular Clash of Clans multiplayer game. We demonstrate the feasibility of exact self-interested planning in these large problems, and that our methods for speeding up the planning are effective. Altogether, these contributions represent a principled and significant advance toward moving self-interested planning under uncertainty to real-world applications.

Agent Plans Research Articles

Related Topics

Articles published on Agent Plans

Dyna-PINN: Physics-informed deep dyna-q reinforcement learning for intelligent control of building heating system in low-diversity training data regimes

Online Reinforcement Learning-Based Pedagogical Planning for Narrative-Centered Learning Environments

ПІДТВЕРДЖУВАЛЬНЕ ВИКРИВЛЕННЯ В ПРОЦЕСАХ АРГУМЕНТАЦІЇ

Understanding Bird’s-Eye View of Road Semantics Using an Onboard Camera

Plans or Outcomes: How Do We Attribute Intelligence to Others?

A novel approach for multi-stakeholder agricultural land reallocation using agent-based modeling: A case study in Iran

Experimental evaluation of tasking and teaming design patterns for human delegation of unmanned vehicles

Situated Temporal Planning Using Deadline-aware Metareasoning

A Bi-Level Programming Model for Protecting an Important Node in a Network

Planning and Acting with Non-Deterministic Events: Navigating between Safe States

Dealing with Incompatibilities among Procedural Goals under Uncertainty

A New Multi-Layer Distributed Approach for a Multi-objective Planning Problem

Sequential plan recognition: An iterative approach to disambiguating between hypotheses

Decision-Theoretic Planning Under Anonymity in Agent Populations

Plan Recognition Design

A Multi-Agent Based Simulation Model for Rail–Rail Transshipment: An Engineering Approach for Gantry Crane Scheduling

Resilience through Learning in Multi-Agent Cyber-Physical Systems

Land-use pattern scenario analysis using planner agents

A Strategy for PMU Placement Considering the Resiliency of Measurement System

Incorporating Bayesian learning in agent-based simulation of stakeholders’ negotiation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Agent Plans Research Articles

Related Topics

Articles published on Agent Plans

Dyna-PINN: Physics-informed deep dyna-q reinforcement learning for intelligent control of building heating system in low-diversity training data regimes

Online Reinforcement Learning-Based Pedagogical Planning for Narrative-Centered Learning Environments

ПІДТВЕРДЖУВАЛЬНЕ ВИКРИВЛЕННЯ В ПРОЦЕСАХ АРГУМЕНТАЦІЇ

Understanding Bird’s-Eye View of Road Semantics Using an Onboard Camera

Plans or Outcomes: How Do We Attribute Intelligence to Others?

A novel approach for multi-stakeholder agricultural land reallocation using agent-based modeling: A case study in Iran

Experimental evaluation of tasking and teaming design patterns for human delegation of unmanned vehicles

Situated Temporal Planning Using Deadline-aware Metareasoning

A Bi-Level Programming Model for Protecting an Important Node in a Network

Planning and Acting with Non-Deterministic Events: Navigating between Safe States

Dealing with Incompatibilities among Procedural Goals under Uncertainty

A New Multi-Layer Distributed Approach for a Multi-objective Planning Problem

Sequential plan recognition: An iterative approach to disambiguating between hypotheses

Decision-Theoretic Planning Under Anonymity in Agent Populations

Plan Recognition Design

A Multi-Agent Based Simulation Model for Rail–Rail Transshipment: An Engineering Approach for Gantry Crane Scheduling

Resilience through Learning in Multi-Agent Cyber-Physical Systems

Land-use pattern scenario analysis using planner agents

A Strategy for PMU Placement Considering the Resiliency of Measurement System

Incorporating Bayesian learning in agent-based simulation of stakeholders’ negotiation