Real-time Domain Research Articles

Abstract Multi-agent systems in complex, real time domains require agents to act effectively both autonomously and as part of a team. The complexity of many tasks arising in these domains makes them difficult to solve with pre-programmed agent behaviors. The agents must instead discover a solution on their own, using learning. In this paper, we present MLIMAS a framework for Machine Learning in Interactive Multi-Agent Systems. The MLIMAS is proposed to provide answers to the issues arising from integrating machine learning algorithms in interactive multi-agent systems, focusing on three questions i) what are the learning targets for agents?, (ii) how can the machine learning system be integrated into the agent architecture?, and (iii) how can agents learn interactively?. MLIMAS addresses those three questions plus supporting multi-agent systems consisting of autonomous and adaptive agents acting in real-time and noisy environments. As a result of such required capabilities, MLIMAS allows dynamic and intelligent behavior of the agents to efficiently achieve their local and coalition goals such through modeling other agents actions, and interactively taking benefits of self and others preferences in learning and achieving the agents goals. We studied the proposed framework in the Taxi Domain compared with the traditional Q-Learning algorithm without interactive share of information. Our experiments showed 2 times improvement for the average award received per agents trail rather than the traditional Q-Learning approach. In addition, we have got %80 improvement for the same number of trials of the agents to reach the passengers.

Read full abstract

In this paper, Monte Carlo tree search (MCTS) is introduced for controlling the Pac-Man character in the real-time game Ms Pac-Man. MCTS is used to find an optimal path for an agent at each turn, determining the move to make based on the results of numerous randomized simulations. Several enhancements are introduced in order to adapt MCTS to the real-time domain. Ms Pac-Man is an arcade game, in which the protagonist has several goals but no conclusive terminal state. Unlike games such as Chess or Go there is no state in which the player wins the game. Instead, the game has two subgoals, 1) surviving and 2) scoring as many points as possible. Decisions must be made in a strict time constraint of 40 ms. The Pac-Man agent has to compete with a range of different ghost teams, hence limited assumptions can be made about their behavior. In order to expand the capabilities of existing MCTS agents, four enhancements are discussed: 1) a variable-depth tree; 2) simulation strategies for the ghost team and Pac-Man; 3) including long-term goals in scoring; and 4) reusing the search tree for several moves with a decay factor γ. The agent described in this paper was entered in both the 2012 World Congress on Computational Intelligence (WCCI'12, Brisbane, Qld., Australia) and the 2012 IEEE Conference on Computational Intelligence and Games (CIG'12, Granada, Spain) Pac-Man Versus Ghost Team competitions, where it achieved second and first places, respectively. In the experiments, we show that using MCTS is a viable technique for the Pac-Man agent. Moreover, the enhancements improve overall performance against four different ghost teams.

Read full abstract

Real-time Domain Research Articles

Related Topics

Articles published on Real-time Domain

Numerical Path Integral Approach to Quantum Dynamics and Stationary Quantum States

Research on the production performance of multistage fractured horizontal well in shale gas reservoir

Lateral diffusion contributes to FRET from lanthanide-tagged membrane proteins

Consolidation around a tunnel in a general poroelastic medium under anisotropic initial stress conditions

Real-time application mapping for many-cores using a limited migrative model

A Detailed Analysis of Software Cost Estimation Using Cosmic-FFP

MLIMAS: A Framework for Machine Learning in Interactive Multi-agent Systems

Symbol Timing Sequence Structure for OFDM-CDMA Communication System under Low SNR

Validation of Effective Time Translational Invariance and Linear Viscoelasticity of Polymer Undergoing Cross-linking Reaction

Coupled Thermoelasticity Analysis of Annular Laminate Disk Using Laplace Transform and Galerkin Finite Element Method

Optimization of compute unified device architecture for real-time ultrahigh-resolution optical coherence tomography

Spatial and temporal isolation of virtual CAN controllers

Towards hardware embedded virtualization technology

Real-Time Monte Carlo Tree Search in Ms Pac-Man

Moment equations for chromatography based on Langmuir type reaction kinetics

Shape-Dependent Electronic Excitations in Metallic Chains

Exposure

Imaging the eye fundus with real-time en-face spectral domain optical coherence tomography

A Novel Hybrid Solving Approach Based on Combining Similarity Solutions with Laplace Transformation Technique to Solve Different Engineering Problems

OBSTACLE DETECTION AND ELECTRONIC NAVIGATION SYSTEM FOR VISUALLY IMPAIRED PERSONS

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Real-time Domain Research Articles

Related Topics

Articles published on Real-time Domain

Numerical Path Integral Approach to Quantum Dynamics and Stationary Quantum States

Research on the production performance of multistage fractured horizontal well in shale gas reservoir

Lateral diffusion contributes to FRET from lanthanide-tagged membrane proteins

Consolidation around a tunnel in a general poroelastic medium under anisotropic initial stress conditions

Real-time application mapping for many-cores using a limited migrative model

A Detailed Analysis of Software Cost Estimation Using Cosmic-FFP

MLIMAS: A Framework for Machine Learning in Interactive Multi-agent Systems

Symbol Timing Sequence Structure for OFDM-CDMA Communication System under Low SNR

Validation of Effective Time Translational Invariance and Linear Viscoelasticity of Polymer Undergoing Cross-linking Reaction

Coupled Thermoelasticity Analysis of Annular Laminate Disk Using Laplace Transform and Galerkin Finite Element Method

Optimization of compute unified device architecture for real-time ultrahigh-resolution optical coherence tomography

Spatial and temporal isolation of virtual CAN controllers

Towards hardware embedded virtualization technology

Real-Time Monte Carlo Tree Search in Ms Pac-Man

Moment equations for chromatography based on Langmuir type reaction kinetics

Shape-Dependent Electronic Excitations in Metallic Chains

Exposure

Imaging the eye fundus with real-time en-face spectral domain optical coherence tomography

A Novel Hybrid Solving Approach Based on Combining Similarity Solutions with Laplace Transformation Technique to Solve Different Engineering Problems

OBSTACLE DETECTION AND ELECTRONIC NAVIGATION SYSTEM FOR VISUALLY IMPAIRED PERSONS