Tree Search Research Articles

This paper introduces a new data-structural object that we call the tiny pointer. In many applications, traditional \(\log n\) -bit pointers can be replaced with \(o(\log n)\) -bit tiny pointers at the cost of only a constant-factor time overhead and a small probability of failure. We develop a comprehensive theory of tiny pointers, and give optimal constructions for both fixed-size tiny pointers (i.e., settings in which all of the tiny pointers must be the same size) and variable-size tiny pointers (i.e., settings in which the average tiny-pointer size must be small, but some tiny pointers can be larger). If a tiny pointer references an item in an array filled to load factor \(1-\delta\) , then the optimal tiny-pointer size is \(\Theta(\log\log\log n+\log\delta^{-1})\) bits in the fixed-size case, and \(\Theta(\log\delta^{-1})\) expected bits in the variable-size case. Our tiny-pointer constructions also require us to revisit several classic problems having to do with balls and bins; these results may be of independent interest. Using tiny pointers, we apply tiny pointers to five classic data-structure problems. We show that: A data structure storing \(n\) \(v\) -bit values for \(n\) keys with constant-factor time modifications/queries can be implemented to take space \(nv+O(n\log^{(r)}n)\) bits, for any constant \(r\gt0\) , as long as the user stores a tiny pointer of expected size \(O(1)\) with each key—here, \(\log^{(r)}n\) is the \(r\) -th iterated logarithm. Any binary search tree can be made succinct, meaning that it achieves \((1+o(1))\) times the optimal space, with constant-factor time overhead, and can even be made to be within \(O(n)\) bits of optimal if we allow for \(O(\log^{*}n)\) -time modifications—this holds even for rotation-based trees such as the splay tree and the red-black tree. Any fixed-capacity key-value dictionary can be made stable (i.e., items do not move once inserted) with constant-factor time overhead and \((1+o(1))\) -factor space overhead. Any key-value dictionary that requires uniform-size values can be made to support arbitrary-size values with constant-factor time overhead and with an additional space consumption of \(\log^{(r)}n+O(\log j)\) bits per \(j\) -bit value for an arbitrary constant \(r\gt0\) of our choice. Given an external-memory array \(A\) of size \((1+\varepsilon)n\) containing a dynamic set of up to \(n\) key-value pairs, it is possible to maintain an internal-memory stash of size \(O(n\log\varepsilon^{-1})\) bits so that the location of any key-value pair in \(A\) can be computed in constant time (and with no IOs). In each case tiny pointers allow for us to take a natural space-inefficient solution that uses pointers and make it space-efficient for free.

Read full abstract

Active inference is a theory of perception, learning, and decision making that can be applied to neuroscience, robotics, psychology, and machine learning. Recently, intensive research has been taking place to scale up this framework using Monte Carlo tree search and deep learning. The goal of this activity is to solve more complicated tasks using deep active inference. First, we review the existing literature and then progressively build a deep active inference agent as follows: we (1) implement a variational autoencoder (VAE), (2) implement a deep hidden Markov model (HMM), and (3) implement a deep critical hidden Markov model (CHMM). For the CHMM, we implemented two versions, one minimizing expected free energy, CHMM[EFE] and one maximizing rewards, CHMM[reward]. Then we experimented with three different action selection strategies: the ε-greedy algorithm as well as softmax and best action selection. According to our experiments, the models able to solve the dSprites environment are the ones that maximize rewards. On further inspection, we found that the CHMM minimizing expected free energy almost always picks the same action, which makes it unable to solve the dSprites environment. In contrast, the CHMM maximizing reward keeps on selecting all the actions, enabling it to successfully solve the task. The only difference between those two CHMMs is the epistemic value, which aims to make the outputs of the transition and encoder networks as close as possible. Thus, the CHMM minimizing expected free energy repeatedly picks a single action and becomes an expert at predicting the future when selecting this action. This effectively makes the KL divergence between the output of the transition and encoder networks small. Additionally, when selecting the action down the average reward is zero, while for all the other actions, the expected reward will be negative. Therefore, if the CHMM has to stick to a single action to keep the KL divergence small, then the action down is the most rewarding. We also show in simulation that the epistemic value used in deep active inference can behave degenerately and in certain circumstances effectively lose, rather than gain, information. As the agent minimizing EFE is not able to explore its environment, the appropriate formulation of the epistemic value in deep active inference remains an open question.

Read full abstract

Tree Search Research Articles

Related Topics

Articles published on Tree Search

‘Journeys in the Dark’ - Towards Game Master AI in Complex Board Games

Mining contextually meaningful subgraphs from a vertex-attributed graph.

Spectrum Sensing and Resource Allocation for Frequency Hopping Based CRN Using Hazelnut Tree Search Algorithm

A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search

A novel reinforcement learning-based method for structure optimization

Time-lapsed in-situ monitoring of mechanical property in deep soil mixing with surface wave

Terraces in species tree inference from gene trees

GREMI: An Explainable Multi-Omics Integration Framework for Enhanced Disease Prediction and Module Identification.

Analysis of the Application of Artificial Intelligence in Mahjong

Coordination of NPCs in multi-agent systems based on behavior trees

Optimization of Truck–Cargo Matching for the LTL Logistics Hub Based on Three-Dimensional Pallet Loading

Improved Monte Carlo tree search formulation with multiple root nodes for discrete sizing optimization of truss structures

Uncertainty Qualification for Deep Learning-Based Elementary Reaction Property Prediction.

Tiny Pointers

Optimizing a linear function over the efficient set of a multiple objective integer quadratic program

Real-time reactive task allocation and planning of large heterogeneous multi-robot systems with temporal logic specifications

Deconstructing Deep Active Inference: A Contrarian Information Gatherer.

An exact branch-and-price-and-cut algorithm for a practical and large-scale dial-a-ride problem

Separated spacecraft attitude algorithm research based on star sensor multi-information fusion

Bounds and Algorithms for Alphabetic Codes and Binary Search Trees

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Tree Search Research Articles

Related Topics

Articles published on Tree Search

‘Journeys in the Dark’ - Towards Game Master AI in Complex Board Games

Mining contextually meaningful subgraphs from a vertex-attributed graph.

Spectrum Sensing and Resource Allocation for Frequency Hopping Based CRN Using Hazelnut Tree Search Algorithm

A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search

A novel reinforcement learning-based method for structure optimization

Time-lapsed in-situ monitoring of mechanical property in deep soil mixing with surface wave

Terraces in species tree inference from gene trees

GREMI: An Explainable Multi-Omics Integration Framework for Enhanced Disease Prediction and Module Identification.

Analysis of the Application of Artificial Intelligence in Mahjong

Coordination of NPCs in multi-agent systems based on behavior trees

Optimization of Truck–Cargo Matching for the LTL Logistics Hub Based on Three-Dimensional Pallet Loading

Improved Monte Carlo tree search formulation with multiple root nodes for discrete sizing optimization of truss structures

Uncertainty Qualification for Deep Learning-Based Elementary Reaction Property Prediction.

Tiny Pointers

Optimizing a linear function over the efficient set of a multiple objective integer quadratic program

Real-time reactive task allocation and planning of large heterogeneous multi-robot systems with temporal logic specifications

Deconstructing Deep Active Inference: A Contrarian Information Gatherer.

An exact branch-and-price-and-cut algorithm for a practical and large-scale dial-a-ride problem

Separated spacecraft attitude algorithm research based on star sensor multi-information fusion

Bounds and Algorithms for Alphabetic Codes and Binary Search Trees