Reinforcement Learning Mechanism Research Articles

The utilization of directional antennas for neighbor discovery in wireless ad hoc networks brings notable benefits, such as extended transmission range, reduced transmission interference, and enhanced antenna gain. However, when nodes use directional antennas for neighbor discovery, the communication range is limited, resulting in a lack of knowledge of potential neighbors. Hence, it is necessary to design a special antenna direction switching strategy for neighbor discovery based on directional antennas. Traditional methods of switching antenna directions are often random or follow predefined sequences, overlooking the historical knowledge of sector exploration for antenna directions. In contrast, existing machine learning approaches aim to leverage observed historical knowledge to adjust antenna directions for faster neighbor discovery. Nonetheless, the latency of neighbor discovery is still high because the node cannot fully utilize the observed historical knowledge (i.e., only using the knowledge observed by the node in transmission mode, ignoring the knowledge observed by the node in reception mode). Meanwhile, the corresponding reward and penalty mechanisms are still not detailed enough (i.e., these reward and penalty mechanisms only consider the sectors of discovered and undiscovered neighboring nodes, ignoring the scenario of sectors that have been rewarded). In this paper, the neighbor discovery process is modeled as a reinforcement learning-based learning automaton. We propose an enhanced reinforcement learning-based two-way transmit-receive directional antennas neighbor discovery algorithm, called ERTTND. The algorithm consists of a two-way transmit-receive reinforcement learning mechanism (TTRL) and an enhanced reward-and-penalty mechanism (ERAP). This algorithm leverages insights from nodes in transmission and reception modes to refine their tactical decisions. 
Then, through an enriched reward-and-penalty framework, nodes optimize their strategies, thus expediting neighbor discovery based on directional antennas in wireless ad hoc networks. Simulation results demonstrate that, compared to existing representative algorithms, the proposed ERTTND algorithm reduces both average discovery delay and energy consumption by more than 30%.
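To make the learning-automaton idea concrete, here is a minimal sketch of a generic linear reward-penalty update over antenna sectors. It is not the ERTTND algorithm: the abstract does not give the TTRL or ERAP update rules, so the function names, the step sizes `a` and `b`, and the toy "neighbor sectors" environment are all assumptions made for illustration.

```python
import random


def update_probabilities(probs, chosen, rewarded, a=0.1, b=0.05):
    """Generic linear reward-penalty update (assumed rule, not ERAP).

    probs    -- current selection probability of each sector
    chosen   -- index of the sector that was just probed
    rewarded -- True if a neighbor was discovered in that sector
    a, b     -- hypothetical reward and penalty step sizes
    """
    k = len(probs)
    updated = []
    for i, p in enumerate(probs):
        if rewarded:
            # Shift probability mass toward the successful sector.
            updated.append(p + a * (1.0 - p) if i == chosen else (1.0 - a) * p)
        else:
            # Shift probability mass away from the unsuccessful sector.
            if i == chosen:
                updated.append((1.0 - b) * p)
            else:
                updated.append(b / (k - 1) + (1.0 - b) * p)
    return updated


def pick_sector(probs, rng):
    """Sample a sector index according to the current probabilities."""
    return rng.choices(range(len(probs)), weights=probs)[0]


def simulate(num_sectors=6, neighbor_sectors=(1, 4), steps=200, seed=0):
    """Toy environment: a probe is rewarded iff the sector holds a neighbor."""
    rng = random.Random(seed)
    probs = [1.0 / num_sectors] * num_sectors
    for _ in range(steps):
        sector = pick_sector(probs, rng)
        probs = update_probabilities(probs, sector, sector in neighbor_sectors)
    return probs
```

Both branches of the update keep the probabilities summing to one, so the vector can be sampled from directly at every step. A scheme closer to the one the abstract describes would also use observations made in reception mode and would treat already-rewarded sectors differently, neither of which this sketch attempts.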

Learning to make adaptive decisions involves making choices, assessing their consequences, and leveraging this assessment to attain more rewarding states. Despite a vast literature on value-based decision-making, relatively little is known about the cognitive processes underlying decisions in highly uncertain contexts. Real-world decisions are rarely accompanied by immediate feedback, explicit rewards, or complete knowledge of the environment. Being able to make informed decisions in such contexts requires significant knowledge about the environment, which can only be gained via exploration. Here we aim to understand and formalize the brain mechanisms underlying these processes. To this end, we first designed and ran an experimental task. Human participants had to learn to maximize reward while making sequences of decisions with only basic knowledge of the environment, and in the absence of explicit performance cues. Participants had to rely on their own internal assessment of performance to reveal a covert relationship between their choices and their subsequent consequences, and so find a strategy leading to the highest cumulative reward. Our results show that the participants' reaction times were longer whenever the decision involved a future consequence, suggesting greater introspection whenever a delayed value had to be considered. The learning time varied significantly across participants. Second, we formalized the neurocognitive processes underlying decision-making within this task, combining mean-field representations of competing neural populations with a reinforcement learning mechanism. This model provided a plausible characterization of the brain dynamics underlying these processes, and reproduced each aspect of the participants' behavior, from their reaction times and choices to their learning rates.
In summary, both the experimental results and the model provide a principled explanation of how delayed value may be computed and incorporated into the neural dynamics of decision-making, and of how learning occurs in these uncertain scenarios.
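As a rough illustration of what "delayed value" means in reinforcement learning terms, the sketch below runs tabular Q-learning on a hypothetical two-stage task in which the choice with the larger immediate reward leads to the smaller cumulative reward. It is not the authors' model, which combines mean-field neural populations with a reinforcement learning mechanism; the task layout, the reward values, and the parameters `alpha`, `gamma`, and `epsilon` are all assumptions.

```python
import random

# Hypothetical toy task: TRANSITIONS[state][action] = (reward, next_state).
# A next_state of None ends the episode.
TRANSITIONS = {
    0: {0: (1.0, 1), 1: (0.0, 2)},  # action 0 pays now, action 1 pays later
    1: {0: (0.0, None)},            # follow-up state after the "greedy" choice
    2: {0: (3.0, None)},            # follow-up state after the "patient" choice
}


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in acts} for s, acts in TRANSITIONS.items()}
    for _ in range(episodes):
        state = 0
        while state is not None:
            actions = list(q[state])
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore
            else:
                action = max(actions, key=lambda a: q[state][a])  # exploit
            reward, nxt = TRANSITIONS[state][action]
            # The discounted value of the next state is the "delayed value".
            future = 0.0 if nxt is None else max(q[nxt].values())
            q[state][action] += alpha * (reward + gamma * future - q[state][action])
            state = nxt
    return q
```

After training, the learned value of the patient action in state 0 exceeds that of the immediately rewarding one, because the update propagates the later reward back through the `gamma * future` term.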

Articles published on Reinforcement Learning Mechanism

A Linear Programming-Based Reinforcement Learning Mechanism for Incomplete-Information Games

Reinforcement Learning for Synchronization of Heterogeneous Multiagent Systems by Improved Q-Functions

Modeling and simulation of Fuzzy-rule based WBAN using multi-level fuzzy colored petri-nets and reinforcement learning

Multi-UAVs task allocation method based on MPSO-SA-DQN

Optimized dead-zone inverse control using reinforcement learning and sliding-mode mechanism for a class of high-order nonlinear systems

Enhanced reinforcement learning-based two-way transmit-receive directional antennas neighbor discovery in wireless ad hoc networks

Disentangling negative reinforcement, working memory, and deductive reasoning deficits in elevated BMI

A novel energy-efficiency framework for UAV-assisted networks using adaptive deep reinforcement learning

How to improve “construct, merge, solve and adapt”? Use reinforcement learning!

Policy complexity suppresses dopamine responses

A Multiproject and Multilevel Plan Management Model Based on a Hybrid Program Evaluation and Review Technique and Reinforcement Learning Mechanism

The Impact of Educational Informatics on School Management Decision-Making in the Context of Big Data

Cognitive mechanisms of learning in sequential decision-making under uncertainty: an experimental and theoretical approach

Exploring the effectiveness of reward-based learning strategies for second-language speech sounds

Eligibility traces in an autonomous soccer robot with obstacle avoidance and navigation policy

An overview: Attention mechanisms in multi-agent reinforcement learning

Chinese image captioning with fusion encoder and visual keyword search

Synergies and Challenges in the Integration of Cloud Computing and Deep Learning: Current Status, Interconnectedness, and Future Directions

Evolving malware detection through instant dynamic graph inverse reinforcement learning

Toward Cooperatively Caching in Multi-UAV-Assisted Network: A Queue-Aware CDS-Based Reinforcement Learning Mechanism With Energy-Efficiency Maximization
