Sequential Decision Research Articles

The complexity of wireless network ecosystems and Internet of Things (IoT) connected devices has increased rapidly as technology advances and cyber threats multiply. Existing methods cannot make sequential decisions in complex network environments, particularly in scenarios with partial observability and non-stationarity. Network awareness means monitoring and understanding the network's assets, vulnerabilities, and ongoing activities in real time. Advanced analytics, machine learning algorithms, and artificial intelligence improve risk perception by analyzing massive amounts of information, identifying trends, and anticipating future security breaches. Hence, this study proposes the Deep Reinforcement Learning-assisted Network Awareness Risk Perception and Prevention Model (DRL-NARPP) for detecting malicious activity in cybersecurity. The proposed system begins with network awareness: DRL algorithms constantly monitor and evaluate the condition of the network in terms of factors such as asset configurations, traffic patterns, and vulnerabilities. DRL provides autonomous learning and adaptation to changing network settings, revealing the ever-changing nature of network risks in real time. Incorporating DRL into risk perception increases the system's capacity to recognize advanced attack methods while decreasing the number of false positives and enhancing the reliability of risk assessments. DRL algorithms also drive dynamic, context-aware response mechanisms, which make up the adaptive network prevention component of the model. Because the system learns from historical data and current network activity, it can predict new threats and proactively deploy preventive measures, such as changing firewall rules, isolating compromised devices, or dynamically reallocating resources to mitigate emerging risks. Compared with other popular methodologies, the proposed DRL-NARPP model increases the anomaly detection rate by 98.3%, the attack prediction accuracy rate by 97.4%, and the network risk assessment ratio by 96.4%, and reduces the false positive ratio by 11.2%.

Guidance commands of flight vehicles can be regarded as a sequence of data sets at fixed time intervals; thus, guidance design constitutes a typical sequential decision problem and satisfies the basic conditions for using the deep reinforcement learning (DRL) technique. In this paper, we consider the scenario where the escape flight vehicle (EFV) generates guidance commands based on the DRL technique, while the pursuit flight vehicles (PFVs) derive their guidance commands using the proportional navigation method. For every PFV, the evasion distance is defined as the minimum distance between the EFV and that PFV during the escape-and-pursuit process. For the EFV, the objective of the guidance design is to progressively maximize the residual velocity, defined as the EFV’s velocity when the last evasion distance is attained, subject to the constraint imposed by the given evasion distance threshold. In this problem, three sources of uncertainty arise: (1) the number of PFVs requiring evasion at each time instant; (2) the precise time instant at which each evasion distance is attained; (3) whether each attained evasion distance exceeds the given threshold. To solve this challenging problem, we propose a solution that integrates the recurrent neural network (RNN) with the proximal policy optimization (PPO) algorithm to generate the guidance commands of the EFV. Initially, the model, trained by the RNN-based PPO algorithm, demonstrates effectiveness in evading a single PFV. Subsequently, the same model is deployed to evade additional PFVs, thereby systematically extending its capabilities. Simulation results show that the guidance design method based on the proposed RNN-based PPO algorithm is highly effective.
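The two quantities defined in the abstract above, evasion distance and residual velocity, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes trajectories are already sampled at the same fixed time steps, uses 2-D positions for brevity, and takes velocity as a scalar speed per step; the function name `evasion_metrics` and all argument names are hypothetical.

```python
import math

def evasion_metrics(efv_pos, efv_speed, pfv_trajectories, threshold):
    """Compute per-PFV evasion distances, the EFV residual velocity,
    and whether the evasion-distance constraint holds.

    efv_pos:          list of (x, y) EFV positions, one per time step (assumed 2-D)
    efv_speed:        list of EFV speeds, one per time step
    pfv_trajectories: list of PFV position lists, aligned with efv_pos
    threshold:        required minimum evasion distance
    """
    distances = []
    steps = []
    for pfv_pos in pfv_trajectories:
        # Distance between the EFV and this PFV at every sampled step.
        d = [math.dist(e, p) for e, p in zip(efv_pos, pfv_pos)]
        # Evasion distance: the minimum over the whole engagement.
        k = min(range(len(d)), key=d.__getitem__)
        distances.append(d[k])
        steps.append(k)

    # Residual velocity: EFV speed at the step where the LAST
    # evasion distance is attained.
    residual_velocity = efv_speed[max(steps)]

    # Constraint: every evasion distance must reach the threshold.
    feasible = all(dist >= threshold for dist in distances)
    return distances, residual_velocity, feasible


# Toy data, invented purely for illustration.
efv = [(0, 0), (1, 0), (2, 0), (3, 0)]
speed = [10.0, 9.0, 8.0, 7.0]
pfv_a = [(0, 3), (1, 2), (2, 1), (3, 3)]   # closest at step 2
pfv_b = [(0, 5), (1, 2), (2, 4), (3, 5)]   # closest at step 1

result = evasion_metrics(efv, speed, [pfv_a, pfv_b], threshold=1.5)
print(result)
```

In a reinforcement learning setting, a reward signal would typically be built from values like these, for example by rewarding the residual velocity only when the constraint is satisfied. The abstract does not state the reward design, so that remains an assumption.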

Articles published on Sequential Decision

Multi-agent reinforcement learning clustering algorithm based on silhouette coefficient

A Review of Degradation Models and Remaining Useful Life Prediction for Testing Design and Predictive Maintenance of Lithium-Ion Batteries.

Q-learning based scheduling method for continuous pickling process of titanium strips

Dynamic resource matching in manufacturing using deep reinforcement learning

A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making

Reinforcement learning for watershed and aquifer management: a nationwide view in the country of Mexico with emphasis in Baja California Sur

Application Study on the Reinforcement Learning Strategies in the Network Awareness Risk Perception and Prevention

An algorithm for multi-armed bandit based on variance change sensitivity

SWOAM: Swarm optimized agents for energy management in grid-interactive connected buildings

Sequential Label Enhancement.

Reinforced Sequential Decision-Making for Sepsis Treatment: The PosNegDM Framework With Mortality Classifier and Transformer

Neurophysiological insights into sequential decision-making: exploring the secretary problem through ERPs and TBR dynamics.

Guidance Design for Escape Flight Vehicle against Multiple Pursuit Flight Vehicles Using the RNN-Based Proximal Policy Optimization Algorithm

An investigation of belief-free DRL and MCTS for inspection and maintenance planning

Multi-period share pledging with sequential three-way proportion decision

Energy management for scalable battery swapping stations: A deep reinforcement learning and mathematical optimization cascade approach

Multi-robot Source Navigation Method Based on Coordination Graph Monte Carlo Tree Search

Goal commitment is supported by vmPFC through selective attention

Probabilistic reach-avoid for Bayesian neural networks

A sequential three-way risk sorting model with the cautionary principle under probabilistic linguistic environment
