The curse of dimensionality severely restricts the use of dynamic programming methods in solving complex problems. Consequently, researchers and practitioners often resort to approximate (suboptimal) control policies that strike a balance between ease of implementation and satisfactory performance. Information relaxation-based duality techniques generate both upper and lower bounds on the true values of stochastic dynamic programming (SDP) problems, allowing the optimality of an approximate policy to be assessed through the dual gap between the two bounds. However, the literature still lacks guidance on handling cases where these gaps are excessively loose. In “Information Relaxation and a Duality-Driven Algorithm for Stochastic Dynamic Programs,” Chen, Ma, Liu, and Yu develop a novel duality-driven dynamic programming (DDP) framework that obtains and tightens confidence interval estimates for the true value functions of SDP problems. Leveraging a new finding that the dual operation yields subsolutions, they establish convergence guarantees for DDP. They also introduce a regression-based Monte Carlo method aimed at high-dimensional applications. Numerical examples demonstrate that DDP effectively tightens the dual gaps of various heuristics commonly used in the literature.
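To make the dual-gap idea concrete, the following is a minimal sketch (not the paper's algorithm) of information relaxation bounds on a toy optimal stopping problem: a heuristic policy gives a lower bound on the optimal value, while a perfect-information relaxation with zero penalty — taking the inner maximum along each realized path — gives a weak-duality upper bound. The problem setup, the threshold policy, and all parameters here are hypothetical and chosen purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stopping problem: stop at any t in {0, ..., T} and collect payoff X_t,
# where X_t is a Gaussian random walk. (Illustrative setup, not from the paper.)
T, n_paths = 5, 10_000
X = rng.normal(loc=1.0, scale=1.0, size=(n_paths, T + 1)).cumsum(axis=1)

def heuristic_payoff(path, threshold=2.0):
    """Simple heuristic policy: stop the first time the payoff reaches the threshold."""
    for x in path:
        if x >= threshold:
            return x
    return path[-1]  # forced to stop at the horizon

# Lower bound: the expected payoff of any feasible (non-anticipative) policy.
lower = np.mean([heuristic_payoff(p) for p in X])

# Upper bound: perfect-information relaxation with zero penalty.
# Maximizing along each realized path over-uses future information,
# so its average dominates the optimal value (weak duality).
upper = np.mean(X.max(axis=1))

gap = upper - lower  # dual gap: bounds the heuristic's suboptimality
print(f"lower={lower:.3f}  upper={upper:.3f}  gap={gap:.3f}")
```

When the gap is small, the heuristic is provably near-optimal; when it is loose, either the penalty in the upper bound or the policy in the lower bound needs improvement, which is the situation the DDP framework is designed to address.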