Abstract

Dynamic pricing is considered a way to gain an advantage over competitors in modern online markets. Recent advances in Reinforcement Learning (RL) have provided more capable algorithms that can be applied to pricing problems. In this paper, we study the performance of Deep Q-Networks (DQN) and Soft Actor-Critic (SAC) in different market models. We consider tractable duopoly settings, where optimal solutions derived by dynamic programming techniques can be used for verification, as well as oligopoly settings, which are usually intractable due to the curse of dimensionality. We find that both algorithms provide reasonable results, with SAC performing better than DQN. Moreover, we show that under certain conditions, RL algorithms can be forced into collusion by their competitors without direct communication.

Highlights

  • In modern-day online trading on large platforms, setting the correct price is crucial

  • Our goal is to evaluate the performance of two examples of reinforcement learning (RL) algorithms on dynamic pricing problems

  • We studied pricing competition motivated by online markets in order to provide practitioners with insights into how far reinforcement learning can be used to automate frequent repricing in a self-adaptive way

Introduction

In modern-day online trading on large platforms, setting the correct price is crucial. If your goods' prices are far off the competition, customers might switch to cheaper competitors or to those offering better service or a similar product. Many traders nowadays can make use of dynamic pricing algorithms that automatically update their prices according to competitors' current offers. These price updates may occur at a high frequency. Tractable market settings, such as duopolies, offer the advantage that optimal solutions can still be computed via dynamic programming (DP), cf., e.g., Schlosser and Richly (2019), which provides an opportunity to compare and verify the results of reinforcement learning (RL). The first algorithm we study is Deep Q-Networks (DQN), a value-based method. The second is Soft Actor-Critic (SAC), a recent iteration in the family of policy gradient algorithms; it is based on two components, the actor and the critic. The following two subsections provide a deeper introduction to both algorithms.
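
To make the dynamic programming baseline concrete, the following is a minimal, illustrative sketch of value iteration for a single seller pricing against a fixed competitor response. The price grid, the logistic sale-probability model, and the undercutting competitor strategy are assumptions chosen for illustration only; they are not the market model or the solution approach of Schlosser and Richly (2019).

    # Illustrative sketch (assumed model): value iteration for one seller's pricing
    # problem against a fixed, known competitor reaction. In such small, tractable
    # settings the optimal policy can be computed exactly and used as a benchmark.
    import numpy as np

    prices = np.linspace(1.0, 10.0, 19)   # discretized admissible prices (assumption)
    gamma = 0.95                          # discount factor (assumption)

    def competitor_reaction(own_price):
        """Hypothetical fixed competitor strategy: undercut our price by 0.5."""
        return max(prices[0], own_price - 0.5)

    def sale_probability(own_price, comp_price):
        """Hypothetical logistic demand: the cheaper offer sells more often."""
        return 1.0 / (1.0 + np.exp(own_price - comp_price))

    def nearest_state(price):
        """Map a competitor price back onto the discretized grid."""
        return int(np.abs(prices - price).argmin())

    def q_value(own_price, comp_price, V):
        """Expected immediate revenue plus discounted continuation value."""
        reward = sale_probability(own_price, comp_price) * own_price
        s_next = nearest_state(competitor_reaction(own_price))
        return reward + gamma * V[s_next]

    # State: the competitor's current price; sweep until the values converge.
    V = np.zeros(len(prices))
    for _ in range(1000):
        V_new = np.array([max(q_value(p, c, V) for p in prices) for c in prices])
        if np.max(np.abs(V_new - V)) < 1e-8:
            V = V_new
            break
        V = V_new

    # Greedy policy: the seller's best-response price to each competitor price.
    policy = [prices[np.argmax([q_value(p, c, V) for p in prices])] for c in prices]
    print(list(zip(prices.round(1), np.round(policy, 1))))

In tractable duopoly settings of this kind, an exactly computed policy serves as the benchmark against which the prices learned by DQN and SAC can be verified.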
