Learning Equilibria in Matching Markets with Bandit Feedback

Meena Jagadeesan, Alexander Wei, Jacob Steinhardt, Yixin Wang, Michael I. Jordan

doi:10.1145/3583681

Abstract

Large-scale, two-sided matching platforms must find market outcomes that align with user preferences while simultaneously learning these preferences from data. Classical notions of stability (Gale and Shapley, 1962; Shapley and Shubik, 1971) are, unfortunately, of limited value in the learning setting, given that preferences are inherently uncertain and destabilizing while they are being learned. To bridge this gap, we develop a framework and algorithms for learning stable market outcomes under uncertainty. Our primary setting is matching with transferable utilities, where the platform both matches agents and sets monetary transfers between them. We design an incentive-aware learning objective that captures the distance of a market outcome from equilibrium. Using this objective, we analyze the complexity of learning as a function of preference structure, casting learning as a stochastic multi-armed bandit problem. Algorithmically, we show that “optimism in the face of uncertainty,” the principle underlying many bandit algorithms, applies to a primal-dual formulation of matching with transfers and leads to near-optimal regret bounds. Our work takes a first step toward elucidating when and how stable matchings arise in large, data-driven marketplaces.
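As a rough illustration of the "optimism in the face of uncertainty" idea mentioned in the abstract, the sketch below runs a UCB-style loop on a tiny matching market. It is not the paper's algorithm: it only selects the matching that maximizes an optimistic estimate of total utility and omits the transfers, which in a primal-dual formulation would come from the dual solution. All names (`ucb_matrix`, `best_matching`, `run`), the confidence bonus, the Gaussian noise model, and the brute-force search over permutations are assumptions made for illustration.

```python
import itertools
import math
import random


def ucb_matrix(sums, counts, t, bonus_scale=1.0):
    """Optimistic (upper-confidence) estimate of each pair's utility.

    Pairs that were never observed get +inf so they are tried at least once.
    The bonus form is a standard UCB-style choice, assumed here.
    """
    n = len(sums)
    ucb = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if counts[i][j] == 0:
                ucb[i][j] = float("inf")
            else:
                mean = sums[i][j] / counts[i][j]
                bonus = bonus_scale * math.sqrt(2 * math.log(t + 1) / counts[i][j])
                ucb[i][j] = mean + bonus
    return ucb


def best_matching(weights):
    """Brute-force maximum-weight perfect matching (fine for tiny markets).

    Returns a tuple p where agent i on one side is matched to agent p[i]
    on the other side.
    """
    n = len(weights)
    return max(
        itertools.permutations(range(n)),
        key=lambda p: sum(weights[i][p[i]] for i in range(n)),
    )


def run(true_utility, rounds=500, noise=0.1, seed=0):
    """Repeatedly pick the optimistic matching and observe noisy utilities."""
    rng = random.Random(seed)
    n = len(true_utility)
    sums = [[0.0] * n for _ in range(n)]
    counts = [[0] * n for _ in range(n)]
    chosen = None
    for t in range(1, rounds + 1):
        chosen = best_matching(ucb_matrix(sums, counts, t))
        for i, j in enumerate(chosen):
            # Bandit feedback: only matched pairs reveal a noisy utility.
            sums[i][j] += true_utility[i][j] + rng.gauss(0, noise)
            counts[i][j] += 1
    return chosen, counts


if __name__ == "__main__":
    matching, counts = run([[0.9, 0.1], [0.1, 0.9]])
    print("last matching:", matching)
    print("observation counts:", counts)
```

In this toy market the high-utility pairs should accumulate most of the observations over time, since the confidence bonus on the low-utility pairs only occasionally makes them look competitive.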

Journal: Journal of the ACM
Publication Date: May 24, 2023
Citations: 1

Similar Papers

Adaptive Exploration in Stochastic Multi-armed Bandit Problem
Xiaofang Zhang ... Quan Liu
27 Dec 2016

Mechanisms with learning for stochastic multi-armed bandit problems
Shweta Jain ... Divya Padmanabhan
Indian Journal of Pure and Applied Mathematics | VOL. 47
01 Jun 2016

Approximation Algorithms for Restless Bandit Problems
Sudipto Guha ... Kamesh Munagala
04 Jan 2009
