Solving the Rubik’s cube with deep reinforcement learning and search

Forest Agostinelli,Pierre Baldi,Stephen Mcaleer,Alexander Shmakov

doi:10.1038/s42256-019-0070-z

Abstract

The Rubik’s cube is a prototypical combinatorial puzzle that has a large state space with a single goal state. The goal state is unlikely to be accessed using sequences of randomly generated moves, posing unique challenges for machine learning. We solve the Rubik’s cube with DeepCubeA, a deep reinforcement learning approach that learns how to solve increasingly difficult states in reverse from the goal state without any specific domain knowledge. DeepCubeA solves 100% of all test configurations, finding a shortest path to the goal state 60.3% of the time. DeepCubeA generalizes to other combinatorial puzzles and is able to solve the 15 puzzle, 24 puzzle, 35 puzzle, 48 puzzle, Lights Out and Sokoban, finding a shortest path in the majority of verifiable cases. For some combinatorial puzzles, solutions can be verified to be optimal, for others, the state space is too large to be certain that a solution is optimal. A new deep learning based search heuristic performs well on the iconic Rubik’s cube and can also generalize to puzzles in which optimal solvers are intractable.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Solving the Rubik’s cube with deep reinforcement learning and search

Abstract

Talk to us

Similar Papers

More From: Nature Machine Intelligence

Lead the way for us

Journal: Nature Machine Intelligence	Publication Date: Jul 15, 2019
Citations: 90

Similar Papers

Towards efficiently solving the rubik’s cube with deep reinforcement learning and recursion
M Mahindra Roshan ... U Subramaniam
E3S Web of Conferences | VOL. 491
M Mahindra Roshan, et. al.M Mahindra Roshan ... U Subramaniam
01 Jan 2024
E3S Web of Conferences | VOL. 491

Energy efficient task scheduling based on deep reinforcement learning in cloud environment: A specialized review
Huanhuan Hou ... Azlan Ismail
Future Generation Computer Systems | VOL. 151
Huanhuan Hou, et. al.Huanhuan Hou ... Azlan Ismail
14 Oct 2023
Future Generation Computer Systems | VOL. 151

Target‐driven visual navigation in indoor scenes using reinforcement learning and imitation learning
Qiang Fang ... Yujun Zeng
CAAI Transactions on Intelligence Technology | VOL. 7
Qiang Fang, et. al.Qiang Fang ... Yujun Zeng
21 Apr 2021
CAAI Transactions on Intelligence Technology | VOL. 7

Matching workloads ro systems with deep reinforcement learning
Bing Hu ... Nicholas Mason
Journal of Management and Engineering Integration | VOL. 17
Bing Hu, et. al.Bing Hu ... Nicholas Mason
01 Jun 2024
Journal of Management and Engineering Integration | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Solving the Rubik’s cube with deep reinforcement learning and search

Abstract

Talk to us

Similar Papers

More From: Nature Machine Intelligence