Proceedings of the Genetic and Evolutionary Computation Conference 2016 2016
DOI: 10.1145/2908812.2908887
|View full text |Cite
|
Sign up to set email alerts
|

Discovering Rubik's Cube Subgroups using Coevolutionary GP

Abstract: This work reports on an approach to direct policy discovery (a form of reinforcement learning) using genetic programming (GP) for the 3 × 3 × 3 Rubik's Cube. Specifically, a synthesis of two approaches is proposed: 1) a previous group theoretic formulation is used to suggest a sequence of objectives for developing solutions to different stages of the overall task; and 2) a hierarchical formulation of GP policy search is utilized in which policies adapted for an earlier objective are explicitly transferred to a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 25 publications
0
4
0
Order By: Relevance
“…The path cost of x s is 0. The heuristic function h(x) is obtained from the learned cost-to-go function shown in equation (2).…”
Section: Bwas A* Searchmentioning
confidence: 99%
See 1 more Smart Citation
“…The path cost of x s is 0. The heuristic function h(x) is obtained from the learned cost-to-go function shown in equation (2).…”
Section: Bwas A* Searchmentioning
confidence: 99%
“…Developing machine learning algorithms to deal with this property of the Rubik's cube might provide insights into learning to solve planning problems with large state spaces. Although machine learning methods have previously been applied to the Rubik's cube, these methods have either failed to reliably solve the cube [1][2][3][4] or have had to rely on specific domain knowledge 5,6 . Outside of machine learning methods, methods based on pattern databases (PDBs) have been effective at solving puzzles such as the Rubik's cube, the 15 puzzle and the 24 puzzle 7,8 , but these methods can be memory-intensive and puzzle-specific.…”
mentioning
confidence: 99%
“…But they did not learn from scratch because they used an optimal solver based on Kociemba (2015) to generate training examples for the DNN. Smith et al (2016) tried to learn Rubik's cube by genetic programming. However, their learned solver could only reliably solve cubes with up to 5 scrambling twists.…”
Section: Computational Costsmentioning
confidence: 99%
“…Mantere proposed a method based on the cultural algorithm for solving the cube [10]. Smith et al [11] developed evolutionary approach using genetic programming.…”
Section: Introductionmentioning
confidence: 99%