pcbsaso

Hebbian Automata Reinforcement Learning Improviser (HARLI) demonstrates its 'wave' strategy. HARLI starts with randomly initialised weights and updates them according to an evolved Hebbian learning policy, eventually reaching a high-scoring wave generation strategy. The waves travel at c, the maximum speed in Life-like CA, and yield a reward of about 40 to 50. -> https://github.com/rivesunder/harli_learning, (Davis 2021)

A Hebbian policy games the game in a CA RL environment