Model based multi-layer perceptron network plays Bekeley’s version of pacman. Trained using Q-learning and experience replay techniques with 3 rounds of random exploration decaying from 1.0 to 0 followed by some sweet game play action.
Thanks to UC Berkeley for developing this environment.
github of this project: https://github.com/daniken/Pacmann .
Comments are closed.