Gopher-v0
Maximize your score in the Atari 2600 game Gopher. In this environment, the observation is an RGB image of the screen, which is an array of shape (210, 160, 3) Each action is repeatedly performed for a duration of \(k\) frames, where \(k\) is uniformly sampled from \(\{2, 3, 4\}\).
Gopher-v0 Evaluations
Algorithm | Best 100-episode performance | Submitted |
---|---|---|
ppwwyyxx's algorithm writeup | 22595.00 ± 1012.30 | |
gdb's algorithm writeup | 364.80 ± 27.82 | |
ceobillionaire's algorithm | 23830.80 ± 984.26 | |
ceobillionaire's algorithm | 626.00 ± 38.51 | |
gdb's algorithm | 353.20 ± 32.30 | |
justheuristic's algorithm | 0.00 ± 0.00 |