SolveBeliefMDP Gifs displaying the deep RL policy's performance on the LaserTag domain Vanila LaserTag (Discrete Robot State and Action Space with exact belief updates) Modified LaserTag (Continuous Robot State and Action Space with exact belief updates)