Batch Reinforcement Learning