icml2020专题

An Optimistic Perspective on Offline Reinforcement Learning（ICML2020）

Abstract \quad 该文章利用了 the DQN replay dataset 研究了Offline RL，该数据集包含了一个 DQN agent 在60款Atari 2600游戏上的 the entire replay experience 。 \quad 我们证明了 recent off-policy deep RL 算法，即使仅仅在 replay dataset 上训练，