本文主要是介绍[NOTE] Advice and Perspectives on RL Research Frontiers - Rich Sutton in DLRLSS 2019,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!
根据我的习惯,当然先放ressources:slides,video. 这是Sutton在DLRLSS 2019 summer school上的一个lecture,从他自己的角度分享了对RL领域的一些理解,他目前的研究方向及前沿等。一些思考还是很有启发的。个别要点摘录于此,细节可以自行阅读、观看。
Developing your own research thoughts
- There are no authorities in science. Be ambitious but also humble. Your own thought is of great value.
- One best way of training is to write for yourself and discuss with others.
- When thinking on big questions, it's easy to get stuck:
- Define your own terms
- Go multiple: think about alternatives
- Go meta: what are the properties that the solution should have
- Retreat to clearer question
- The most important insight you will ever contribute is too obvious to see.(The discovery of gravity)
“Completing the square” for doing RL research
Research that Sutton is doing
有必要更加深入地理解Prediction和Control的联系与区别。
下文简而言之,Sutton is working on subprolems. The world env is often too complex to learn as a whole. It's natural to have multiple components like different parts of the body. I think it's a bit like the multi-agent concept, whose goals may not directly relate to the global reward.
关于Permanent memory的部分其实非常有想象空间。
这篇关于[NOTE] Advice and Perspectives on RL Research Frontiers - Rich Sutton in DLRLSS 2019的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!