Reinforcement Learning_Policy Gradient

2023-04-11 22:53 作者:别叫我小红 0人读过 | 我要投稿

The following notes contain Lesson 7 of the David Silver's lecture [1] and Chapter 9 of Shiyu Zhao's Mathematical Foundation of Reinforcement Learning [2].

This part originally included lots of frustrating mathematical contents. Since I have not had a good understanding yet, these contents are mainted for later discussion.