Policy Gradient Theorem Explained - Reinforcement Learning
Video Statistics and Information
Channel: Elliot Waite
Views: 13,345
Keywords: policy gradient, policy gradient reinforcement learning, policy gradient reinforce, policy gradient explained, policy gradient theorem, policy gradient theorem explained, policy gradient theorem proof, policy gradient derivation, policy gradient methods, policy gradient pytorch, policy gradient algorithms, policy gradient reinforcement learning example, policy gradient rl, policy gradient formula, policy gradient introduction, policy gradient log trick, reinforcement learning
Id: cQfOQcpYRzE
Length: 59min 36sec (3576 seconds)
Published: Sun Nov 22 2020
This is pretty good. I started getting stuck about 45 minutes in, and I'd trace it back to not truly understanding the step beginning at 32 minutes, where you introduce Ŷ. At that point I also started wanting to reconnect it to the original formulas given in the introduction.