Proximal Policy Optimization
OpenAI is implementing a new reinforcement learning algorithm, Proximal Policy Optimization, which will allow them to train AI policies. What are your thoughts about this? Let us know in the comments!
Read the full article here.