Proximal Policy Optimization (PPO) is a popular, state-of-the-art policy gradient method. It is supposed to learn relatively quickly and stably while being much simpler to tune than other st…