0% found this document useful (0 votes)
14 views1 page

Algorithm Key Points

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views1 page

Algorithm Key Points

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Algorithm Key Points


Actor and Critic have separate networks

Loss functions are related to one another (nested function calls)


Replay buffer is large, finite, and sampled uniformly; overwrite
earliest memories

Target networks for both actor and critic, with soft updates

Batch norm for both networks

Actions introduced to critic after states

Stochastic policy adds noise function to deterministic policy

You might also like