Code release for "Reducing Sampling Error in Batch Temporal Difference Learning" (ICML 2020).
Note: Code upload is in progress. In its current state, the code can serve as a reference for the domain implementations, value function learning, PSEC policy learning, and related components. We will remove this note once we add an automated run script with instructions. Thanks for your interest!