Divyansh Garg, Joey Hejna, Matthieu Geist, Stefano Ermon: Extreme Q-Learning: MaxEnt RL without Entropy. ICLR 2023