Stars
AI agents running research on single-GPU nanochat training automatically
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Download market data from Yahoo! Finance's API
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…