Starred repositories
Really Fast End-to-End Jax RL Implementations
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
calflops is designed to calculate FLOPs, MACs, and parameters for a wide variety of neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)
An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on the NES
Honor of Kings AI Open Environment of Tencent
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
Lightweight version of MAPPO to help you quickly migrate to your local environment.
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
hanabi_learning_environment is a research platform for Hanabi experiments.
An extension of the PyMARL codebase that includes additional algorithms and environment support
Author's PyTorch implementation of BCQ for continuous and discrete actions
Decision Intelligence Platform for Autonomous Driving simulation.
corl-team / CORL
Forked from tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Dream to Control: Learning Behaviors by Latent Imagination
Effortlessly add AI-generated transcription subtitles to your videos
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Matplotlib tutorial in Chinese; read online at https://datawhalechina.github.io/fantastic-matplotlib/
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
PyTorch implementation of the CREPE pitch tracker
Implementation of benchmark RL algorithms