Lists (1)
Sort Name ascending (A-Z)
Stars
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
Train transformer language models with reinforcement learning.
上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
⛽️「算法通关手册」:从零开始的「算法与数据结构」学习教程,200 道「算法面试热门题目」,1000+ 道「LeetCode 题目解析」,持续更新中!
Transformer: PyTorch Implementation of "Attention Is All You Need"
Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite
Booking the sports places automatically.
An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git
Fast and Lightweight Observability Data Collector
The repository is for safe reinforcement learning baselines.
Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Multi-Joint dynamics with Contact. A general purpose physics simulator.
A parallel framework for population-based multi-agent reinforcement learning.
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
Awesome Game AI materials of Multi-Agent Reinforcement Learning
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network" (ITSC 2021)
Computational framework for reinforcement learning in traffic control
Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting