Skip to content
View JiahuiSun's full-sized avatar

Block or report JiahuiSun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 71,087 9,632 Updated Jun 13, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,946 4,072 Updated Jun 13, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,630 968 Updated Jun 9, 2026

Train transformer language models with reinforcement learning.

Python 18,629 2,790 Updated Jun 13, 2026

上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template

TeX 3,799 798 Updated May 20, 2026

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,272 763 Updated Oct 16, 2024

⛽️「算法通关手册」:从零开始的「算法与数据结构」学习教程,200 道「算法面试热门题目」,1000+ 道「LeetCode 题目解析」,持续更新中!

Python 7,725 1,286 Updated May 17, 2026
Python 2 1 Updated Oct 17, 2025

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 4,592 633 Updated Jul 15, 2025

Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite

Python 57,519 17,468 Updated Jun 12, 2026

Booking the sports places automatically.

JavaScript 4 2 Updated Nov 13, 2021

Unified Reinforcement Learning Framework

Python 834 81 Updated Sep 6, 2024

An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git

TeX 11,176 2,850 Updated Mar 15, 2024

Fast and Lightweight Observability Data Collector

C++ 2,156 440 Updated Jun 12, 2026

The repository is for safe reinforcement learning baselines.

Jupyter Notebook 797 102 Updated Mar 13, 2026

Multi-Objective Reinforcement Learning

Python 307 57 Updated Aug 10, 2021

Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20

Python 73 16 Updated Dec 8, 2022

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,629 899 Updated Mar 24, 2023

Robust-HUCRL

Python 3 1 Updated Nov 13, 2023

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 13,854 1,576 Updated Jun 13, 2026

A parallel framework for population-based multi-agent reinforcement learning.

Python 553 65 Updated Dec 14, 2023

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI

Python 4,585 649 Updated Jun 26, 2024

Awesome Game AI materials of Multi-Agent Reinforcement Learning

968 117 Updated Jun 26, 2024

[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Python 133 34 Updated Oct 9, 2020

Mac screensaver for counting down to a date

Swift 974 89 Updated Jul 16, 2018

This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network" (ITSC 2021)

Jupyter Notebook 39 12 Updated Apr 27, 2022

Computational framework for reinforcement learning in traffic control

Python 1,187 394 Updated Jul 27, 2024

Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting

Python 75 14 Updated Oct 27, 2020
Next