JiahuiSun

Sun Jiahui JiahuiSun

I am currently a Ph.D in Shanghai Jiao Tong University(SJTU). I received my Bachelor's degree from Tianjin University(TJU).

18 followers · 20 following

Achievements

Lists (1)

Sort

offline RL

Stars

bytedance / deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 71,087 9,632 Updated Jun 13, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,946 4,072 Updated Jun 13, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,630 968 Updated Jun 9, 2026

deepseek-ai / DeepSeek-R1

92,011 11,714 Updated Jun 27, 2025

deepseek-ai / DeepSeek-V3

Python 103,749 16,735 Updated Aug 28, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,629 2,790 Updated Jun 13, 2026

sjtug / SJTUThesis

上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template

TeX 3,799 798 Updated May 20, 2026

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 8,272 763 Updated Oct 16, 2024

itcharge / AlgoNote

⛽️「算法通关手册」：从零开始的「算法与数据结构」学习教程，200 道「算法面试热门题目」，1000+ 道「LeetCode 题目解析」，持续更新中！

Python 7,725 1,286 Updated May 17, 2026

TianjunChi / yolov3_pytorch

Python 2 1 Updated Oct 17, 2025

hyunwoongko / transformer

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 4,592 633 Updated Jul 15, 2025

ultralytics / yolov5

Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite

Python 57,519 17,468 Updated Jun 12, 2026

SunicYosen / sjtu-sports

Booking the sports places automatically.

JavaScript 4 2 Updated Nov 13, 2021

OpenRL-Lab / openrl

Unified Reinforcement Learning Framework

Python 834 81 Updated Sep 6, 2024

billryan / resume

An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git

TeX 11,176 2,850 Updated Mar 15, 2024

alibaba / loongcollector

Fast and Lightweight Observability Data Collector

C++ 2,156 440 Updated Jun 12, 2026

chauncygu / Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Jupyter Notebook 797 102 Updated Mar 13, 2026

RunzheYang / MORL

Multi-Objective Reinforcement Learning

Python 307 57 Updated Aug 10, 2021

nrhinehart / deep_imitative_models

Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20

Python 73 16 Updated Dec 8, 2022

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,629 899 Updated Mar 24, 2023

sebascuri / rhucrl

Robust-HUCRL

Python 3 1 Updated Nov 13, 2023

google-deepmind / mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

C++ 13,854 1,576 Updated Jun 13, 2026

sjtu-marl / malib

A parallel framework for population-based multi-agent reinforcement learning.

Python 553 65 Updated Dec 14, 2023

kwai / DouZero

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI

Python 4,585 649 Updated Jun 26, 2024

datamllab / awesome-game-ai

Awesome Game AI materials of Multi-Agent Reinforcement Learning

968 117 Updated Jun 26, 2024

mit-gfx / PGMORL

[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Python 133 34 Updated Oct 9, 2020

soffes / Countdown

Mac screensaver for counting down to a date

Swift 974 89 Updated Jul 16, 2018

juhyeonkim95 / TaxiSimulatorOnGraph

This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network" (ITSC 2021)

Jupyter Notebook 39 12 Updated Apr 27, 2022

flow-project / flow

Computational framework for reinforcement learning in traffic control

Python 1,187 394 Updated Jul 27, 2024

RobinLu1209 / STAG-GCN

Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting

Python 75 14 Updated Oct 27, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sun Jiahui JiahuiSun

Achievements

Achievements

Block or report JiahuiSun

Lists (1)

offline RL

Stars

bytedance / deer-flow

verl-project / verl

OpenRLHF / OpenRLHF

deepseek-ai / DeepSeek-R1

deepseek-ai / DeepSeek-V3

huggingface / trl

sjtug / SJTUThesis

LianjiaTech / BELLE

itcharge / AlgoNote

TianjunChi / yolov3_pytorch

hyunwoongko / transformer

ultralytics / yolov5

SunicYosen / sjtu-sports

OpenRL-Lab / openrl

billryan / resume

alibaba / loongcollector

chauncygu / Safe-Reinforcement-Learning-Baselines

RunzheYang / MORL

nrhinehart / deep_imitative_models

sweetice / Deep-reinforcement-learning-with-pytorch

sebascuri / rhucrl

google-deepmind / mujoco

sjtu-marl / malib

kwai / DouZero

datamllab / awesome-game-ai

mit-gfx / PGMORL

soffes / Countdown

juhyeonkim95 / TaxiSimulatorOnGraph

flow-project / flow

RobinLu1209 / STAG-GCN