jinghanjia

Jinghan Jia jinghanjia

Ph.D. Student at Michigan State University. Scalable and Trustworthy AI. ❤️Optimization, AI for Programming language, AI health.

25 followers · 19 following

Achievements

Highlights

Stars

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,355 731 Updated Apr 18, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,018 530 Updated Apr 16, 2026

SWE-Gym / SWE-Gym

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

Jupyter Notebook 668 40 Updated Jul 29, 2025

grananqvist / Awesome-Quant-Machine-Learning-Trading

Quant/Algorithm trading resources with an emphasis on Machine Learning

3,592 642 Updated May 21, 2025

R2E-Gym / R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 262 58 Updated Jul 13, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 11,714 1,313 Updated Apr 17, 2026

allenai / SERA

Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.

Python 140 23 Updated Mar 8, 2026

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,801 9,717 Updated Nov 12, 2025

marin-community / marin

Open-source framework for the research and development of foundation models.

Python 853 104 Updated Apr 18, 2026

mem0ai / mem0

Universal memory layer for AI Agents

Python 53,356 5,980 Updated Apr 18, 2026

junfanz1 / Awesome-AI-Review

Awesome AI industry & research review

560 108 Updated Mar 10, 2026

junfanz1 / Software-Engineer-Coding-Interviews

Data Structure Algorithms, (GenAI/ML) System Design, Machine Learning, DevOps coding interview practices

775 204 Updated Oct 7, 2025

Mirix-AI / MIRIX

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,513 279 Updated Apr 17, 2026

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust 4,313 265 Updated Apr 8, 2026

zai-org / GLM-4.5

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 4,319 451 Updated Feb 1, 2026

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,017 2,063 Updated Mar 27, 2026

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,119 682 Updated Apr 17, 2026

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 2,487 114 Updated Jan 19, 2026

liangyuwang / Tiny-FSDP

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 104 9 Updated Aug 20, 2025

Exorust / TorchLeet

Leetcode for Pytorch

Jupyter Notebook 2,018 257 Updated Jan 19, 2026

liquidslr / interview-company-wise-problems

Lists of company wise questions. Every csv file in the companies directory corresponds to a list of questions on leetcode for a specific company based on the leetcode company tags. Updated as of 20…

22,951 4,557 Updated Jun 20, 2025

dwcoder / QuantitativePrimer

An Interview Primer for Quantitative Finance

TeX 1,529 188 Updated Sep 28, 2019

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python 4,900 502 Updated Apr 17, 2026

OPTML-Group / EPiC

Python 4 Updated Jun 11, 2025

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 16,051 1,566 Updated Mar 4, 2026

youngyangyang04 / leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

Shell 61,134 12,315 Updated Mar 11, 2026

0russwest0 / Awesome-Agent-RL

506 21 Updated Oct 11, 2025

OPTML-Group / Unlearn-ILU

Python 6 1 Updated Jun 15, 2025

dunnolab / awesome-in-context-rl

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —

289 14 Updated Sep 8, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,175 793 Updated Apr 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jinghan Jia jinghanjia

Achievements

Achievements

Highlights

Block or report jinghanjia

Stars

THUDM / slime

Gen-Verse / OpenClaw-RL

SWE-Gym / SWE-Gym

grananqvist / Awesome-Quant-Machine-Learning-Trading

R2E-Gym / R2E-Gym

axolotl-ai-cloud / axolotl

allenai / SERA

karpathy / nanoGPT

marin-community / marin

mem0ai / mem0

junfanz1 / Awesome-AI-Review

junfanz1 / Software-Engineer-Coding-Interviews

Mirix-AI / MIRIX

openai / harmony

zai-org / GLM-4.5

openai / gpt-oss

kvcache-ai / Mooncake

KellerJordan / Muon

liangyuwang / Tiny-FSDP

Exorust / TorchLeet

liquidslr / interview-company-wise-problems

dwcoder / QuantitativePrimer

fla-org / flash-linear-attention

OPTML-Group / EPiC

QwenLM / Qwen-Agent

youngyangyang04 / leetcode-master

0russwest0 / Awesome-Agent-RL

OPTML-Group / Unlearn-ILU

dunnolab / awesome-in-context-rl

OpenPipe / ART