Skip to content
View jinghanjia's full-sized avatar

Highlights

  • Pro

Block or report jinghanjia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

slime is an LLM post-training framework for RL Scaling.

Python 5,123 692 Updated Apr 5, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,655 469 Updated Apr 4, 2026

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

Jupyter Notebook 656 40 Updated Jul 29, 2025

Quant/Algorithm trading resources with an emphasis on Machine Learning

3,547 632 Updated May 21, 2025

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 258 56 Updated Jul 13, 2025

Go ahead and axolotl questions

Python 11,582 1,287 Updated Apr 5, 2026

Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.

Python 136 20 Updated Mar 8, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,108 9,561 Updated Nov 12, 2025

Open-source framework for the research and development of foundation models.

Python 831 102 Updated Apr 5, 2026

Universal memory layer for AI Agents

Python 52,009 5,823 Updated Apr 4, 2026

Awesome AI industry & research review

555 108 Updated Mar 10, 2026

Data Structure Algorithms, (GenAI/ML) System Design, Machine Learning, DevOps coding interview practices

756 195 Updated Oct 7, 2025

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,483 272 Updated Mar 12, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,257 264 Updated Mar 27, 2026

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 4,309 445 Updated Feb 1, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,973 2,063 Updated Mar 27, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,041 652 Updated Apr 5, 2026

Muon is an optimizer for hidden layers in neural networks

Python 2,460 111 Updated Jan 19, 2026

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 101 9 Updated Aug 20, 2025

Leetcode for Pytorch

Jupyter Notebook 1,996 252 Updated Jan 19, 2026

Lists of company wise questions. Every csv file in the companies directory corresponds to a list of questions on leetcode for a specific company based on the leetcode company tags. Updated as of 20…

22,356 4,399 Updated Jun 20, 2025

An Interview Primer for Quantitative Finance

TeX 1,524 189 Updated Sep 28, 2019

🚀 Efficient implementations for emerging model architectures

Python 4,809 478 Updated Apr 5, 2026
Python 4 Updated Jun 11, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,903 1,531 Updated Mar 4, 2026

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 60,980 12,318 Updated Mar 11, 2026
Python 6 1 Updated Jun 15, 2025

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —

287 14 Updated Sep 8, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,132 785 Updated Apr 3, 2026
Next