Skip to content
View jinghanjia's full-sized avatar

Highlights

  • Pro

Block or report jinghanjia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

slime is an LLM post-training framework for RL Scaling.

Python 5,355 731 Updated Apr 18, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,018 530 Updated Apr 16, 2026

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

Jupyter Notebook 668 40 Updated Jul 29, 2025

Quant/Algorithm trading resources with an emphasis on Machine Learning

3,592 642 Updated May 21, 2025

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 262 58 Updated Jul 13, 2025

Go ahead and axolotl questions

Python 11,714 1,313 Updated Apr 17, 2026

Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.

Python 140 23 Updated Mar 8, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,801 9,717 Updated Nov 12, 2025

Open-source framework for the research and development of foundation models.

Python 853 104 Updated Apr 18, 2026

Universal memory layer for AI Agents

Python 53,356 5,980 Updated Apr 18, 2026

Awesome AI industry & research review

560 108 Updated Mar 10, 2026

Data Structure Algorithms, (GenAI/ML) System Design, Machine Learning, DevOps coding interview practices

775 204 Updated Oct 7, 2025

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,513 279 Updated Apr 17, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,313 265 Updated Apr 8, 2026

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 4,319 451 Updated Feb 1, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,017 2,063 Updated Mar 27, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,119 682 Updated Apr 17, 2026

Muon is an optimizer for hidden layers in neural networks

Python 2,487 114 Updated Jan 19, 2026

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 104 9 Updated Aug 20, 2025

Leetcode for Pytorch

Jupyter Notebook 2,018 257 Updated Jan 19, 2026

Lists of company wise questions. Every csv file in the companies directory corresponds to a list of questions on leetcode for a specific company based on the leetcode company tags. Updated as of 20…

22,951 4,557 Updated Jun 20, 2025

An Interview Primer for Quantitative Finance

TeX 1,529 188 Updated Sep 28, 2019

🚀 Efficient implementations for emerging model architectures

Python 4,900 502 Updated Apr 17, 2026
Python 4 Updated Jun 11, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 16,051 1,566 Updated Mar 4, 2026

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 61,134 12,315 Updated Mar 11, 2026
Python 6 1 Updated Jun 15, 2025

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —

289 14 Updated Sep 8, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,175 793 Updated Apr 18, 2026
Next