-
msu
-
23:51
(UTC -12:00) - @jia_jinghan
- https://jinghanjia.netlify.app/
Highlights
- Pro
Stars
slime is an LLM post-training framework for RL Scaling.
OpenClaw-RL: Train any agent simply by talking
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
Quant/Algorithm trading resources with an emphasis on Machine Learning
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Open-source framework for the research and development of foundation models.
Data Structure Algorithms, (GenAI/ML) System Design, Machine Learning, DevOps coding interview practices
Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…
Renderer for the harmony response format to be used with gpt-oss
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Muon is an optimizer for hidden layers in neural networks
Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP
Lists of company wise questions. Every csv file in the companies directory corresponds to a list of questions on leetcode for a specific company based on the leetcode company tags. Updated as of 20…
An Interview Primer for Quantitative Finance
🚀 Efficient implementations for emerging model architectures
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!