Chord2048
😋
on my way
  • Tencent
  • Shenzhen


The Rust package manager

Rust 15,123 2,946 Updated Jun 18, 2026

Agentic RL on Any Harness at Scale

Python 571 61 Updated Jun 17, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,242 289 Updated Jun 18, 2026

A Gym for Agentic LLMs

Python 494 33 Updated Jan 21, 2026

A construction kit for reinforcement learning environment management.

Python 456 67 Updated Jun 18, 2026

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 115 8 Updated May 13, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,926 7,700 Updated Jun 18, 2026

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,290 333 Updated Jun 13, 2026

A project implementing various agentic RL based on the Slime post-training framework

Python 470 32 Updated Apr 11, 2026

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 290 62 Updated Jul 13, 2025

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,752 5,996 Updated Jun 18, 2026

Evaluate and improve models and agents using environments

Python 988 190 Updated Jun 17, 2026

Standardized environment infrastructure for Agentic AI development.

Python 308 36 Updated Jun 18, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 301 25 Updated Jan 17, 2026

Docker image registry for SWE-bench, created by Epoch AI.

Python 18 1 Updated Aug 21, 2025

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Python 2,961 188 Updated Jun 18, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,162 6,604 Updated Jun 18, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,032 4,093 Updated Jun 18, 2026

📚 《从零开始构建智能体》 (Building Agents from Scratch): a from-scratch tutorial on the principles and practice of agents

Python 60,207 7,406 Updated Jun 11, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,209 905 Updated Jun 18, 2026

Magnificent app which corrects your previous console command.

Python 97,368 3,948 Updated Jul 19, 2024

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,964 445 Updated Nov 13, 2025

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 20,856 3,998 Updated Jun 18, 2026

The agent that grows with you

Python 196,713 34,685 Updated Jun 18, 2026
JavaScript 2 Updated Apr 17, 2026

Flutter makes it easy and fast to build beautiful apps for mobile and beyond

Dart 176,984 30,535 Updated Jun 18, 2026

A framework for building native applications using React

C++ 126,027 25,183 Updated Jun 18, 2026

AndroidWorld is an environment and benchmark for autonomous agents

Python 799 155 Updated Jun 12, 2026

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Jupyter Notebook 1,822 177 Updated Apr 20, 2026

Fast, small, and fully autonomous AI personal assistant infrastructure, any OS, any platform — deploy anywhere, swap anything 🦀

Rust 31,941 4,733 Updated Jun 18, 2026