Skip to content
View guixianjin's full-sized avatar

Block or report guixianjin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Smart Investing Examples

Python 457 206 Updated Sep 13, 2021

Lightweight and Scalable Post-training: The Ray-Free, Debug-Friendly Alignment Stack with Megatron-native simplicity.

Python 53 2 Updated May 20, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 12,007 1,103 Updated Jun 11, 2026

An agentic skills framework & software development methodology that works.

Shell 226,402 20,117 Updated Jun 12, 2026

🦸 AI 编程超能力 · 中文增强版 — superpowers(116k+ ⭐)完整汉化 + 6 个中国原创 skills,让 Claude Code / Copilot CLI / Hermes Agent / Cursor / Windsurf / Kiro / Gemini CLI 等 16 款 AI 编程工具真正会干活

Shell 5,268 508 Updated Jun 12, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,550 254 Updated Jun 13, 2026

Stable and Efficient Reinforcement Learning for Trillion-Parameter LLMs

Python 137 8 Updated May 30, 2026

Post-training with Tinker

Python 3,465 446 Updated Jun 13, 2026

Self-evolving agent: grows skill tree from 3.3K-line seed, achieving full system control with 6x less token consumption

Python 12,823 1,476 Updated Jun 13, 2026

Fast, small, and fully autonomous AI personal assistant infrastructure, any OS, any platform — deploy anywhere, swap anything 🦀

Rust 31,894 4,721 Updated Jun 13, 2026

A lightweight, powerful framework for multi-agent workflows

Python 27,118 4,186 Updated Jun 13, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 132,145 21,394 Updated Jun 13, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,219 190 Updated Aug 26, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 218 76 Updated Jun 12, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,160 2,094 Updated Jun 9, 2026

A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows

Python 160 18 Updated May 25, 2026

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 17,174 1,945 Updated Jun 12, 2026

Vocabulary Parallelism

Python 26 Updated Mar 10, 2025

Fault-tolerant for DL frameworks

Python 71 13 Updated Jul 5, 2023

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,953 6,507 Updated Jun 13, 2026

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 423 45 Updated Jun 13, 2026
Python 366 28 Updated Aug 12, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 964 86 Updated Jun 8, 2026

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 661 63 Updated Jan 29, 2026

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,261 59 Updated Aug 27, 2025

A high-performance and light-weight router for vLLM large scale deployment

Rust 267 96 Updated May 6, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,857 7,676 Updated Jun 13, 2026

AI agents running research on single-GPU nanochat training automatically

Python 86,457 12,523 Updated Mar 26, 2026
Next