Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,026 62 Updated Mar 3, 2026

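Sharing AI Infra knowledge and coding exercises: getting started with the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, large-model fundamentals 🧠, AI software and hardware 🔧, and more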

Jupyter Notebook 2,602 233 Updated May 30, 2026
Python 11 2 Updated Jun 15, 2026

Build compute kernels and load them from the Hub.

Python 693 105 Updated Jun 15, 2026
Python 154 9 Updated May 25, 2026
Python 29 7 Updated Jun 15, 2026

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 575 43 Updated May 18, 2026

Kernel sources for https://huggingface.co/kernels-ext-npu

Python 4 1 Updated Apr 3, 2026

AgentHub SDK is the unified and transparent multi-LLM SDK for building reliable Agent Apps. (GPT-5.5/Claude 4.8/Gemini 3.5)

Python 95 7 Updated Jun 12, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,403 700 Updated May 17, 2026

Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation

Python 838 126 Updated Mar 16, 2026

📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice

Python 59,436 7,305 Updated Jun 11, 2026

A collection of memory efficient attention operators implemented in the Triton language.

Python 297 21 Updated Jun 12, 2026

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 673 80 Updated May 21, 2026

🚀 Efficient implementations for emerging model architectures

Python 5,223 558 Updated Jun 11, 2026

Nano vLLM

Python 14,041 2,219 Updated Apr 26, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,618 578 Updated Jun 15, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,977 892 Updated Jun 15, 2026

General-purpose AI designed for knowledge workers — creators, strategists, and operators — and individuals seeking AI systems they can truly control to help them get work done, with full flexibilit…

Dockerfile 39,310 6,210 Updated Jun 15, 2026

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 39,165 3,727 Updated Jul 9, 2025

Financial data platform for analysts, quants and AI agents.

Python 69,218 6,976 Updated Jun 15, 2026

The agent engineering platform.

Python 139,397 23,101 Updated Jun 15, 2026

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

C++ 1,331 230 Updated Jun 15, 2026

Puzzles for learning Triton; play them with minimal environment configuration!

Python 710 101 Updated Mar 17, 2026

Cataloging released Triton kernels.

308 16 Updated Sep 9, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,961 445 Updated Mar 5, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,969 1,055 Updated May 7, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

8,004 287 Updated May 15, 2025

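Legado 3.0 Book Reader with powerful controls & full functions ❤️ Legado (阅读) 3.0 is a tool for reading web content from customizable sources, offering web-literature fans a convenient, fast, and comfortable reading experience.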

Kotlin 46,880 5,782 Updated May 27, 2026