jiaqiw09

🏠

Working from home

8 followers · 4 following

Achievements

x2 x2

Stars

volcengine / veScale

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,026 62 Updated Mar 3, 2026

CalvinXKY / InfraTech

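Sharing AI Infra knowledge and coding exercises: getting started with the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, large-model fundamentals 🧠, AI software and hardware 🔧, and more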

Jupyter Notebook 2,602 233 Updated May 30, 2026

isLinXu / Vitriol

Python 11 2 Updated Jun 15, 2026

huggingface / kernels

Build compute kernels and load them from the Hub.

Python 693 105 Updated Jun 15, 2026

wpsnote / wpsnote-skills

Python 154 9 Updated May 25, 2026

kali20gakki / msAgent

Python 29 7 Updated Jun 15, 2026

stepfun-ai / SteptronOss

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 575 43 Updated May 18, 2026

pt-ecosystem / kernels-ext-npu

Kernel sources for https://huggingface.co/kernels-ext-npu

Python 4 1 Updated Apr 3, 2026

Prism-Shadow / agenthub

AgentHub SDK is the unified and transparent multi-LLM SDK for building reliable Agent Apps. (GPT-5.5/Claude 4.8/Gemini 3.5)

Python 95 7 Updated Jun 12, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,403 700 Updated May 17, 2026

Wenyueh / MinivLLM

Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation

Python 838 126 Updated Mar 16, 2026

datawhalechina / hello-agents

📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice

Python 59,436 7,305 Updated Jun 11, 2026

flagos-ai / FlagAttention

A collection of memory efficient attention operators implemented in the Triton language.

Python 297 21 Updated Jun 12, 2026

feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 673 80 Updated May 21, 2026

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python 5,223 558 Updated Jun 11, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,041 2,219 Updated Apr 26, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,618 578 Updated Jun 15, 2026

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,977 892 Updated Jun 15, 2026

mindsdb / minds

General-purpose AI designed for knowledge workers — creators, strategists, and operators — and individuals seeking AI systems they can truly control to help them get work done, with full flexibilit…

Dockerfile 39,310 6,210 Updated Jun 15, 2026

QuivrHQ / quivr

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 39,165 3,727 Updated Jul 9, 2025