Skip to content
View WangErXiao's full-sized avatar
:octocat:
hi
:octocat:
hi

Block or report WangErXiao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent skills for vLLM

Shell 67 19 Updated Apr 3, 2026

A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …

TypeScript 68,669 5,852 Updated Apr 28, 2026

An agentic skills framework & software development methodology that works.

Shell 170,159 15,022 Updated Apr 28, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,145 547 Updated Apr 28, 2026
Python 236 46 Updated Apr 27, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 118,563 19,701 Updated Apr 28, 2026

Spec-driven development (SDD) for AI coding assistants.

TypeScript 43,420 3,009 Updated Apr 24, 2026

AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识(https://forceinjection.github.io/)

HTML 1,131 178 Updated Apr 27, 2026

The open source coding agent.

TypeScript 150,774 17,344 Updated Apr 28, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 168,510 26,123 Updated Apr 26, 2026

Repo for Qwen Image Finetune

Jupyter Notebook 230 25 Updated Mar 12, 2026

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 367 24 Updated Sep 29, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,313 262 Updated Sep 12, 2025

A high-performance and light-weight router for vLLM large scale deployment

Rust 207 72 Updated Apr 24, 2026

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 25,099 3,936 Updated Mar 6, 2026

TORCH_TRACE parser for PT2

Rust 85 27 Updated Apr 9, 2026

Ultralytics YOLO 🚀

Python 56,502 10,867 Updated Apr 28, 2026

A framework for efficient model inference with omni-modality models

Python 4,527 845 Updated Apr 28, 2026

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 377 78 Updated Apr 28, 2026

Easy, Fast, and Scalable Multimodal AI

Python 124 9 Updated Apr 17, 2026

A PyTorch native platform for training generative AI models

Python 5,273 796 Updated Apr 28, 2026

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 19,318 3,298 Updated Apr 27, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,112 345 Updated Apr 14, 2026

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

Python 61,382 5,150 Updated Apr 27, 2026

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

C 273 28 Updated Aug 6, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 91,605 14,108 Updated Apr 16, 2026

Official Docker image for the NATS server

Dockerfile 160 54 Updated Apr 27, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,923 644 Updated Apr 28, 2026
Next