Stars
Bridge local AI coding agents (Claude Code, Cursor, Gemini CLI, Codex) to messaging platforms (Feishu/Lark, DingTalk, Slack, Telegram, Discord, LINE, WeChat Work). Chat with your AI dev assistant f…
你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
MoonPalace(月宫)是由 Moonshot AI 月之暗面提供的 API 调试工具。
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
ROCm / Megatron-LM
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
HabanaAI / vllm-fork
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Run compilers interactively from your web browser and interact with the assembly
Unified KV Cache Compression Methods for Auto-Regressive Models
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.