Lists (10)
Sort Name ascending (A-Z)
Starred repositories
The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models
Lyken17 / threadweaver
Forked from facebookresearch/threadweaverThe implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…
CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.
Teams-first Multi-agent orchestration for Claude Code
Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocations such as NCCL, ...)
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …
A benchmark of real-world DL kernel problems
🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
AI agents running research on single-GPU nanochat training automatically
部署一个 Claude 官方 API 反向代理服务,使用 Caddy 和 Fly.io
Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy
Humanizer 的汉化版本,Claude Code Skills,旨在消除文本中 AI 生成的痕迹。
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows