Lists (8)
Sort Name ascending (A-Z)
Deep Learning
Application of deep neurual networks.Design
For good taste.Engine
Learn code from source.Learning
Tutorials and summaryLLM
Snippet
Some snippet that might help you.Tool Chain
Using tools to help you work. However, tools cann't inspire you.Wheel
Stars
Accelerating MoE with IO and Tile-aware Optimizations
猫抓 浏览器资源嗅探扩展 / cat-catch Browser Resource Sniffing Extension
A feature-rich command-line audio/video downloader
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Kimi K2 is the large language model series developed by Moonshot AI team
Use Kimi latest model(kimi-k2-0711-preview) to drive your Claude Code.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Efficient Triton Kernels for LLM Training
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
Model Context Protocol Servers
The official Python SDK for Model Context Protocol servers and clients
🚀 The fast, Pythonic way to build MCP servers and clients
High-velocity, monorepo-scale workflow for Git
CUDA Python: Performance meets Productivity
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient Multi-head Latent Attention Kernels
MoBA: Mixture of Block Attention for Long-Context LLMs
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Hydra is a framework for elegantly configuring complex applications