- Shanghai, China
-
19:30
(UTC +08:00)
Highlights
- Pro
Lists (6)
Sort Name ascending (A-Z)
Stars
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
An agentic skills framework & software development methodology that works.
VocoType 是一款运行在本地端侧的隐私安全语音输入工具,通过快捷键即可将语音实时转换为文字并自动输入到当前应用。支持语音转文字MCP、AI 优化文本、自定义替换词典、录音视频转文字等功能,让语音输入更高效、更安全。
Curated collection of papers in machine learning systems
A framework for efficient model inference with omni-modality models
GPT-SoVITS ONNX Inference Engine & Model Converter
🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?
Ring attention implementation with flash attention
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
My learning notes for ML SYS.
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
From scratch implementation of a vision language model in pure PyTorch
[DAC2024, TensorSSA] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning
[DAC2025] Tropical: Enhancing SLO Attainment in Disaggregated LLM Serving via SLO-Aware Multiplexing
DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit
[DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
Awesome Eino Projects for Learning | 学习 Eino AI 开发框架的项目库
🐈️ 纯真数据库 IPIP.net 格式版,Make qqwry.ipdb Great Again!!!
Simulate keyboard Input with GUI,模拟键盘输入带GUI,破解禁止粘贴
Lab2A-D, Lab3A-B, and Lab4A-B in different branches tagged these names so you can easily handle individual parts
Master programming by recreating your favorite technologies from scratch.
A Golang implemented Redis Server and Cluster. Go 语言实现的 Redis 服务器和分布式集群
A course to build distributed key-value service based on TiKV model