- Canada
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
SGLang is a high-performance serving framework for large language models and multimodal models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Analyze computation-communication overlap in V3/R1.
A book-in-progress about the Linux kernel and its insides.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
DeepEP: an efficient expert-parallel communication library
Perplexity open source garden for inference technology
A modular graph-based Retrieval-Augmented Generation (RAG) system
Curated collection of papers in machine learning systems
Large Language Model (LLM) Systems Paper List
A curated list of awesome smartnic tutorials, papers and projects.
C++高性能分布式服务器框架,webserver,websocket server,自定义tcp_server(包含日志模块,配置模块,线程模块,协程模块,协程调度模块,io协程调度模块,hook模块,socket模块,bytearray序列化,http模块,TcpServer模块,Websocket模块,Https模块等, Smtp邮件模块, MySQL, SQLite3, ORM,Red…
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
GoogleTest - Google Testing and Mocking Framework
A high performance and generic framework for distributed DNN training
AMD TCPDirect ultra low latency kernel bypass TCP and UDP implementation for AMD Solarflare network adapters, to be used with corresponding versions of Onload®️ at https://github.com/Xilinx-CNS/onl…
bytedance / ps-lite
Forked from dmlc/ps-liteA lightweight parameter server interface
Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.
A Model Context Protocol (MCP) server implementation that provides network control and management capabilities through the ONOS SDN controller.