Stars
ValueCell is a community-driven, multi-agent platform for financial applications.
A Datacenter Scale Distributed Inference Serving Framework
A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.
SGLang is a fast serving framework for large language models and vision language models.
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
FlashInfer: Kernel Library for LLM Serving
A book for Learning the Foundations of LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.
MPI programming lessons in C and executable code examples
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A minimal GPU design in Verilog to learn how GPUs work from the ground up
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)
My resume in LaTeX (template suited for new graduates; 应届生简历模板)
🇨🇳 GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Understanding Deep Learning - Simon J.D. Prince
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation
TVM Documentation in Chinese Simplified / TVM 中文文档