Stars
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
Alibaba DIEN and DIN reproduction in TensorFlow 2.0
Path to a free self-taught education in Computer Science!
List of awesome university courses for learning Computer Science!
Easy-to-use, modular, and extendible package of deep-learning-based CTR models.
A cd command that learns - easily navigate directories from the command line
Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
Open source FPGA-based NIC and platform for in-network compute
A collection of pre-trained, state-of-the-art models in the ONNX format
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Visualizer for neural network, deep learning and machine learning models
Papers Notebook: paper reading notes (Distributed System, Virtualization, Machine Learning)
A high performance and generic framework for distributed DNN training
A list of ICs and IPs for AI, Machine Learning and Deep Learning.
A tool to make it easy to benchmark AI accelerators
Tools for monitoring NVIDIA GPUs on Linux