Stars
AI agents running research on single-GPU nanochat training automatically
Google Research
Hackable and optimized Transformers building blocks, supporting a composable construction.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
PipeFusion / PipeFusion
Forked from xdit-project/xDiT
A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters
Fast CUDA matrix multiplication from scratch
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
Visualizer for neural network, deep learning and machine learning models
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.
A distributed key-value storage system developed by Alibaba Group
Example TensorFlow code and the Caicloud TensorFlow as a Service dev environment.
An industrial deep learning framework for high-dimension sparse data
Source code for the book "Beginning Algorithm Contests", second edition
An open-source ebook about the TensorFlow kernel and its implementation mechanisms.
Deep Reinforcement Learning Lab, a platform designed to make DRL technology fun for everyone
A simple Python script showing how the backpropagation algorithm works.
A collection of classic and cutting-edge industry papers and resources on recommendation and advertising / Must-read Papers on Recommendation System and CTR Prediction
Deep Learning Book Chinese Translation
Homepage for STAT 157 at UC Berkeley
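Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.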
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
All Algorithms implemented in Python
Master the command line, in one page
Repo for counting stars and contributing. Press F to pay respect to glorious developers.