Skip to content
View algebra-MCX's full-sized avatar

Block or report algebra-MCX

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CUDA 算子手撕与面试指南

Cuda 736 81 Updated Aug 23, 2025

Machine Learning Engineering Open Book

Python 16,078 987 Updated Dec 20, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,456 12,183 Updated Dec 21, 2025

The best ChatGPT that $100 can buy.

Python 39,029 4,947 Updated Dec 9, 2025
Python 91 12 Updated Nov 28, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,275 351 Updated Dec 22, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,664 121 Updated Dec 22, 2025

Nano vLLM

Python 9,936 1,248 Updated Nov 3, 2025

Nano vLLM Triton

Python 11 Updated Aug 27, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,054 4,669 Updated Dec 19, 2025

LLM training in simple, raw C/CUDA

Cuda 28,443 3,335 Updated Jun 26, 2025

Fast and memory-efficient exact attention

Python 21,236 2,241 Updated Dec 22, 2025

Tensor library for machine learning

C++ 13,741 1,431 Updated Dec 17, 2025

LLM inference in C/C++

C++ 91,800 14,185 Updated Dec 22, 2025

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

419 39 Updated Aug 2, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,013 882 Updated Dec 4, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,238 1,190 Updated Dec 22, 2025

A comprehensive guide for beginners in the field of data management and artificial intelligence.

510 21 Updated Apr 8, 2025

Neural Networks: Zero to Hero

Jupyter Notebook 19,362 2,715 Updated Aug 18, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,180 3,036 Updated Aug 15, 2024

Video+code lecture on building nanoGPT from scratch

Python 4,622 725 Updated Aug 13, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,300 8,591 Updated Nov 12, 2025

MLIR For Beginners tutorial

C++ 1,168 111 Updated Jul 18, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,482 762 Updated Dec 22, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,900 2,275 Updated Sep 3, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,563 245 Updated Dec 18, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】

Jupyter Notebook 15,448 1,795 Updated Dec 18, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 22,424 2,623 Updated Dec 3, 2025

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 59,653 12,280 Updated Nov 7, 2025

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 3,240 354 Updated Jun 22, 2025
Next