wz1qqx

ValueCell is a community-driven, multi-agent platform for financial applications.

Python 7,751 1,356 Updated Dec 24, 2025

Sampling profiler for Python programs

Rust 14,761 492 Updated Dec 15, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,683 753 Updated Dec 25, 2025

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 121 15 Updated Dec 25, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,960 3,865 Updated Dec 25, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,053 891 Updated Dec 24, 2025

FlashInfer: Kernel Library for LLM Serving

Python 4,356 616 Updated Dec 25, 2025

Tile primitives for speedy kernels

Cuda 3,017 220 Updated Dec 9, 2025

A book for Learning the Foundations of LLMs

15,044 1,391 Updated Dec 12, 2025

LLM inference in C/C++

C++ 91,973 14,239 Updated Dec 25, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,449 553 Updated Dec 8, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,154 12,181 Updated Dec 25, 2025

Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.

TypeScript 52,659 5,648 Updated Dec 25, 2025

MPI programming lessons in C and executable code examples

C 2,326 762 Updated Sep 22, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,079 4,672 Updated Dec 24, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 9,001 704 Updated Aug 18, 2024
Jupyter Notebook 930 227 Updated Jan 10, 2025

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 12,100 1,071 Updated Oct 29, 2025

PaddlePaddle custom device implementation (custom hardware integration for PaddlePaddle).

C++ 101 210 Updated Dec 25, 2025

My resume in LaTeX (template suited for new graduates)

TeX 753 147 Updated Aug 3, 2025

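🇨🇳 GitHub Chinese project rankings, with separate "Software | Resources" lists for each language to pinpoint good Chinese-language projects. Take what you need and learn efficiently.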

Java 104,730 13,434 Updated Oct 12, 2024

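Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.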

Python 74,571 12,033 Updated Jul 30, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,013 6,566 Updated Nov 11, 2025

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 8,609 1,978 Updated Dec 15, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,225 31,537 Updated Dec 24, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,161 1,698 Updated Dec 24, 2025

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 44,642 5,546 Updated Dec 3, 2025

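Hello Algo (《Hello 算法》): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in translation.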

Java 120,783 14,699 Updated Oct 30, 2025

TVM Documentation in Simplified Chinese

TypeScript 2,895 555 Updated Nov 21, 2025

Yinghan's Code Sample

Cuda 361 61 Updated Jul 25, 2022