Skip to content
View i-Taozi's full-sized avatar
🌔
working
🌔
working

Block or report i-Taozi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SiMM: Scalable in-Memory Middleware

C++ 40 1 Updated Apr 20, 2026

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 722 361 Updated Jun 13, 2026

iFlow cli is a comprehensive command-line intelligence that embeds in your terminal, analyzes your repositories, does coding tasks, interprets your needs across contexts, and boosts efficiency by p…

Shell 5,127 420 Updated Mar 20, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,549 254 Updated Jun 13, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,104 891 Updated Jun 13, 2026

Awesome system papers for AI

18 Updated Jun 13, 2026

High-performance distributed data shuffling (all-to-all) library for MoE training and inference

Python 123 11 Updated Mar 7, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,385 698 Updated May 17, 2026

RLinf is a flexible, scalable and open-source infrastructure designed for reinforcement-learning (RL) post-training of foundation models — including large language models (LLMs), vision-language mo…

Python 2 Updated Jun 13, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,769 526 Updated Jun 13, 2026

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 126 15 Updated Dec 25, 2025

Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)

C++ 75 9 Updated Apr 25, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,925 2,396 Updated Sep 3, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 31,185 3,736 Updated Jun 10, 2026

cuDNN Frontend is NVIDIA's modern, open-source entry point to the cuDNN library and a growing collection of high-performance open-source kernels.

Python 845 183 Updated Jun 11, 2026

LLM inference in C/C++

C++ 116,280 19,518 Updated Jun 13, 2026

Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning

C++ 1,117 428 Updated Aug 4, 2019

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,860 2,464 Updated Jun 13, 2026

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 13,065 2,375 Updated Jun 3, 2026

mperf是一个面向移动/嵌入式平台的算子性能调优工具箱

C++ 194 32 Updated Aug 17, 2023

Fast and memory-efficient exact attention

Python 24,128 2,826 Updated Jun 10, 2026

Making large AI models cheaper, faster and more accessible

Python 41,397 4,511 Updated May 25, 2026

A utility library for application developers to query the configuration of the Arm Immortalis GPU or Arm Mali GPU present in their system.

C++ 67 12 Updated Apr 21, 2026

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 70,873 8,365 Updated Jan 25, 2026

A profiler to disclose and quantify hardware features on GPUs.

C++ 176 25 Updated May 15, 2022

中国科学技术大学编译原理课程实验项目

C++ 48 18 Updated Jun 14, 2023

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

C++ 4,809 549 Updated Oct 24, 2024

2020秋中山大学高性能计算课程课件与作业

C++ 48 18 Updated Jan 25, 2021
Next