i-Taozi

🌔

working

Tonic i-Taozi

🌔

working

Pre-train && RL train

6 followers · 5 following

Achievements

Stars

scitix / SiMM

SiMM: Scalable in-Memory Middleware

C++ 40 1 Updated Apr 20, 2026

NVIDIA-NeMo / Megatron-Bridge

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 722 361 Updated Jun 13, 2026

iflow-ai / iflow-cli

iFlow cli is a comprehensive command-line intelligence that embeds in your terminal, analyzes your repositories, does coding tasks, interprets your needs across contexts, and boosts efficiency by p…

Shell 5,127 420 Updated Mar 20, 2026

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,549 254 Updated Jun 13, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,104 891 Updated Jun 13, 2026

Zhaojp-Frank / AwesomePaper-for-AI

Awesome system papers for AI

18 Updated Jun 13, 2026

infinigence / FUSCO

High-performance distributed data shuffling (all-to-all) library for MoE training and inference

Python 123 11 Updated Mar 7, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,385 698 Updated May 17, 2026

andylin-hao / RLinf

Forked from RLinf/RLinf

RLinf is a flexible, scalable and open-source infrastructure designed for reinforcement-learning (RL) post-training of foundation models — including large language models (LLMs), vision-language mo…

Python 2 Updated Jun 13, 2026

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,769 526 Updated Jun 13, 2026

infinigence / Semi-PD

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 126 15 Updated Dec 25, 2025

infinigence / SpecEE

Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)

C++ 75 9 Updated Apr 25, 2025

deepseek-ai / DeepSeek-V3

Python 103,748 16,734 Updated Aug 28, 2025

infinigence / Infini-Megrez

339 20 Updated Oct 11, 2025

Infrasys-AI / AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,925 2,396 Updated Sep 3, 2025

Lightning-AI / pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 31,185 3,736 Updated Jun 10, 2026

NVIDIA / cudnn-frontend

cuDNN Frontend is NVIDIA's modern, open-source entry point to the cuDNN library and a growing collection of high-performance open-source kernels.

Python 845 183 Updated Jun 11, 2026

ggml-org / llama.cpp

LLM inference in C/C++

C++ 116,280 19,518 Updated Jun 13, 2026

dmlc / mshadow

Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning

C++ 1,117 428 Updated Aug 4, 2019

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,860 2,464 Updated Jun 13, 2026

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 13,065 2,375 Updated Jun 3, 2026

MegEngine / mperf

mperf是一个面向移动/嵌入式平台的算子性能调优工具箱

C++ 194 32 Updated Aug 17, 2023

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 24,128 2,826 Updated Jun 10, 2026

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,397 4,511 Updated May 25, 2026

ARM-software / libGPUInfo

A utility library for application developers to query the configuration of the Arm Immortalis GPU or Arm Mali GPU present in their system.

C++ 67 12 Updated Apr 21, 2026

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 70,873 8,365 Updated Jan 25, 2026

microsoft / ArchProbe

A profiler to disclose and quantify hardware features on GPUs.

C++ 176 25 Updated May 15, 2022

XiaoYaoYouUSTC / Cminusf-Compiler

中国科学技术大学编译原理课程实验项目

C++ 48 18 Updated Jun 14, 2023

MegEngine / MegEngine

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

C++ 4,809 549 Updated Oct 24, 2024

He-Ze / HPC-Lab-SYSU

2020秋中山大学高性能计算课程课件与作业

C++ 48 18 Updated Jan 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tonic i-Taozi

Achievements

Achievements

Block or report i-Taozi

Stars

scitix / SiMM

NVIDIA-NeMo / Megatron-Bridge

iflow-ai / iflow-cli

radixark / miles

THUDM / slime

Zhaojp-Frank / AwesomePaper-for-AI

infinigence / FUSCO

sgl-project / mini-sglang

andylin-hao / RLinf

RLinf / RLinf

infinigence / Semi-PD

infinigence / SpecEE

deepseek-ai / DeepSeek-V3

infinigence / Infini-Megrez

Infrasys-AI / AISystem

Lightning-AI / pytorch-lightning

NVIDIA / cudnn-frontend

ggml-org / llama.cpp

dmlc / mshadow

NVIDIA / TensorRT-LLM

NVIDIA / TensorRT

MegEngine / mperf

Dao-AILab / flash-attention

hpcaitech / ColossalAI

ARM-software / libGPUInfo

binary-husky / gpt_academic

microsoft / ArchProbe

XiaoYaoYouUSTC / Cminusf-Compiler

MegEngine / MegEngine

He-Ze / HPC-Lab-SYSU