Skip to content
View zhuango's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peking

Block or report zhuango

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Open Source Machine Learning Framework for Everyone

C++ 192,328 74,959 Updated Nov 7, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 94,770 25,816 Updated Nov 7, 2025

LLM inference in C/C++

C++ 89,274 13,585 Updated Nov 7, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,369 11,091 Updated Nov 7, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,625 4,613 Updated Nov 7, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,890 540 Updated Nov 7, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,260 420 Updated Nov 7, 2025

Train transformer language models with reinforcement learning.

Python 16,199 2,279 Updated Nov 7, 2025

Models and examples built with TensorFlow

Python 77,670 45,426 Updated Nov 6, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,916 691 Updated Nov 6, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,200 186 Updated Nov 6, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,192 31,066 Updated Nov 6, 2025

Reference implementations of MLPerf® inference benchmarks

Python 1,480 588 Updated Nov 6, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,336 2,265 Updated Nov 6, 2025

scikit-learn: machine learning in Python

Python 63,937 26,408 Updated Nov 6, 2025

Awesome LLM compression research papers and tools.

1,700 109 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,994 3,929 Updated Nov 6, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,695 974 Updated Nov 6, 2025

Making large AI models cheaper, faster and more accessible

Python 41,223 4,538 Updated Nov 6, 2025

Minimal examples of data structures and algorithms in Python

Python 24,821 4,699 Updated Nov 6, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,634 1,072 Updated Nov 6, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 19,997 2,084 Updated Nov 5, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

3,042 201 Updated Nov 5, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,681 440 Updated Nov 4, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,726 793 Updated Nov 4, 2025

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,020 283 Updated Nov 4, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,330 807 Updated Oct 31, 2025

A framework for few-shot evaluation of language models.

Python 10,550 2,832 Updated Oct 29, 2025

Algorithms, 4th edition textbook code and libraries

Java 7,512 2,680 Updated Oct 29, 2025

[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Python 125 7 Updated Oct 29, 2025
Next