ZHUI
  • University of Science and Technology of China
  • Shen Zhen

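MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.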

4,157 288 Updated Mar 22, 2026

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,460 1,228 Updated Jul 30, 2024

Animation engine for explanatory math videos

Python 85,687 7,194 Updated Mar 26, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,465 1,315 Updated Apr 1, 2026

Nano vLLM

Python 12,625 1,842 Updated Nov 3, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 22,857 4,133 Updated Apr 1, 2026

XLeRobot: Practical Dual-Arm Mobile Home Robot for $660

Python 4,900 525 Updated Mar 31, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,448 495 Updated Apr 1, 2026

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 813 137 Updated Apr 1, 2026

LeetGPU Challenges

Python 727 70 Updated Mar 31, 2026

Native Multimodal Models are World Learners

Python 1,490 61 Updated Dec 30, 2025

Documentation that simply works

Python 26,437 4,061 Updated Mar 27, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,252 263 Updated Mar 27, 2026

A framework for few-shot evaluation of language models.

Python 11,969 3,145 Updated Apr 1, 2026

Flexible Python configuration system. The last one you will ever need.

Python 2,372 153 Updated Mar 12, 2026

Hydra is a framework for elegantly configuring complex applications

Python 10,284 822 Updated Feb 7, 2026
Python 967 93 Updated Dec 11, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,865 1,528 Updated Mar 4, 2026

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Python 239 8 Updated May 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,374 3,551 Updated Apr 1, 2026

PDF textbooks for all primary school, middle school, high school, and university levels.

Roff 66,186 14,746 Updated Oct 18, 2025

Word memorization and English muscle-memory training software designed for keyboard workers

TypeScript 21,718 2,404 Updated Mar 9, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,400 135 Updated Mar 11, 2026

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 87,047 8,808 Updated Apr 1, 2026

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,930 3,054 Updated Dec 17, 2025
Python 214 8 Updated Oct 27, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,278 99 Updated Aug 28, 2025

CUDA Library Samples

C++ 2,366 457 Updated Mar 17, 2026

Main libjpeg-turbo repository

C 4,273 1,133 Updated Mar 31, 2026

PyExifTool (active PyPI project) - A Python library to communicate with an instance of Phil Harvey's ExifTool command-line application. Runs one process with special -stay_open flag, and pipes data…

Python 196 24 Updated Nov 26, 2023