Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 94,790 25,824 Updated Nov 7, 2025

Lightweight coding agent that runs in your terminal

Rust 49,984 6,180 Updated Nov 7, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,444 11,109 Updated Nov 7, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,913 3,233 Updated Nov 7, 2025

Ongoing research training transformer models at scale

Python 14,127 3,252 Updated Nov 7, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,217 31,076 Updated Nov 7, 2025

A library for mechanistic interpretability of GPT-style language models

Python 2,716 465 Updated Nov 7, 2025

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 680 118 Updated Nov 7, 2025

Async RL Training at Scale

Python 748 125 Updated Nov 7, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,891 540 Updated Nov 7, 2025

Linux kernel source tree

C 206,514 58,298 Updated Nov 7, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,563 2,536 Updated Nov 7, 2025

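[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AIGC algorithm engineers. Covers practical interview and written-test experience and core knowledge across the AI industry, including AIGC, traditional deep learning, autonomous driving, AI agents, machine learning, computer vision, natural language processing, reinforcement learning, big data mining, embodied AI, the metaverse, and AGI.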

2,484 284 Updated Nov 7, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,019 3,931 Updated Nov 7, 2025

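PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle (飞桨): high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)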

C++ 23,375 5,869 Updated Nov 7, 2025

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 27,585 8,818 Updated Nov 7, 2025

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 23,741 3,516 Updated Nov 7, 2025

Making large AI models cheaper, faster and more accessible

Python 41,225 4,540 Updated Nov 7, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,872 301 Updated Nov 7, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,580 566 Updated Nov 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,459 1,117 Updated Nov 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,217 2,441 Updated Nov 7, 2025

Post-training with Tinker

Python 1,459 114 Updated Nov 7, 2025

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,526 1,441 Updated Nov 7, 2025

Building General-Purpose Robots Based on Embodied Foundation Model

Python 583 38 Updated Nov 7, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

1,471 99 Updated Nov 7, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,797 3,693 Updated Nov 7, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,632 4,613 Updated Nov 7, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 759 65 Updated Nov 7, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,945 324 Updated Nov 7, 2025