wbn03

Follow

wbn wbn03

Follow

Believe yourself.

6 followers · 6 following

Achievements

Achievements

Stars

flashinfer-ai / mlsys26-agent-baseline

Python 34 11 Updated Mar 12, 2026

youhunwl / TVAPP

收集全网 Android TV电视盒子应用，涵盖影视、直播、K歌、工具、游戏等类型，整理优质APK资源，支持便捷下载与自动更新。提供安全验证、分类索引与兼容性标注，助力用户打造家庭影音娱乐中心！ ✅ TVBox/影视仓等影音壳接口配置源。

JavaScript 18,686 2,498 Updated Jun 22, 2026

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,853 543 Updated Jun 18, 2026

huggingface / optimum-quanto

A pytorch quantization backend for optimum

Python 1,044 90 Updated Jun 9, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,519 6,649 Updated Jun 22, 2026

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,321 1,992 Updated Jan 9, 2026

NVIDIA / warp

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

Python 6,783 535 Updated Jun 21, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,310 1,263 Updated Jun 22, 2026

fla-org / native-sparse-attention

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 1,006 53 Updated Feb 5, 2026

godotengine / godot

Godot Engine – Multi-platform 2D and 3D game engine

C++ 112,902 25,728 Updated Jun 19, 2026

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,398 1,058 Updated Jun 4, 2026

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,751 1,293 Updated Jun 15, 2026

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

8,007 288 Updated May 15, 2025

microsoft / calculator

Windows Calculator: A simple yet powerful calculator that ships with Windows

C++ 30,967 5,768 Updated Jun 18, 2026

kebijuelun / Awesome-LLM-Learning

Learning Large Language Model (LLM）(大语言模型学习)

TypeScript 951 114 Updated Jan 5, 2026

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,349 410 Updated Apr 20, 2026

shalldie / vscode-background

Bring background images to your vscode. vscode background 背景扩展插件。

TypeScript 1,836 162 Updated Jun 19, 2026

HeKun-NVIDIA / CUDA-Programming-Guide-in-Chinese

This is a Chinese translation of the CUDA programming guide

1,989 291 Updated Nov 13, 2024

haoliuhl / ringattention

Large Context Attention

Python 773 53 Updated Oct 13, 2025

fundamentalvision / Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,983 625 Updated May 16, 2024

HazyResearch / flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

C++ 355 35 Updated Dec 28, 2024

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,788 33,567 Updated Jun 22, 2026

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,933 2,486 Updated Jun 22, 2026

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 24,209 2,851 Updated Jun 20, 2026

ucb-bar / gemmini

Berkeley's Spatial Array Generator

Scala 1,360 270 Updated Jun 18, 2026

tpoisonooo / how-to-optimize-gemm

row-major matmul optimization

C++ 735 94 Updated May 14, 2026

apache / tvm

Open Machine Learning Compiler Framework

Python 13,484 3,896 Updated Jun 22, 2026

microsoft / nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 1,002 167 Updated Sep 19, 2024

lakshayg / erfinv

The inverse error function

C++ 16 5 Updated Oct 13, 2025

masszhou / spconv_lite

sparse convolution lib. derived from spconv

C++ 56 11 Updated Feb 3, 2021