Skip to content
View wbn03's full-sized avatar

Block or report wbn03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

收集全网 Android TV电视盒子应用,涵盖影视、直播、K歌、工具、游戏等类型,整理优质APK资源,支持便捷下载与自动更新。提供安全验证、分类索引与兼容性标注,助力用户打造家庭影音娱乐中心! ✅ TVBox/影视仓等影音壳接口配置源。

JavaScript 18,686 2,498 Updated Jun 22, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,853 543 Updated Jun 18, 2026

A pytorch quantization backend for optimum

Python 1,044 90 Updated Jun 9, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,519 6,649 Updated Jun 22, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,321 1,992 Updated Jan 9, 2026

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

Python 6,783 535 Updated Jun 21, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,310 1,263 Updated Jun 22, 2026

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 1,006 53 Updated Feb 5, 2026

Godot Engine – Multi-platform 2D and 3D game engine

C++ 112,902 25,728 Updated Jun 19, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,398 1,058 Updated Jun 4, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,751 1,293 Updated Jun 15, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

8,007 288 Updated May 15, 2025

Windows Calculator: A simple yet powerful calculator that ships with Windows

C++ 30,967 5,768 Updated Jun 18, 2026

Learning Large Language Model (LLM)(大语言模型学习)

TypeScript 951 114 Updated Jan 5, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,349 410 Updated Apr 20, 2026

Bring background images to your vscode. vscode background 背景扩展插件。

TypeScript 1,836 162 Updated Jun 19, 2026

This is a Chinese translation of the CUDA programming guide

1,989 291 Updated Nov 13, 2024

Large Context Attention

Python 773 53 Updated Oct 13, 2025

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,983 625 Updated May 16, 2024

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

C++ 355 35 Updated Dec 28, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,788 33,567 Updated Jun 22, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,933 2,486 Updated Jun 22, 2026

Fast and memory-efficient exact attention

Python 24,209 2,851 Updated Jun 20, 2026

Berkeley's Spatial Array Generator

Scala 1,360 270 Updated Jun 18, 2026

row-major matmul optimization

C++ 735 94 Updated May 14, 2026

Open Machine Learning Compiler Framework

Python 13,484 3,896 Updated Jun 22, 2026

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 1,002 167 Updated Sep 19, 2024

The inverse error function

C++ 16 5 Updated Oct 13, 2025

sparse convolution lib. derived from spconv

C++ 56 11 Updated Feb 3, 2021
Next