Skip to content
View ruitard's full-sized avatar
😶‍🌫️
Focusing
😶‍🌫️
Focusing
  • Harbin Institute of Technology
  • Shenzhen Guangdong, China

Organizations

@hit-at-sea @1604104se-hitwh

Block or report ruitard

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

gmp is a C++ metaprogramming library tailored for code generation at compile time.

C++ 38 2 Updated Apr 28, 2026

ArcLight: A Lightweight LLM Inference Framework

C++ 21 1 Updated Mar 24, 2026
Shell 81 46 Updated Apr 24, 2026

AIOS: AI Agent Operating System

Python 5,579 762 Updated Apr 23, 2026

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

C++ 869 68 Updated Apr 3, 2026

LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, su…

Python 346 34 Updated Apr 28, 2026

revng: the core repository of the rev.ng project

C++ 1,664 125 Updated Apr 27, 2026

FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang/triton.

C++ 252 58 Updated Apr 28, 2026

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,718 388 Updated Apr 9, 2026

Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖

Python 8,224 851 Updated Apr 28, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 10,806 1,091 Updated Apr 20, 2026

🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

C++ 4,404 777 Updated Mar 19, 2026

开源开发工具周刊

Makefile 197 19 Updated Apr 26, 2026

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

Rust 3,054 106 Updated Apr 27, 2026

3D astronomy and space exploration program.

Python 250 22 Updated Mar 2, 2026

Inference Llama 2 in one file of pure C

C 19,453 2,524 Updated Aug 6, 2024

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 1,002 231 Updated Apr 28, 2026

Python tool for converting files and office documents to Markdown.

Python 118,275 7,795 Updated Apr 20, 2026

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 4,926 514 Updated Apr 28, 2026

Open-source Pricing and Billing Infrastructure 🚀 Subscription management, Invoicing, Pricing, Usage-based billing, Cost limiting, Grandfathering, Experiments, Revenue analytics & Actionable insights

Rust 1,059 54 Updated Apr 28, 2026

Financial data platform for analysts, quants and AI agents.

Python 66,654 6,659 Updated Apr 28, 2026

Official inference framework for 1-bit LLMs

Python 38,631 3,498 Updated Mar 10, 2026

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 803 28 Updated Oct 13, 2025

Waydroid uses a container-based approach to boot a full Android system on a regular GNU/Linux system like Ubuntu.

Python 11,269 468 Updated Apr 26, 2026

Programmable debugger

Python 2,052 207 Updated Apr 27, 2026

Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.

Rust 650 79 Updated Apr 22, 2026

Rust keyboard firmware library with layers, macros, real-time keymap editing, wireless(BLE) and split support

Rust 1,604 176 Updated Apr 28, 2026

Handwriting synthesis with Harfbuzz WASM.

Rust 485 12 Updated Aug 22, 2024

Fast, flexible LLM inference

Rust 7,069 586 Updated Apr 15, 2026

Rust implementation of behavior trees for deterministic AI

Rust 442 25 Updated Mar 17, 2026
Next