tonyluj
  • Alibaba Cloud
  • Hangzhou China
  • UTC +08:00
  • X @tonyluj

Starred repositories

A high-performance and light-weight router for vLLM large scale deployment

Rust 177 61 Updated Mar 31, 2026

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,080 164 Updated Apr 4, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 347,999 69,531 Updated Apr 4, 2026

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 49,685 2,230 Updated Apr 4, 2026

The agent engineering platform

Python 132,356 21,835 Updated Apr 4, 2026

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,480 205 Updated Apr 3, 2026

Rime configuration: 雾凇拼音 | A long-maintained Simplified Chinese dictionary

Lua 16,356 1,045 Updated Apr 2, 2026

A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust

Rust 25,293 1,327 Updated Apr 1, 2026

A collection of 100+ specialized Claude Code subagents covering a wide range of development use cases

Shell 16,223 1,831 Updated Apr 1, 2026

Ultimate camera streaming application

Go 12,723 1,009 Updated Mar 23, 2026

IronClaw is an OpenClaw-inspired implementation in Rust focused on privacy and security

Rust 11,401 1,310 Updated Apr 4, 2026

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 14,432 1,905 Updated Apr 4, 2026

Shepherd Model Gateway Claude Code Skills

7 1 Updated Mar 9, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 162 64 Updated Apr 4, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,040 652 Updated Apr 3, 2026

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 947 131 Updated Jan 22, 2026

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 137 40 Updated Apr 4, 2026

LLMRouter: An Open-Source Library for LLM Routing

Python 1,597 146 Updated Mar 17, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,917 552 Updated Mar 13, 2026

A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 25,958 2,725 Updated Apr 4, 2026

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,573 1,413 Updated Aug 28, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,654 752 Updated Mar 26, 2026

A framework for efficient model inference with omni-modality models

Python 4,120 687 Updated Apr 4, 2026

A Lightweight LLM Inference Performance Simulator

Python 67 19 Updated Mar 18, 2026

Contexts Optical Compression

Python 22,776 2,094 Updated Jan 27, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,422 5,177 Updated Apr 4, 2026

Shared data types for building collaborative software

JavaScript 21,568 756 Updated Apr 2, 2026

WebAssembly Micro Runtime (WAMR)

C 5,872 777 Updated Apr 2, 2026

A highly customizable, adaptable, runtime-agnostic, and WASM/WASI-friendly Gossip protocol (SWIM) that helps manage cluster membership and member failure detection.

Rust 128 8 Updated Feb 9, 2026