Skip to content
View tonyluj's full-sized avatar
  • Alibaba Cloud
  • Hangzhou China
  • 20:23 (UTC +08:00)
  • X @tonyluj

Block or report tonyluj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A high-performance and light-weight router for vLLM large scale deployment

Rust 177 60 Updated Mar 31, 2026

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,080 164 Updated Apr 4, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 347,699 69,415 Updated Apr 4, 2026

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 49,651 2,225 Updated Apr 4, 2026

The agent engineering platform

Python 132,309 21,828 Updated Apr 4, 2026

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,480 205 Updated Apr 3, 2026

Rime 配置:雾凇拼音 | 长期维护的简体词库

Lua 16,350 1,045 Updated Apr 2, 2026

A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust

Rust 25,281 1,325 Updated Apr 1, 2026

A collection of 100+ specialized Claude Code subagents covering a wide range of development use cases

Shell 16,202 1,832 Updated Apr 1, 2026

Ultimate camera streaming application

Go 12,723 1,010 Updated Mar 23, 2026

IronClaw is OpenClaw inspired implementation in Rust focused on privacy and security

Rust 11,393 1,308 Updated Apr 4, 2026

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 14,417 1,902 Updated Apr 3, 2026

Shepherd Model Gateway Claude Code Skills

7 1 Updated Mar 9, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 161 64 Updated Apr 3, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,039 651 Updated Apr 3, 2026

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 947 131 Updated Jan 22, 2026

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 137 40 Updated Apr 4, 2026

LLMRouter: An Open-Source Library for LLM Routing

Python 1,596 146 Updated Mar 17, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,915 551 Updated Mar 13, 2026

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 25,933 2,719 Updated Apr 4, 2026

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,573 1,413 Updated Aug 28, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,653 752 Updated Mar 26, 2026

A framework for efficient model inference with omni-modality models

Python 4,110 683 Updated Apr 4, 2026

A Lightweight LLM Inference Performance Simulator

Python 67 19 Updated Mar 18, 2026

Contexts Optical Compression

Python 22,773 2,094 Updated Jan 27, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,412 5,173 Updated Apr 4, 2026

Shared data types for building collaborative software

JavaScript 21,568 756 Updated Apr 2, 2026

WebAssembly Micro Runtime (WAMR)

C 5,872 777 Updated Apr 2, 2026

A highly customable, adaptable, runtime agnostic and WASM/WASI friendly Gossip protocol (SWIM) which helps manage cluster membership and member failure detection.

Rust 128 8 Updated Feb 9, 2026
Next