Skip to content
View tonyluj's full-sized avatar
  • Alibaba Cloud
  • Hangzhou China
  • 17:00 (UTC +08:00)
  • X @tonyluj

Block or report tonyluj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A high-performance and light-weight router for vLLM large scale deployment

Rust 177 61 Updated Mar 31, 2026

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,080 164 Updated Apr 4, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 348,419 69,683 Updated Apr 5, 2026

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 49,729 2,232 Updated Apr 5, 2026

The agent engineering platform

Python 132,411 21,845 Updated Apr 5, 2026

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,481 205 Updated Apr 3, 2026

Rime 配置:雾凇拼音 | 长期维护的简体词库

Lua 16,358 1,046 Updated Apr 2, 2026

A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust

Rust 25,300 1,328 Updated Apr 1, 2026

A collection of 100+ specialized Claude Code subagents covering a wide range of development use cases

Shell 16,270 1,838 Updated Apr 1, 2026

Ultimate camera streaming application

Go 12,727 1,009 Updated Mar 23, 2026

IronClaw is OpenClaw inspired implementation in Rust focused on privacy and security

Rust 11,420 1,313 Updated Apr 5, 2026

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 14,456 1,910 Updated Apr 4, 2026

Shepherd Model Gateway Claude Code Skills

7 1 Updated Mar 9, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 164 65 Updated Apr 5, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,041 653 Updated Apr 5, 2026

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 947 131 Updated Jan 22, 2026

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 137 40 Updated Apr 4, 2026

LLMRouter: An Open-Source Library for LLM Routing

Python 1,598 146 Updated Mar 17, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,921 553 Updated Mar 13, 2026

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 26,027 2,737 Updated Apr 4, 2026

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,572 1,413 Updated Aug 28, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,660 752 Updated Mar 26, 2026

A framework for efficient model inference with omni-modality models

Python 4,137 691 Updated Apr 5, 2026

A Lightweight LLM Inference Performance Simulator

Python 67 19 Updated Mar 18, 2026

Contexts Optical Compression

Python 22,782 2,093 Updated Jan 27, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,436 5,189 Updated Apr 5, 2026

Shared data types for building collaborative software

JavaScript 21,570 757 Updated Apr 2, 2026

WebAssembly Micro Runtime (WAMR)

C 5,875 780 Updated Apr 2, 2026

A highly customable, adaptable, runtime agnostic and WASM/WASI friendly Gossip protocol (SWIM) which helps manage cluster membership and member failure detection.

Rust 128 8 Updated Feb 9, 2026
Next