Skip to content
View tonyluj's full-sized avatar
  • Alibaba Cloud
  • Hangzhou China
  • 07:32 (UTC +08:00)
  • X @tonyluj

Block or report tonyluj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The highest-scoring AI memory system ever benchmarked. And it's free.

Python 41,721 5,331 Updated Apr 11, 2026

A high-performance and light-weight router for vLLM large scale deployment

Rust 188 68 Updated Mar 31, 2026

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,083 167 Updated Apr 11, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 354,923 71,779 Updated Apr 11, 2026

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 50,434 2,317 Updated Apr 11, 2026

The agent engineering platform

Python 133,194 22,002 Updated Apr 11, 2026

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,498 206 Updated Apr 10, 2026

Rime 配置:雾凇拼音 | 长期维护的简体词库

Lua 16,485 1,055 Updated Apr 6, 2026

A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust

Rust 25,469 1,344 Updated Apr 1, 2026

A collection of 100+ specialized Claude Code subagents covering a wide range of development use cases

Shell 16,982 1,939 Updated Apr 1, 2026

Ultimate camera streaming application

Go 12,780 1,015 Updated Mar 23, 2026

IronClaw is OpenClaw inspired implementation in Rust focused on privacy and security

Rust 11,642 1,336 Updated Apr 11, 2026

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 15,044 2,022 Updated Apr 11, 2026

Shepherd Model Gateway Claude Code Skills

7 1 Updated Mar 9, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 181 71 Updated Apr 11, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,074 665 Updated Apr 11, 2026

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 949 131 Updated Jan 22, 2026

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 156 40 Updated Apr 11, 2026

LLMRouter: An Open-Source Library for LLM Routing

Python 1,632 151 Updated Mar 17, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,960 563 Updated Mar 13, 2026

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 26,835 2,831 Updated Apr 11, 2026

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,573 1,411 Updated Aug 28, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,756 761 Updated Mar 26, 2026

A framework for efficient model inference with omni-modality models

Python 4,232 734 Updated Apr 11, 2026

A Lightweight LLM Inference Performance Simulator

Python 67 19 Updated Mar 18, 2026

Contexts Optical Compression

Python 22,808 2,097 Updated Jan 27, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,661 5,288 Updated Apr 11, 2026

Shared data types for building collaborative software

JavaScript 21,615 760 Updated Apr 11, 2026

WebAssembly Micro Runtime (WAMR)

C 5,884 785 Updated Apr 8, 2026
Next