StLeoX

Leo Xie StLeoX

65 followers · 163 following

huawei-cloudnative
Hangzhou

Lists (7)

Starred repositories

jeinlee1991 / chinese-llm-benchmark

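NoneLinear (非线智能) - ReLE evaluation: a continuously updated capability benchmark for Chinese AI large language models. It currently covers 374 models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-…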

6,199 251 Updated Jun 18, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,568 449 Updated Jun 18, 2026

hearth-project / hearth

Scale-to-zero LLM serving on Kubernetes without adopting a platform — one CRD + KEDA, vendor-neutral across NVIDIA & Ascend.

Go 6 5 Updated Jun 21, 2026

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,883 607 Updated Jun 23, 2026

HPMLL / BurstGPT

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 273 15 Updated Mar 19, 2026

XiaomiMiMo / MiMo-Code

MiMo Code: Where Models and Agents Co-Evolve

TypeScript 10,423 978 Updated Jun 23, 2026

alibaba / UnifiedModel

The semantic layer that makes enterprise data understandable to AI agents — model entities and relations once, query through SPL/MCP/REST, and connect telemetry, services, and business objects in o…

Go 188 36 Updated Jun 23, 2026

hiyeshu / trip-map-builder

Agent Skill for trip planning: planning → Xiaohongshu (小红书) research → interactive map page generation

HTML 73 11 Updated May 24, 2026

borski / travel-hacking-toolkit

AI-powered travel hacking and search with cash, points, miles, and award flights. Drop-in skills and MCP servers for Claude, Codex, and OpenCode.

Python 556 54 Updated May 2, 2026

Tracer-Cloud / opensre

Build your own AI SRE agents. The open source toolkit for the AI era.

Python 7,394 976 Updated Jun 23, 2026

strowk / mcp-k8s-go

MCP server connecting to Kubernetes

Go 382 57 Updated Dec 22, 2025

ai-dynamo / grove

Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling

Go 225 74 Updated Jun 23, 2026

ai-dynamo / aiperf

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 388 107 Updated Jun 22, 2026

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,428 540 Updated Jun 23, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,325 1,265 Updated Jun 23, 2026

nashsu / llm_wiki

LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…

TypeScript 12,534 1,511 Updated Jun 23, 2026

yzddmr6 / repo-analyzer

AI coding agent skill for deep architectural analysis of open-source projects; generates a professional-grade analysis report from a one-sentence prompt

423 64 Updated Apr 27, 2026

kserve / kserve

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Go 5,608 1,539 Updated Jun 22, 2026

AISBench / benchmark

AISBench Benchmark is a model evaluation tool built on OpenCompass, compatible with OpenCompass’s configuration system, dataset structure, and model backend implementation, while extending support …

Python 115 46 Updated Jun 23, 2026