Skip to content
View StLeoX's full-sized avatar
  • huawei-cloudnative
  • Hangzhou

Block or report StLeoX

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

非线智能 NoneLinear - ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-…

6,199 251 Updated Jun 18, 2026

My learning notes for ML SYS.

Python 6,568 449 Updated Jun 18, 2026

Scale-to-zero LLM serving on Kubernetes without adopting a platform — one CRD + KEDA, vendor-neutral across NVIDIA & Ascend.

Go 6 5 Updated Jun 21, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,883 607 Updated Jun 23, 2026

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 273 15 Updated Mar 19, 2026

MiMo Code: Where Models and Agents Co-Evolve

TypeScript 10,423 978 Updated Jun 23, 2026

The semantic layer that makes enterprise data understandable to AI agents — model entities and relations once, query through SPL/MCP/REST, and connect telemetry, services, and business objects in o…

Go 188 36 Updated Jun 23, 2026

旅行行程规划技能:规划 → 小红书调研 → 交互式地图页面 | Agent Skill for trip planning with 小红书 research and interactive map generation

HTML 73 11 Updated May 24, 2026

AI-powered travel hacking and search with cash, points, miles, and award flights. Drop-in skills and MCP servers for Claude, Codex, and OpenCode.

Python 556 54 Updated May 2, 2026

Build your own AI SRE agents. The open source toolkit for the AI era.

Python 7,394 976 Updated Jun 23, 2026

MCP server connecting to Kubernetes

Go 382 57 Updated Dec 22, 2025

Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling

Go 225 74 Updated Jun 23, 2026

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 388 107 Updated Jun 22, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,428 540 Updated Jun 23, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,325 1,265 Updated Jun 23, 2026

LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…

TypeScript 12,534 1,511 Updated Jun 23, 2026

AI coding agent skill for deep architectural analysis of open-source projects | 开源项目深度架构分析,一句话生成专业级分析报告

423 64 Updated Apr 27, 2026

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Go 5,608 1,539 Updated Jun 22, 2026

AISBench Benchmark is a model evaluation tool built on OpenCompass, compatible with OpenCompass’s configuration system, dataset structure, and model backend implementation, while extending support …

Python 115 46 Updated Jun 23, 2026

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 20,003 3,056 Updated Jun 11, 2026

An autonomous agent for deep financial research

TypeScript 27,149 3,376 Updated Jun 15, 2026

AgentRun CLI (agentrun / ar) — A command-line tool for managing AI agent infrastructure on the AgentRun platform.

Python 20 7 Updated Jun 22, 2026

The lightest AI sandbox. A process-based sandbox for Linux, no container, no VM, no privilege, no prompt injection

Rust 234 27 Updated Jun 23, 2026

A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.

Rust 1,429 117 Updated Jun 5, 2026
Go 149 80 Updated Jun 22, 2026

Kubernetes-native AI serving platform for scalable model serving.

Go 379 138 Updated Jun 23, 2026

An open source template for building cloud agents.

TypeScript 5,675 739 Updated Jun 17, 2026

Early watch is a change safety system for Kubernetes

Go 48 1 Updated Jun 19, 2026

🍅 World's neatest Pomodoro timer for macOS menu bar

Swift 3,386 191 Updated May 29, 2026

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

Python 65,740 6,493 Updated Jun 17, 2026
Next