gxkevin

gxkevin

Stars

ravikumar1907 / llm-ebpf-tracer

C 27 4 Updated Jun 5, 2025

CloudDetail / apo

APO is a comprehensive observability platform combining OpenTelemetry with eBPF. Leveraging LLM to enable automated analysis and troubleshooting 🚀.

Go 376 54 Updated Mar 25, 2026

eunomia-bpf / MCPtrace

MCP server: using eBPF to tracing your kernel

Python 65 9 Updated Feb 12, 2026

eunomia-bpf / agentsight

Zero instrucment LLM and AI agent (e.g. claude code, gemini-cli) observability in eBPF

C 261 38 Updated Mar 28, 2026

deepflowio / deepflow

eBPF Observability - Distributed Tracing and Profiling

Go 3,953 435 Updated Mar 30, 2026

alex-ilgayev / MCPSpy

MCP Monitoring with eBPF

C 505 76 Updated Jan 16, 2026

ccfos / huatuo

A cloud-native operating system observability project based on eBPF, incubated under CCF.

Go 1,152 66 Updated Mar 30, 2026

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,007 635 Updated Mar 30, 2026

aiming-lab / CITER

[COLM'25] CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

Python 19 3 Updated Jun 25, 2025

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 341,254 67,351 Updated Mar 30, 2026

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,851 376 Updated Mar 30, 2026

casys-kaist / LLMServingSim

LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure

Python 228 48 Updated Mar 13, 2026

Tokenomics-AI / Tokenomics

Make every token count — an experimental LLM inference layer that optimizes cost through caching, adaptive routing, and ML-assisted decision-making.

Python 1 Updated Jan 1, 2026

kali20gakki / msAgent

Python 20 5 Updated Mar 30, 2026

facebookincubator / dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …

C++ 368 81 Updated Mar 25, 2026

Mr-chen-05 / rules-2.1-optimized

企业级AI助手规则体系 - 基于agent-rules优化扩展，专为中国开发者打造，支持Augment、Cursor、Claude Code、Trae AI等主流AI工具的一键安装和配置

Batchfile 160 29 Updated Nov 7, 2025

ModelEngine-Group / app-platform

AppPlatform 是一个前沿的大模型应用工程，旨在通过集成的声明式编程和低代码配置工具，简化和优化大模型的训练与推理应用的开发过程。本工程为软件工程师和产品经理提供一个强大的、可扩展的环境，以支持从概念到部署的全流程 AI 应用开发。

Java 1,427 230 Updated Mar 13, 2026

EduAgentX-Remake / EduAgentX-BackEnd

基于Spring AI + LangGraph4j 工作流 + RAG 知识库 + Redis 高并发优化 + Dubbo微服务架构（7个独立服务）/单体架构+ Higress 云原生网关的教育智能体平台

Java 12 4 Updated Nov 16, 2025

TianheMICALab / SimCXL

A full-system, cycle-level simulator based on gem5 that provides complete support for all three CXL sub-protocols and all three types of CXL devices.

C++ 138 40 Updated Mar 4, 2026

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 57,571 4,766 Updated Mar 30, 2026

vllm-project / vllm-ascend

Community maintained hardware plugin for vLLM on Ascend

Python 1,847 996 Updated Mar 30, 2026

AdvancedCompiler / AdvancedCompiler

先进编译实验室的个人主页

C++ 214 22 Updated Oct 15, 2025

flagos-ai / FlagGems

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 935 300 Updated Mar 30, 2026

naginoa / LLMs_interview_notes

Forked from jackaduma/awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案

632 137 Updated Oct 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly