AI-Native Engineering Leader · Director of Software Engineering @ IndustrialMind.ai
17 years of backend engineering, 3 years shipping LLM systems in production. I build multi-agent platforms, LLM observability / AIOps closed loops, and eval-driven AI quality.
- "AI Manager" multi-agent orchestration — autonomous task generation with closed-loop execution (plan → execute → verify → retry), MCP-based tool integration
- LLM observability & self-healing ops — SigNoz / OpenTelemetry across every agent step; on-call agents that query logs & traces, localize root cause, and verify recovery
- Eval-driven AI quality — rag-eval / agent-eval suites (golden datasets, LLM-as-judge, trajectory assertions), CI-gated releases
- kRPC — proto-free gRPC middleware; TypeScript & Dart clients auto-generated from backend interfaces, end-to-end HTTP/2
- 1BRC (One Billion Row Challenge) — processed 1 billion rows in < 10 s with Java 21 + GraalVM Native (official certificate)
- Tsinghua Rust OS Training Camp (Fall 2024) — top 15% graduate, 1st place in memory-allocator contest, invited back as TA
- Previously Alibaba / Ant Financial (P7), iHerb — architectures serving 10,000+ QPS at 99.99% availability, one algorithm patent (CN108733825A)
Java · Python · Rust · Kubernetes / Istio · Multi-Agent · MCP · RAG · Milvus · SigNoz / OpenTelemetry · GraalVM