-
Microsoft
- Sunnyvale
Stars
A high-performance and light-weight router for vLLM large scale deployment
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Eval Protocol (EP) is an open solution for doing reinforcement learning fine-tuning on existing agents — across any language, container, or framework.
AI-powered code review automation using Kaito as the backend. A fork of qodo-ai/pr-agent enhanced with Kaito for better contextual understanding and more accurate PR reviews.
A Model Context Protocol (MCP) server that enables AI assistants to interact with AKS clusters. It serves as a bridge between AI tools (like Claude, Cursor, and GitHub Copilot) and AKS.
A powerful AI coding agent. Built for the terminal.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
provide layer 3 and layer 7 network connectivity among pods in different physical regions
📖 A collection of pure bash alternatives to external processes.
A novel container runtime, aka confidential container, for cloud-native confidential computing and enclave runtime ecosystem.
Cluster API Provider for Nested Clusters
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
OpenYurt - Extending your native Kubernetes to edge(project under CNCF)
Production-Grade Container Scheduling and Management
Run your deep learning workloads on Kubernetes more easily and efficiently.
A working place for multi-tenancy related proposals and prototypes.