Stars
DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm
TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.
Secure, Fast, and Extensible Sandbox runtime for AI agents.
Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents.
Open-source, secure environment with real-world tools for enterprise-grade agents.
Python & JS/TS SDK for running AI-generated code and code interpreting in your AI app
Kubernetes-native AI serving platform for scalable model serving.
A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP Scan, AI Infra Scan and LLM jailbreak evaluation.
🎥 Make videos programmatically with React
A Datacenter Scale Distributed Inference Serving Framework
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Build Real-Time Knowledge Graphs for AI Agents
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A toolkit to run Ray applications on Kubernetes
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
A high-throughput and memory-efficient inference and serving engine for LLMs
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
MCPCAN is a centralized management platform for MCP services. It deploys each MCP service as a container. The platform supports container monitoring and MCP service token verif…
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A Go framework for building production agent systems with graph workflows, tools, memory, A2A, AG-UI, MCP, evaluation, and observability.
A lightweight 2D graphics library for modern GPUs, delivering high-performance text, image, and vector rendering across major platforms.
KubeBlocks is a Kubernetes Operator designed to manage a variety of databases and streaming systems, including MySQL, PostgreSQL, MongoDB, Redis, RabbitMQ, RocketMQ, and more, within Kubernetes env…
Cost-efficient and pluggable Infrastructure components for GenAI inference