Starred repositories
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
The agent that grows with you
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
JumpServer is an open-source Privileged Access Management (PAM) platform that provides DevOps and IT teams with on-demand and secure access to SSH, RDP, Kubernetes, Database and RemoteApp endpoints…
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs dir…
OpenClaw 中文官方技能库 | 翻译自 Clawdbot 官方技能,按场景分类整理,支持中文自然语言调用
从零开始玩转OpenClaw:最全面的中文教程,涵盖安装、配置、实战案例和避坑指南(github版)
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Kubernetes application example tutorials
MooreThreads / MT-DeepEP
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
A simple, open source bilingual translation extension & Greasemonkey script (一个简约、开源的 双语对照翻译扩展 & 油猴脚本)
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
Kubernetes-native AI serving platform for scalable model serving.
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang/triton.
PandaWiki 是一款 AI 大模型驱动的开源知识库搭建系统,帮助你快速构建智能化的 产品文档、技术文档、FAQ、博客系统,借助大模型的力量为你提供 AI 创作、AI 问答、AI 搜索等能力。
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
antgroup / ant-ray
Forked from ray-project/rayRay is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top …
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
Offline optimization of your disaggregated Dynamo graph