Stars
Proxy that exposes Antigravity-provided Claude / Gemini models so they can be used in Claude Code and OpenClaw (Clawdbot)
High-performance distributed data shuffling (all-to-all) library for MoE training and inference
Run the latest vscode-server on RHEL/CentOS 7!
The Open-Source Data Annotation Platform
Open-source multimodal data annotation platform with AI auto-annotation support.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
A lightweight library for portable low-level GPU computation using WebGPU.
[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Cronicle V2 (Orchestra) community prototype
A high-throughput and memory-efficient inference and serving engine for LLMs
A Tiny Modern C++ Header That Brings a Unified Interface for Different Languages
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Download image from the Docker Hub HTTPS API
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
An unprofessional open-source Chinese font derived from Fontworks' Klee One.