Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
AI agents running research on single-GPU nanochat training automatically
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
No fortress, purely open ground. OpenManus is Coming.
The best-benchmarked open-source AI memory system. And it's free.
Build resilient language agents as graphs.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fully open reproduction of DeepSeek-R1
SGLang is a high-performance serving framework for large language models and multimodal models.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
🔥 MaxKB is an open-source platform for building enterprise-grade agents. A powerful, easy-to-use open-source enterprise-grade agent platform.
Train transformer language models with reinforcement learning.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
An open source implementation of CLIP.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Give your agents the power of the Hugging Face ecosystem
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.