Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
real time face swap and one-click video deepfake with only a single image
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
LlamaIndex is the leading document agent and OCR platform
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
SGLang is a high-performance serving framework for large language models and multimodal models.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。
Open standard for machine learning interoperability
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Janus-Series: Unified Multimodal Understanding and Generation Models
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
An open-source tool-augmented conversational language model from Fudan University
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Refine high-quality datasets and visual AI models
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…
ModelScope: bring the notion of Model-as-a-Service to life.