Starred repositories
OpenMMLab Detection Toolbox and Benchmark
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
State-of-the-art 2D and 3D Face Analysis Project
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
SGLang is a fast serving framework for large language models and vision language models.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Toolkit for linearizing PDFs for LLM datasets/training
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
SMSBoom - Deprecate: Due to judicial reasons, the repository has been suspended!
阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构
Generate audiobooks from e-books, voice cloning & 1107+ languages!
An orchestration platform for the development, production, and observation of data assets.
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
Ongoing research training transformer models at scale
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
An extremely fast Python type checker and language server, written in Rust.
GenAI Agent Framework, the Pydantic way
Simple, unified interface to multiple Generative AI providers
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"