DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,905 2,230 Updated Nov 6, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 15,813 1,195 Updated Nov 4, 2025

zai-org / ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,696 1,838 Updated Jun 27, 2024

GaiZhenbiao / ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,437 2,268 Updated Aug 15, 2025

OpenEthan / SMSBoom

SMSBoom - Deprecate: Due to judicial reasons, the repository has been suspended!

Python 15,341 3,654 Updated Mar 20, 2024

bbfamily / abu

阿布量化交易系统(股票，期权，期货，比特币，机器学习) 基于python的开源量化交易，量化投资架构

Python 15,233 4,251 Updated Mar 11, 2025

browser-use / web-ui

🖥️ Run AI Agent in your browser.

Python 15,124 2,622 Updated Aug 31, 2025

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages!

Python 14,865 1,140 Updated Nov 5, 2025

dagster-io / dagster

An orchestration platform for the development, production, and observation of data assets.

Python 14,349 1,874 Updated Nov 6, 2025

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,238 989 Updated Jul 31, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 14,099 3,246 Updated Nov 6, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,351 1,352 Updated Oct 1, 2025

astral-sh / ty

An extremely fast Python type checker and language server, written in Rust.

Python 13,303 132 Updated Nov 3, 2025

pydantic / pydantic-ai

GenAI Agent Framework, the Pydantic way

Python 13,226 1,359 Updated Nov 6, 2025

andrewyng / aisuite

Simple, unified interface to multiple Generative AI providers

Python 12,712 1,292 Updated Nov 5, 2025

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,330 1,947 Updated Oct 20, 2025

vipstone / faceai

一款入门级的人脸、视频、文字检测以及识别的项目.

Python 11,041 2,517 Updated Apr 16, 2020

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,913 946 Updated Nov 6, 2025