-
-
Amphion Public
Forked from open-mmlab/AmphionAmphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Python MIT License UpdatedDec 5, 2025 -
BabelDOC Public
Forked from funstory-ai/BabelDOCYet Another Document Translator
Python GNU Affero General Public License v3.0 UpdatedDec 1, 2025 -
awesome-public-datasets Public
Forked from awesomedata/awesome-public-datasetsA topic-centric list of high-quality open datasets in public domains. By everyone, for everyone!
MIT License UpdatedNov 29, 2025 -
PaddleOCR2Pytorch Public
Forked from frotms/PaddleOCR2PytorchPaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
Python Apache License 2.0 UpdatedNov 18, 2025 -
ART Public
Forked from OpenPipe/ARTAgent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!
Python Apache License 2.0 UpdatedAug 12, 2025 -
dots.ocr Public
Forked from rednote-hilab/dots.ocrMultilingual Document Layout Parsing in a Single Vision-Language Model
Python MIT License UpdatedAug 12, 2025 -
AI-Media2Doc Public
Forked from hanshuaikang/AI-Media2Doc一键将音视频转化为小红书/公众号/知识笔记/思维导图/视频字幕等各种风格的文档。
Vue MIT License UpdatedJul 23, 2025 -
olmocr Public
Forked from allenai/olmocrToolkit for linearizing PDFs for LLM datasets/training
Python Apache License 2.0 UpdatedJul 14, 2025 -
ai-audio-datasets Public
Forked from Yuan-ManX/ai-audio-datasetsAI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
MIT License UpdatedJul 8, 2025 -
fonts Public
Forked from google/fontsFont files available from Google Fonts, and a public issue tracker for all things Google Fonts
HTML UpdatedJul 7, 2025 -
-
ai_wiki Public
Forked from charliedream1/ai_wiki《AI全栈-全网优秀资源搜集站》:搜集全网优秀资源,记载工程实践问题的解决策略与关键要点,分享各种实用案例,追踪前沿技术发展,囊括 AI 全栈知识,涵盖大模型、编程技术、机器学习、深度学习、强化学习、图神经网络、语音识别、NLP 及图像识别等领域
Jupyter Notebook UpdatedJul 6, 2025 -
RAG-Anything Public
Forked from HKUDS/RAG-Anything"RAG-Anything: All-in-One RAG System"
Python MIT License UpdatedJul 5, 2025 -
ACE-Step Public
Forked from ace-step/ACE-StepACE-Step: A Step Towards Music Generation Foundation Model
Python Apache License 2.0 UpdatedJun 27, 2025 -
docling Public
Forked from docling-project/doclingGet your documents ready for gen AI
Python MIT License UpdatedJun 25, 2025 -
GPT-SoVITS Public
Forked from RVC-Boss/GPT-SoVITS1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Python MIT License UpdatedJun 15, 2025 -
nerd-fonts Public
Forked from ryanoasis/nerd-fontsIconic font aggregator, collection, & patcher. 3,600+ icons, 50+ patched fonts: Hack, Source Code Pro, more. Glyph collections: Font Awesome, Material Design Icons, Octicons, & more
CSS Other UpdatedJun 13, 2025 -
Speech-AI-Forge Public
Forked from lenML/Speech-AI-Forge🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Python GNU Affero General Public License v3.0 UpdatedJun 7, 2025 -
YuE Public
Forked from multimodal-art-projection/YuEYuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Python Apache License 2.0 UpdatedJun 4, 2025 -
Deep-Live-Cam Public
Forked from hacksider/Deep-Live-Camreal time face swap and one-click video deepfake with only a single image
Python GNU Affero General Public License v3.0 UpdatedJun 3, 2025 -
opik Public
Forked from comet-ml/opikDebug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Python Apache License 2.0 UpdatedJun 3, 2025 -
F5R-TTS Public
Forked from FrontierLabs/F5R-TTSOfficial code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"
Python MIT License UpdatedJun 3, 2025 -
agenticSeek Public
Forked from Fosowl/agenticSeekFully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
Python GNU General Public License v3.0 UpdatedJun 3, 2025 -
WeClone Public
Forked from xming521/WeClone🚀 One-stop solution for creating your digital avatar from chat logs 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. 从聊天…
Python GNU Affero General Public License v3.0 UpdatedJun 2, 2025 -
CosyVoice Public
Forked from FunAudioLLM/CosyVoiceMulti-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python Apache License 2.0 UpdatedJun 2, 2025 -
ragflow Public
Forked from infiniflow/ragflowRAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
TypeScript Apache License 2.0 UpdatedMay 31, 2025 -
voice-pro Public
Forked from abus-aikorea/voice-proGradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…
Python MIT License UpdatedMay 31, 2025 -
Qwen-Agent Public
Forked from QwenLM/Qwen-AgentAgent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Python Apache License 2.0 UpdatedMay 29, 2025 -
HelloGitHub Public
Forked from 521xueweihan/HelloGitHub分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Python UpdatedMay 28, 2025