Stars
Free and Open Source Enterprise Resource Planning (ERP)
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/MCP/Docker/Zotero
A modular graph-based Retrieval-Augmented Generation (RAG) system
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
😘 Makes you "love" GitHub by fixing broken images and slow loading when accessing it. (No installation required)
Industry leading face manipulation platform
Official inference repo for FLUX.1 models
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
GUI for a Vocal Remover that uses Deep Neural Networks.
Build resilient language agents as graphs.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
⚡️HivisionIDPhotos: a lightweight and efficient AI tool for creating ID photos.
Realtime Voice Changer
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A Deep Learning based project for colorizing and restoring old images (and video!)
WebUI extension for ControlNet
State-of-the-Art Text Embeddings
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Experience macOS just like before
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team
Translate the video from one language to another and add dubbing.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation