Stars
🦜🔗 The platform for reliable agents.
Robust Speech Recognition via Large-Scale Weak Supervision
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
The definitive Web UI for local AI, with powerful features and easy setup.
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
A generative speech model for daily dialogue.
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Easily train a good VC model with voice data <= 10 mins!
SoftVC VITS Singing Voice Conversion
Rembg is a tool to remove images background
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Open Source AI Platform - AI Chat with advanced features that works with every LLM
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Yet another voice assistant, but alive.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
set prompt to divided region
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.