Lists (32)
Sort Name ascending (A-Z)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
mircophone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
High-Resolution Image Synthesis with Latent Diffusion Models
Making large AI models cheaper, faster and more accessible
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
A cryptocurrency trading API with more than 100 exchanges in JavaScript / TypeScript / Python / C# / PHP / Go
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
A generative speech model for daily dialogue.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
A toolkit for developing and comparing reinforcement learning algorithms.
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…