Stars
🤖💵 Agentic Payment Service for Open Agent Skills Ecosystem.
《动手学大模型Dive into LLMs》系列编程实践教程
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Added vLLM support to IndexTTS for faster inference.
MOSS-Speech is a true speech-to-speech large language model without text guidance.
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
The SoundTouch Library is originally written by Olli Parviainen in C++. Although a .NET wrapper library is available, this library aims to be a complete rewrite in C#.
参考GraphRag使用 Semantic Kernel 来实现的dotnet版本,可以使用NuGet开箱即用集成到项目中
C# implementation of LangChain. We try to be as close to the original as possible in terms of abstractions, but are open to new entities.
Thor is a powerful artificial intelligence model management tool, whose main purpose is to achieve unified management and use of multiple AI models. Through Thor, users can easily manage and utiliz…
Search + Chat = SearChat(AI Chat with Search), Support OpenAI/Anthropic/VertexAI/Gemini, DeepResearch, SearXNG, Docker. AI对话式搜索引擎,支持DeepResearch, 支持OpenAI/Anthropic/VertexAI/Gemini接口、聚合搜索引擎SearXNG,…
The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, e…
【更方便更安全的管理PandoraNext】通过手机端和电脑端使小白能快速部署属于自己的免费Open API中转站。tokensTool支持通过PandoraNext管理刷新所有token,支持分享,支持share_token,pool_token一键自定义放入oneapi。tokensTool全面支持PandoraNext部署方法且支持热部署,自定义后缀,登录黑名单IP和登录日志,保护隐私…
A Realtime Chat Application using flutter, Asp.Net Core Web Api, SignalR , WebRTC etc.
Bootstrap Blazor is an enterprise-level UI component library based on Bootstrap and Blazor.