Starred repositories
High-Quality Voice Cloning TTS for 600+ Languages
Magenta RealTime: An Open-Weights Live Music Model
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Claude Code skill that removes signs of AI-generated writing from text
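A multi-task real-time and scheduled monitoring and intelligent analysis system for Xianyu, built with Playwright and AI and equipped with a full-featured admin UI. Helps users find the products they want among Xianyu's vast listings.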
A command-line tool that plots the graph of any two-variable implicit equation or inequality, supporting both Cartesian and polar coordinates.
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
Code for the paper "XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs"
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch, with lengths specified for samples in batch.
P2P Voice/Video phone App for local networks.
Model Context Protocol Servers
The official MCP server implementation for the Perplexity API Platform
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Curated list of open source alternatives to proprietary software.
👾 Fast and simple video download library and CLI tool written in Go
A curated list of research papers and resources on code-switching
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Chinese LLaMA & Alpaca LLM project, phase 3 (Chinese Llama-3 LLMs), developed from Meta Llama 3
PyTorch code for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Dual-Path Attention and Recurrent Network for speech separation
Libri-CSS: dataset and evaluation pipeline
Speech separation with utterance-level PIT experiments