Lists (11)
Sort Name ascending (A-Z)
Stars
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
A Model Context Protocol (MCP) server and CLI that provides tools for agent use when working on iOS and macOS projects.
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
此仓库存储我在YouTube频道分享的N8N工作流配置文件,用户可直接下载JSON文件导入N8N使用
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Backported SwiftUI navigation APIs introduced in WWDC22
Bringing simple and powerful navigation tools to all Swift platforms, inspired by SwiftUI.
A Swift command line tool for generating your Xcode project
Instant voice cloning by MIT and MyShell. Audio foundation model.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Easily train a good VC model with voice data <= 10 mins!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🕳 bore is a simple CLI tool for making tunnels to localhost
A lightweight and high-performance reverse proxy for NAT traversal, written in Rust. An alternative to frp and ngrok.
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone