orlandoxu
  • tencent
  • sichuan, chengdu, china


Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,134 1,414 Updated May 15, 2026

An AI skill that provides design intelligence for building professional UI/UX across multiple platforms

Python 79,042 8,116 Updated Apr 3, 2026

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

Python 1,146 111 Updated Feb 25, 2026

Voice Activity Detector (VAD): low-latency, high-performance, and lightweight

C 2,119 168 Updated Feb 2, 2026

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,049 771 Updated Mar 26, 2026

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

Python 2,656 269 Updated Jan 30, 2026

A Model Context Protocol (MCP) server and CLI that provides tools for agent use when working on iOS and macOS projects.

TypeScript 5,574 273 Updated May 13, 2026

The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 102,379 12,399 Updated May 15, 2026

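This repository stores the N8N workflow configuration files I share on my YouTube channel; users can download the JSON files and import them directly into N8N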

1,415 340 Updated Jan 28, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 372,202 77,083 Updated May 16, 2026

An image viewer à la Twitter

Swift 2,550 389 Updated May 17, 2025

Backported SwiftUI navigation APIs introduced in WWDC22

Swift 917 60 Updated Aug 10, 2025

Bringing simple and powerful navigation tools to all Swift platforms, inspired by SwiftUI.

Swift 2,266 173 Updated Apr 1, 2026

A Swift command line tool for generating your Xcode project

Swift 8,433 881 Updated Apr 14, 2026

WaveNet vocoder

Python 2,374 493 Updated Jul 29, 2023

CMU US English Dictionary

Python 771 174 Updated Oct 24, 2025

g2p: English Grapheme To Phoneme Conversion

Python 919 135 Updated Jan 5, 2023

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,508 4,078 Updated Apr 19, 2025

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,094 215 Updated Oct 23, 2024

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 20,522 2,532 Updated Mar 16, 2026

Easily train a good VC model with voice data <= 10 mins!

Python 35,607 5,041 Updated Nov 24, 2024

On-device TTS model by Neuphonic

Python 5,868 641 Updated Apr 24, 2026

https://hf.co/hexgrad/Kokoro-82M

JavaScript 7,032 763 Updated Aug 6, 2025

in preparation...

Python 585 126 Updated Nov 5, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,317 6,096 Updated Aug 16, 2024

AI-Powered, Non-Intrusive Terminal Assistant

Go 1,814 111 Updated May 14, 2026

🕳 bore is a simple CLI tool for making tunnels to localhost

Rust 11,162 498 Updated Feb 4, 2026

A lightweight and high-performance reverse proxy for NAT traversal, written in Rust. An alternative to frp and ngrok.

Rust 13,555 757 Updated Apr 16, 2026

SoTA open-source TTS

Python 24,728 3,279 Updated May 1, 2026

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 25,286 3,950 Updated Mar 6, 2026