Skip to content
View mayfool's full-sized avatar

Block or report mayfool

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

VITA-QINYU: Expressive Spoken Language Model for Role-Playing and Singing

Python 100 4 Updated Apr 3, 2026

High-Quality Voice Cloning TTS for 600+ Languages

Python 1,767 277 Updated Apr 4, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 109,395 18,138 Updated Apr 4, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,795 172 Updated Apr 5, 2026

A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic

Python 36,746 2,905 Updated Apr 6, 2026
Python 179 15 Updated Aug 25, 2025

Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click

TypeScript 13,905 2,392 Updated Apr 4, 2026

🔬 Harness Vibe Research with Self-evolving AI Scientists

Python 2,812 144 Updated Apr 5, 2026

PersonaPlex code.

Python 6,894 1,041 Updated Mar 2, 2026

AI agents running research on single-GPU nanochat training automatically

Python 66,491 9,533 Updated Mar 26, 2026

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,551 343 Updated Jun 21, 2025

Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。

Python 285 21 Updated Mar 19, 2026

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

2,158 208 Updated Mar 30, 2024

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 349,067 69,906 Updated Apr 6, 2026

[ICASSP 2026]Official code for "Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum"

Python 24 3 Updated Jan 22, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 10,339 1,320 Updated Mar 17, 2026

Tools for merging pretrained large language models.

Python 6,946 682 Updated Mar 15, 2026

Very fast, accurate speaker diarization

Python 245 26 Updated Mar 25, 2026

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,813 94 Updated Apr 18, 2025

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

C++ 2,787 247 Updated Jan 22, 2026

Soprano: Instant, Ultra-Realistic Text-to-Speech

Python 1,215 107 Updated Jan 15, 2026

Bilibili东川路第一可爱猫猫虫的AI笔记

163 4 Updated Mar 18, 2026

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 554 22 Updated Jan 4, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,735 294 Updated Apr 4, 2026

An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.

Python 235 13 Updated Feb 26, 2026

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 11,332 1,285 Updated Apr 3, 2026

Fast audio super resolution from 16khz to 48khz.

Python 205 19 Updated Jan 3, 2026
Next