mayfool

11 followers · 118 following

Starred repositories

GAIR-NLP / daVinci-LLM

119 10 Updated Mar 31, 2026

VITA-MLLM / VITA-QinYu

VITA-QINYU: Expressive Spoken Language Model for Role-Playing and Singing

Python 100 4 Updated Apr 3, 2026

k2-fsa / OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

Python 1,767 277 Updated Apr 4, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 109,395 18,138 Updated Apr 4, 2026

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,795 172 Updated Apr 5, 2026

meituan-longcat / LongCat-AudioDiT

Python 328 26 Updated Apr 3, 2026

hesreallyhim / awesome-claude-code

A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic

Python 36,746 2,905 Updated Apr 6, 2026

FunAudioLLM / CV3-Eval

Python 179 15 Updated Aug 25, 2025

THU-MAIC / OpenMAIC

Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click

TypeScript 13,905 2,392 Updated Apr 4, 2026

EvoScientist / EvoScientist

🔬 Harness Vibe Research with Self-evolving AI Scientists

Python 2,812 144 Updated Apr 5, 2026

NVIDIA / personaplex

PersonaPlex code.

Python 6,894 1,041 Updated Mar 2, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 66,491 9,533 Updated Mar 26, 2026

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,551 343 Updated Jun 21, 2025

OpenBMB / UltraEval-Audio

Your faithful, impartial partner for authentic audio evaluation: know yourself, know your rivals.

Python 285 21 Updated Mar 19, 2026

km1994 / LLMsNineStoryDemonTower

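[LLMs Nine-Story Demon Tower] Shares hands-on practice and experience with LLMs in natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, and multimodal areas (Stable Diffusion, MiniGPT-4, VisualGLM-6B, Ziya-Visual, etc.).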

2,158 208 Updated Mar 30, 2024

blindTissue / logit_lens_llama_advanced

Python 18 Updated Jan 13, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 349,067 69,906 Updated Apr 6, 2026

malradhi / PACodec

[ICASSP 2026] Official code for "Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum"

Python 24 3 Updated Jan 22, 2026

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 10,339 1,320 Updated Mar 17, 2026

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 6,946 682 Updated Mar 15, 2026

narcotic-sh / senko

Very fast, accurate speaker diarization

Python 245 26 Updated Mar 25, 2026

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,813 94 Updated Apr 18, 2025

supertone-inc / supertonic

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

C++ 2,787 247 Updated Jan 22, 2026

ekwek1 / soprano

Soprano: Instant, Ultra-Realistic Text-to-Speech

Python 1,215 107 Updated Jan 15, 2026

ZHAOoops / AI-Notes

AI notes by the Bilibili creator 东川路第一可爱猫猫虫 ("The Cutest Cat-Bug on Dongchuan Road")

163 4 Updated Mar 18, 2026

yongliang-wu / DFT

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 554 22 Updated Jan 4, 2026

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,735 294 Updated Apr 4, 2026

ASLP-lab / VoiceSculptor

An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.

Python 235 13 Updated Feb 26, 2026

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 11,332 1,285 Updated Apr 3, 2026

ysharma3501 / FlashSR

Fast audio super-resolution from 16 kHz to 48 kHz.

Python 205 19 Updated Jan 3, 2026

Lists (27)

algro

am

Annotation

BIG

books

cv

data_process

dataset

diffusion_models

expressive_tts

frontend

fun

Go

mos-predict

multilingual

nlp

others

separate

sing

star

TODO

toy

tts_data_process

tts_framework

ttsing

vae

vocoder
