Skip to content
View Shy-98's full-sized avatar

Block or report Shy-98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DuplexSLA: A Full-Duplex Spoken Language Model with Synchronized Speech, Language, and Action

80 Updated May 20, 2026

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,777 2,158 Updated May 18, 2026

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 575 43 Updated May 18, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 114 7 Updated Mar 18, 2026

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 2,164 169 Updated Feb 2, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 217,670 33,399 Updated Jun 17, 2026

Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.

Python 35,244 4,349 Updated Jun 18, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,370 79,410 Updated Jun 18, 2026

AffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis

67 3 Updated Jun 12, 2026

A framework for efficient model inference with omni-modality models

Python 5,191 1,134 Updated Jun 18, 2026

🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.

Python 15,049 1,262 Updated Jun 17, 2026

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

TypeScript 3,472 872 Updated Jun 18, 2026

Edit Banana: A framework for converting statistical formats into editable.

Python 5,349 361 Updated Jun 16, 2026

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,976 356 Updated Jan 4, 2024

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 12,018 1,558 Updated Mar 17, 2026

[TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 370 25 Updated Jun 11, 2026

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)

Python 152 12 Updated Sep 14, 2023

本科华五,曾赴美qs50读博,某兄弟院校副教授,校园门卫亭女性主理人,为防止炸号的备份平台,是本人。

897 11 Updated Jan 14, 2026

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,641 584 Updated Oct 24, 2024

Reference-aware automatic speech evaluation toolkit

Python 182 15 Updated Dec 5, 2024

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 335 15 Updated Feb 5, 2026

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 654 41 Updated Jun 18, 2026
Python 695 121 Updated Sep 12, 2025
Python 181 12 Updated Jul 9, 2024

[ACL 2026]

Python 21 1 Updated Dec 6, 2025

⚡ Clash for Lab 是为实验室环境设计的科学上网工具,无需sudo权限,优雅地一键式脚本安装

Shell 366 19 Updated Feb 1, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 21,723 2,504 Updated May 25, 2026
Next