Shy-98

Haiyang Shy-98

28 followers · 22 following

Achievements

Stars

hyzhang24 / DuplexSLA

DuplexSLA: A Full-Duplex Spoken Language Model with Synchronized Speech, Language, and Action

80 Updated May 20, 2026

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,777 2,158 Updated May 18, 2026

stepfun-ai / SteptronOss

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 575 43 Updated May 18, 2026

vspeech / Qwen3-TTS-Train

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 114 7 Updated Mar 18, 2026

TEN-framework / ten-vad

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 2,164 169 Updated Feb 2, 2026

affaan-m / ECC

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 217,670 33,399 Updated Jun 17, 2026

github / awesome-copilot

Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.

Python 35,244 4,349 Updated Jun 18, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,370 79,410 Updated Jun 18, 2026

jeremychee4 / AffectSpeech

AffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis

67 3 Updated Jun 12, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 5,191 1,134 Updated Jun 18, 2026

meituan-longcat / LongCat-AudioDiT

Python 525 47 Updated Apr 3, 2026

WEIFENG2333 / VideoCaptioner

🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理！- A powered tool for easy and efficient video subtitling.

Python 15,049 1,262 Updated Jun 17, 2026

common-voice / common-voice

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

TypeScript 3,472 872 Updated Jun 18, 2026

BIT-DataLab / Edit-Banana

Edit Banana: A framework for converting statistical formats into editable.

Python 5,349 361 Updated Jun 16, 2026

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,976 356 Updated Jan 4, 2024

QwenLM / Qwen3-TTS

Python 12,018 1,558 Updated Mar 17, 2026

MatthewCYM / VoiceBench

[TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 370 25 Updated Jun 11, 2026

0nutation / USLM

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)

Python 152 12 Updated Sep 14, 2023

UniSecurityGuard / UniSecurityGuard

本科华五，曾赴美qs50读博，某兄弟院校副教授，校园门卫亭女性主理人，为防止炸号的备份平台，是本人。

897 11 Updated Jan 14, 2026

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,641 584 Updated Oct 24, 2024