seaniezhao

🎯

Focusing on pytorch

sean seaniezhao

🎯

Focusing on pytorch

I'm currently working at a startup company. we focus on Music Generation, Singing Synthesis, etc. Anyone interesting in this area feel free to contact me.

116 followers · 67 following

timedomAIn
Beijing
seanweichat

Achievements

x3 x3

Achievements

x3 x3

Organizations

Lists (28)

Sort

Stars

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,418 179 Updated Dec 23, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 10,835 1,159 Updated Apr 9, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 24,388 2,003 Updated Dec 1, 2025

Suxiaoqinx / Netease_url

网易云无损解析

Python 1,765 292 Updated Nov 17, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,971 454 Updated Dec 21, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,915 3,842 Updated Dec 23, 2025

woct0rdho / transformers-qwen3-moe-fused

Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth

Python 217 10 Updated Nov 6, 2025

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 3,542 405 Updated Oct 9, 2025

manoskary / weavemuse

An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and interaction.

Python 24 2 Updated Dec 3, 2025

musistudio / claude-code-router

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 23,871 1,882 Updated Dec 18, 2025

coleam00 / context-engineering-intro

Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…

Python 11,924 2,527 Updated Nov 16, 2025

QwenLM / qwen-code

An open-source AI agent that lives in your terminal.

TypeScript 16,681 1,429 Updated Dec 23, 2025

GuanYixuan / pyJianYingDraft

轻量、灵活、易上手的Python剪映草稿生成及导出工具，构建全自动化视频剪辑/混剪流水线。本项目的CapCut版本正于 https://github.com/GuanYixuan/pyCapCut 内开发

Python 2,435 487 Updated Nov 5, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,500 481 Updated Oct 27, 2025

pnlong / PDMX

PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing

Python 91 4 Updated Jun 1, 2025

meta-pytorch / gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,169 568 Updated Aug 22, 2025

tencent-ailab / SongBloom

The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Python 699 75 Updated Dec 4, 2025

magenta / magenta-realtime

Python 943 97 Updated Dec 17, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 51,427 8,972 Updated Nov 17, 2025

bytedance / flowgram.ai

FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build AI workflow platforms faster and simpler.

TypeScript 7,459 634 Updated Dec 22, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,682 153 Updated Dec 21, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,492 420 Updated Jun 27, 2025

AaronZ345 / GTSinger

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 337 13 Updated Aug 15, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,396 320 Updated Jun 21, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 50,893 4,223 Updated Dec 23, 2025

revsic / flowmodels

PyTorch-implementations of Flow Models for toy data

Python 11 4 Updated Aug 1, 2025

fallenshock / FlowEdit

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 890 43 Updated Dec 23, 2025

blazickjp / arxiv-mcp-server

A Model Context Protocol server for searching and analyzing arXiv papers

Python 1,969 152 Updated Aug 19, 2025

SonyCSLParis / music2latent

Encode and decode audio samples to/from compressed latent representations!

Python 241 25 Updated Sep 19, 2025

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 7,221 669 Updated Aug 15, 2025

sean seaniezhao

Organizations

Lists (28)

3d-rendering

AI_tricks

audio_framework

audio-generation

bigData

blockchain

chatGPTxxx

dataset

DeepLearning—learning

dsp

game_framework

game_graphic

game_physics

image_generation

infra

interesting

large_model

MIR_ASR

ML_model deploy/optimization

music-generation

nlp

other_tools

server_dev

TTS_or_singing-sythesis

ui_framework

vocoder

voice-conversion

webui

Stars