Skip to content
View seaniezhao's full-sized avatar
🎯
Focusing on pytorch
🎯
Focusing on pytorch

Organizations

@timedomain-tech

Block or report seaniezhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,418 179 Updated Dec 23, 2025

Spark-TTS Inference Code

Python 10,835 1,159 Updated Apr 9, 2025

SOTA Open Source TTS

Python 24,388 2,003 Updated Dec 1, 2025

网易云无损解析

Python 1,765 292 Updated Nov 17, 2025

Efficient Triton Kernels for LLM Training

Python 5,971 454 Updated Dec 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,915 3,842 Updated Dec 23, 2025

Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth

Python 217 10 Updated Nov 6, 2025

Generative models for conditional audio generation

Python 3,542 405 Updated Oct 9, 2025

An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and interaction.

Python 24 2 Updated Dec 3, 2025

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 23,871 1,882 Updated Dec 18, 2025

Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…

Python 11,924 2,527 Updated Nov 16, 2025

An open-source AI agent that lives in your terminal.

TypeScript 16,681 1,429 Updated Dec 23, 2025

轻量、灵活、易上手的Python剪映草稿生成及导出工具,构建全自动化视频剪辑/混剪流水线。本项目的CapCut版本正于 https://github.com/GuanYixuan/pyCapCut 内开发

Python 2,435 487 Updated Nov 5, 2025

Open-source unified multimodal model

Python 5,500 481 Updated Oct 27, 2025

PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing

Python 91 4 Updated Jun 1, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,169 568 Updated Aug 22, 2025

The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Python 699 75 Updated Dec 4, 2025
Python 943 97 Updated Dec 17, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,427 8,972 Updated Nov 17, 2025

FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build AI workflow platforms faster and simpler.

TypeScript 7,459 634 Updated Dec 22, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,682 153 Updated Dec 21, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,492 420 Updated Jun 27, 2025

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 337 13 Updated Aug 15, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,396 320 Updated Jun 21, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 50,893 4,223 Updated Dec 23, 2025

PyTorch-implementations of Flow Models for toy data

Python 11 4 Updated Aug 1, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 890 43 Updated Dec 23, 2025

A Model Context Protocol server for searching and analyzing arXiv papers

Python 1,969 152 Updated Aug 19, 2025

Encode and decode audio samples to/from compressed latent representations!

Python 241 25 Updated Sep 19, 2025

Multilingual Voice Understanding Model

Python 7,221 669 Updated Aug 15, 2025
Next