seaniezhao

Follow

🎯

Focusing on pytorch

sean seaniezhao

🎯

Focusing on pytorch

Follow

I'm currently working at a startup company. we focus on Music Generation, Singing Synthesis, etc. Anyone interesting in this area feel free to contact me.

122 followers · 67 following

timedomAIn
Beijing
seanweichat

Achievements

Achievements

Organizations

Lists (28)

Sort

3d-rendering

unity or other 3D rendering related

AI_tricks

audio_framework

12 repositories

audio-generation

models for audio generation

bigData

blockchain

chatGPTxxx

20 repositories

dataset

DeepLearning—learning

dsp

game_framework

game_graphic

game_physics

image_generation

xxGAN, diffusion..

infra

10 repositories

interesting

57 repositories

large_model

24 repositories

MIR_ASR

15 repositories

ML_model deploy/optimization

10 repositories

music-generation

nlp

other_tools

11 repositories

server_dev

TTS_or_singing-sythesis

deep-learning paper for MIR, TTS for SInging-synthesis

37 repositories

ui_framework

vocoder

voice-conversion

webui

Stars

43 stars written in Jupyter Notebook

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,970 4,684 Updated Aug 19, 2024

datawhalechina / llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

Jupyter Notebook 23,228 2,823 Updated Jun 12, 2025

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,982 2,568 Updated Mar 13, 2025

Stability-AI / StableLM

StableLM: Stability AI Language Models

Jupyter Notebook 15,764 1,022 Updated Apr 8, 2024

NVIDIA / DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 14,722 3,401 Updated Aug 12, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,841 1,712 Updated Feb 29, 2024

jupyter / notebook

Jupyter Interactive Notebook

Jupyter Notebook 12,935 5,592 Updated Feb 6, 2026

leisurelicht / wtfpython-cn

wtfpython的中文翻译/持续🚧.../ 能力有限，欢迎帮我改进翻译

Jupyter Notebook 12,755 2,041 Updated Nov 29, 2024

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,835 872 Updated Jun 10, 2024

bmild / nerf

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,800 1,445 Updated Apr 12, 2025

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 10,663 1,011 Updated Nov 21, 2025

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,141 1,005 Updated Feb 7, 2026

Vaibhavs10 / insanely-fast-whisper

Jupyter Notebook 8,809 631 Updated Oct 25, 2025

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,462 801 Updated Mar 15, 2025

DmitryUlyanov / deep-image-prior

Image restoration with neural networks but without learning.

Jupyter Notebook 8,065 1,448 Updated Apr 27, 2023

cloneofsimo / lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,524 500 Updated Mar 22, 2024

alembics / disco-diffusion

Jupyter Notebook 7,432 1,103 Updated Jul 9, 2023

bentrevett / pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,675 1,365 Updated Jan 20, 2024

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,305 1,426 Updated Jun 12, 2024

datawhalechina / joyful-pandas

pandas中文教程

Jupyter Notebook 5,074 1,938 Updated Apr 24, 2024

mdeff / fma

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,552 459 Updated Jan 5, 2023

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,420 242 Updated May 21, 2023

innnky / emotional-vits

无需情感标注的情感可控语音合成模型，基于VITS

Jupyter Notebook 1,396 169 Updated Mar 30, 2023

yitu-opensource / T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Jupyter Notebook 1,191 177 Updated Oct 27, 2023

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,110 63 Updated Mar 20, 2025

Edresson / YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 1,053 97 Updated Nov 4, 2024

vincentherrmann / pytorch-wavenet

An implementation of WaveNet with fast generation

Jupyter Notebook 1,022 233 Updated Sep 17, 2020

tomhartke / knowledge-graph-from-GPT

Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.

Jupyter Notebook 694 52 Updated Oct 21, 2025

zalandoresearch / pytorch-vq-vae

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Jupyter Notebook 601 103 Updated Nov 13, 2019

audeering / w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 539 51 Updated May 22, 2023