Skip to content
View hojinYang's full-sized avatar
✈️
✈️

Block or report hojinYang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of AnimateDiff.

Python 11,957 1,031 Updated Jul 31, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 98,072 11,119 Updated Dec 25, 2025

Machine Learning Engineering Open Book

Python 16,093 988 Updated Dec 20, 2025

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,763 164 Updated May 25, 2024

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,992 412 Updated May 10, 2023

LLM papers I'm reading, mostly on inference and model compression

748 38 Updated Dec 21, 2023
Python 800 47 Updated Jul 8, 2024

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,541 91 Updated Apr 24, 2025

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,612 281 Updated Jan 12, 2025

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,736 497 Updated Dec 25, 2025

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

Python 275 20 Updated Oct 30, 2023

Build AI Agents, Visually

TypeScript 47,558 23,432 Updated Dec 23, 2025

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 411 35 Updated Feb 21, 2024

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 464 40 Updated Apr 24, 2024

WhisperTalk is an audio-to-text model based on the transformer architecture which takes audio input and generates predictions for the next utterance.

Python 7 Updated Jul 5, 2023

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 2,368 280 Updated Nov 16, 2025

SoftVC VITS Singing Voice Conversion

Python 27,882 5,078 Updated Nov 11, 2023

Easily train a good VC model with voice data <= 10 mins!

Python 33,537 4,774 Updated Nov 24, 2024

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,337 448 Updated Aug 24, 2025

The code for the bark-voicecloning model. Training and inference.

Python 711 116 Updated Sep 13, 2023

문맥을 고려한 한국어 텍스트 데이터 증강

Python 21 2 Updated Oct 18, 2024

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,139 885 Updated Dec 25, 2025

Code for "Learning to summarize from human feedback"

Python 1,056 152 Updated Sep 5, 2023

PyTorch deep learning projects made easy.

Python 5,064 1,107 Updated Jun 4, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,382 586 Updated Oct 28, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,084 1,090 Updated Nov 18, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,706 204 Updated Sep 9, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,911 2,681 Updated Dec 15, 2025

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 26,691 4,099 Updated Jun 19, 2025

A modular RL library to fine-tune language models to human preferences

Python 2,376 203 Updated Mar 1, 2024
Next