Hojin Yang hojinYang

✈️

learner

39 followers · 61 following

hojinYang.github.io

Stars

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 11,957 1,031 Updated Jul 31, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 98,072 11,119 Updated Dec 25, 2025

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 16,093 988 Updated Dec 20, 2025

OpenPipe / OpenPipe

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,763 164 Updated May 25, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,992 412 Updated May 10, 2023

evanmiller / LLM-Reading-List

LLM papers I'm reading, mostly on inference and model compression

748 38 Updated Dec 21, 2023

shikras / shikra

Python 800 47 Updated Jul 8, 2024

lucidrains / soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,541 91 Updated Apr 24, 2025

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,612 281 Updated Jan 12, 2025

lucidrains / x-transformers

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,736 497 Updated Dec 25, 2025

lucidrains / spear-tts-pytorch

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

Python 275 20 Updated Oct 30, 2023

FlowiseAI / Flowise

Build AI Agents, Visually

TypeScript 47,558 23,432 Updated Dec 23, 2025

YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 411 35 Updated Feb 21, 2024

YuanGongND / ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 464 40 Updated Apr 24, 2024

hojinYang / whispertalk

WhisperTalk is an audio-to-text model based on the transformer architecture which takes audio input and generates predictions for the next utterance.

Python 7 Updated Jul 5, 2023

yxlllc / DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 2,368 280 Updated Nov 16, 2025

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 27,882 5,078 Updated Nov 11, 2023

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 33,537 4,774 Updated Nov 24, 2024

serp-ai / bark-with-voice-clone

Forked from suno-ai/bark

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,337 448 Updated Aug 24, 2025

gitmylo / bark-voice-cloning-HuBERT-quantizer

kyle-bong / K-TACC

skypilot-org / skypilot

openai / summarize-from-feedback

victoresque / pytorch-template

FMInference / FlexLLMGen

salesforce / LAVIS

linto-ai / whisper-timestamped

microsoft / unilm

mozilla / DeepSpeech

allenai / RL4LMs