Stars
Robust Speech Recognition via Large-Scale Weak Supervision
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
The world's simplest facial recognition api for Python and the command line
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Deezer source separation library including pretrained models.
SoftVC VITS Singing Voice Conversion
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
GUI for a Vocal Remover that uses Deep Neural Networks.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Multilingual large voice generation model, providing full-stack support for inference, training, and deployment.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal AI, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
An open source implementation of CLIP.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Multilingual Voice Understanding Model