JaesungHuh

Follow

🎹

jaesunghuh JaesungHuh

🎹

Follow

RØDE microphones, Prev @ VGG group

56 followers · 39 following

Achievements

Achievements

Stars

mtdvio / every-programmer-should-know

A collection of (mostly) technical things every software developer should know about

98,353 8,692 Updated Dec 29, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,612 11,926 Updated Dec 15, 2025

3b1b / manim

Animation engine for explanatory math videos

Python 85,534 7,184 Updated Mar 14, 2026

meta-llama / llama

Inference code for Llama models

Python 59,257 9,822 Updated Jan 26, 2025

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,779 6,297 Updated Sep 18, 2024

typst / typst

A markup-based typesetting system that is powerful and easy to learn.

Rust 52,273 1,514 Updated Mar 25, 2026

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,952 3,969 Updated Mar 25, 2026

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

29,947 2,427 Updated Jun 18, 2024

conwnet / github1s

One second to read GitHub code with VS Code.

TypeScript 23,294 907 Updated Mar 24, 2026

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,914 2,202 Updated Mar 25, 2026

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 18,765 2,700 Updated Mar 25, 2026

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,827 2,051 Updated Nov 19, 2024

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,715 1,701 Updated Apr 7, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,698 1,394 Updated Mar 3, 2026

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 11,366 1,673 Updated Mar 25, 2026

openai / DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Python 10,874 1,888 Updated Jan 31, 2024

triton-inference-server / server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,466 1,737 Updated Mar 25, 2026

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,401 1,032 Updated Mar 12, 2026

facebookresearch / ImageBind

ImageBind One Embedding Space to Bind Them All

Python 9,001 845 Updated Nov 21, 2025

fastrepl / char

AI notepad for meetings

Rust 8,074 565 Updated Mar 25, 2026

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,843 1,387 Updated Dec 6, 2023

1adrianb / face-alignment

🔥 2D and 3D Face alignment library build using pytorch

Python 7,504 1,383 Updated Aug 30, 2024

facebookresearch / dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 7,492 1,028 Updated Jul 3, 2024

codertimo / BERT-pytorch

Google AI 2018 BERT pytorch implementation

Python 6,520 1,327 Updated Sep 15, 2023

FoundationVision / ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Python 6,188 1,095 Updated Jun 19, 2024

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python 5,611 581 Updated May 30, 2025

facebookresearch / co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,884 355 Updated Mar 3, 2026

facebookresearch / deit

Official DeiT repository

Python 4,328 590 Updated Mar 15, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 4,081 317 Updated Aug 31, 2024

google / lyra

A Very Low-Bitrate Codec for Speech Compression

C++ 3,950 368 Updated Aug 20, 2024