Skip to content
View JaesungHuh's full-sized avatar
🎹
🎹

Block or report JaesungHuh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of (mostly) technical things every software developer should know about

98,353 8,692 Updated Dec 29, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,612 11,926 Updated Dec 15, 2025

Animation engine for explanatory math videos

Python 85,534 7,184 Updated Mar 14, 2026

Inference code for Llama models

Python 59,257 9,822 Updated Jan 26, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,779 6,297 Updated Sep 18, 2024

A markup-based typesetting system that is powerful and easy to learn.

Rust 52,273 1,514 Updated Mar 25, 2026

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,952 3,969 Updated Mar 25, 2026

A playbook for systematically maximizing the performance of deep learning models.

29,947 2,427 Updated Jun 18, 2024

One second to read GitHub code with VS Code.

TypeScript 23,294 907 Updated Mar 24, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,914 2,202 Updated Mar 25, 2026

Development repository for the Triton language and compiler

MLIR 18,765 2,700 Updated Mar 25, 2026

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,827 2,051 Updated Nov 19, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,715 1,701 Updated Apr 7, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,698 1,394 Updated Mar 3, 2026

A PyTorch-based Speech Toolkit

Python 11,366 1,673 Updated Mar 25, 2026

PyTorch package for the discrete VAE used for DALL·E.

Python 10,874 1,888 Updated Jan 31, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,466 1,737 Updated Mar 25, 2026

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,401 1,032 Updated Mar 12, 2026

ImageBind One Embedding Space to Bind Them All

Python 9,001 845 Updated Nov 21, 2025

AI notepad for meetings

Rust 8,074 565 Updated Mar 25, 2026

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,843 1,387 Updated Dec 6, 2023

🔥 2D and 3D Face alignment library build using pytorch

Python 7,504 1,383 Updated Aug 30, 2024

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 7,492 1,028 Updated Jul 3, 2024

Google AI 2018 BERT pytorch implementation

Python 6,520 1,327 Updated Sep 15, 2023

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Python 6,188 1,095 Updated Jun 19, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,611 581 Updated May 30, 2025

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,884 355 Updated Mar 3, 2026

Official DeiT repository

Python 4,328 590 Updated Mar 15, 2024

An open-source framework for training large multimodal models.

Python 4,081 317 Updated Aug 31, 2024

A Very Low-Bitrate Codec for Speech Compression

C++ 3,950 368 Updated Aug 20, 2024
Next