shamuiscoding

🎯

Focusing

Toby Kim shamuiscoding

🎯

Focusing

Building Nari Labs

97 followers · 38 following

Achievements

Highlights

Stars

30 stars written in Jupyter Notebook

Clear filter

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,992 11,515 Updated Nov 6, 2025

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,658 4,650 Updated Aug 19, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,626 2,480 Updated Mar 13, 2025

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,683 2,037 Updated Nov 19, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,505 1,684 Updated Feb 29, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,690 1,163 Updated Nov 14, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,662 1,710 Updated Apr 26, 2025

Vaibhavs10 / insanely-fast-whisper

Jupyter Notebook 8,722 625 Updated Oct 25, 2025

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,635 965 Updated Oct 23, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,407 520 Updated Oct 8, 2025

google / flax

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,895 753 Updated Nov 4, 2025

HVision-NKU / StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,355 650 Updated Sep 26, 2024

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,767 295 Updated Jun 12, 2025

mlc-ai / web-stable-diffusion

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook 3,693 235 Updated Mar 12, 2024

serp-ai / bark-with-voice-clone

Forked from suno-ai/bark

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,332 451 Updated Aug 24, 2025

Jupyter Notebook 37 3 Updated Dec 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Toby Kim shamuiscoding

Achievements

Achievements

Highlights

Block or report shamuiscoding

Stars

rasbt / LLMs-from-scratch

suno-ai / bark

facebookresearch / audiocraft

neonbjb / tortoise-tts

CompVis / latent-diffusion

facebookresearch / seamless_communication

SakanaAI / AI-Scientist

Vaibhavs10 / insanely-fast-whisper

pyannote / pyannote-audio

FoundationVision / VAR

OpenBMB / MiniCPM

google / flax

HVision-NKU / StoryDiffusion

QwenLM / Qwen2.5-Omni

mlc-ai / web-stable-diffusion

serp-ai / bark-with-voice-clone

google-research / big_vision

srush / Triton-Puzzles

gepa-ai / gepa

Vaibhavs10 / fast-whisper-finetuning

willccbb / mlx_parallm

smsharma / minified-generative-models

jax-ml / bonsai

xi-j / Mamba-TasNet

FrenchKrab / IS2023-powerset-diarization

sanchit-gandhi / seq2seq-speech

cgarciae / nanoGPT-jax

AakashKumarNain / mistral_jax

kvfrans / tfds_builders

vineet2104 / DISCO