Skip to content
View shamuiscoding's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report shamuiscoding

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
30 stars written in Jupyter Notebook
Clear filter

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,992 11,515 Updated Nov 6, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,658 4,650 Updated Aug 19, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,626 2,480 Updated Mar 13, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,683 2,037 Updated Nov 19, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,505 1,684 Updated Feb 29, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,690 1,163 Updated Nov 14, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,662 1,710 Updated Apr 26, 2025
Jupyter Notebook 8,722 625 Updated Oct 25, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,635 965 Updated Oct 23, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,407 520 Updated Oct 8, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,895 753 Updated Nov 4, 2025

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,355 650 Updated Sep 26, 2024

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,767 295 Updated Jun 12, 2025

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook 3,693 235 Updated Mar 12, 2024

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,332 451 Updated Aug 24, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,214 201 Updated May 19, 2025

Puzzles for learning Triton

Jupyter Notebook 2,099 172 Updated Nov 18, 2024

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,487 107 Updated Nov 6, 2025
Jupyter Notebook 549 42 Updated Jul 10, 2024

Fast parallel LLM inference for MLX

Jupyter Notebook 225 17 Updated Jul 7, 2024

Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEs, etc

Jupyter Notebook 136 9 Updated Dec 20, 2023

Minimal, lightweight JAX implementations of popular models.

Jupyter Notebook 117 19 Updated Nov 6, 2025
Jupyter Notebook 101 4 Updated Oct 1, 2024

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 91 8 Updated Oct 18, 2023

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Jupyter Notebook 37 6 Updated Feb 23, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Jupyter Notebook 37 3 Updated Dec 3, 2023

This is a port of Mistral-7B model in JAX

Jupyter Notebook 32 Updated Jul 1, 2024

A set of TFDS dataset builders for common datasets

Jupyter Notebook 8 1 Updated May 31, 2025

A novel Disfluency Correction & Machine translation Dataset for English, Hindi, German and French

Jupyter Notebook 3 2 Updated Oct 31, 2023