Skip to content
View RayeRen's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@msra-alumni @MLNLP-World @NATSpeech

Block or report RayeRen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
73 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 72,351 10,580 Updated Jun 18, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,372 6,228 Updated Sep 18, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,961 4,684 Updated Aug 19, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,505 3,897 Updated Jul 23, 2024

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

Jupyter Notebook 25,862 12,865 Updated Oct 3, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,978 2,567 Updated Mar 13, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,804 2,053 Updated Nov 19, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 14,720 3,400 Updated Aug 12, 2024

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,677 2,839 Updated Feb 5, 2026

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,833 1,712 Updated Feb 29, 2024
Jupyter Notebook 12,275 1,436 Updated Jan 30, 2026

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,793 1,446 Updated Apr 12, 2025

Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术

Jupyter Notebook 10,716 888 Updated Sep 20, 2024

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 10,112 1,320 Updated Nov 9, 2023

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,123 1,004 Updated Feb 6, 2026

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,425 1,230 Updated Jul 30, 2024

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,760 358 Updated Feb 3, 2026

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,304 1,426 Updated Jun 12, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,813 345 Updated Jan 21, 2025

Probabilistic reasoning and statistical analysis in TensorFlow

Jupyter Notebook 4,408 1,123 Updated Feb 5, 2026

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,800 252 Updated Dec 12, 2023

Singing Voice Conversion via diffusion model

Jupyter Notebook 2,718 821 Updated Dec 14, 2025

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,079 351 Updated Jul 14, 2024

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,826 264 Updated Aug 19, 2025

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,800 232 Updated Nov 29, 2022

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,637 349 Updated Apr 22, 2024

functorch is JAX-like composable function transforms for PyTorch.

Jupyter Notebook 1,436 108 Updated Aug 21, 2025

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,418 242 Updated May 21, 2023

Demonstrations of Magenta Models

Jupyter Notebook 1,345 426 Updated Jan 6, 2026

Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow

Jupyter Notebook 1,228 359 Updated Sep 19, 2021
Next