rom1504

Romain Beaumont rom1504

Interested in machine learning (computer vision, natural language processing, deep learning), node.js (network, bots, web), and programming in general

2.2k followers · 48 following

Achievements

x4 x3 x2 x3

Achievements

x4 x3 x2 x3

Organizations

Starred repositories

81 stars written in Jupyter Notebook

Clear filter

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 72,364 10,584 Updated Jun 18, 2024

google-research / google-research

Google Research

Jupyter Notebook 37,244 8,326 Updated Feb 6, 2026

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,529 3,903 Updated Jul 23, 2024

ageron / handson-ml2

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 29,842 13,247 Updated Jun 13, 2024

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook 20,202 2,877 Updated Aug 18, 2024

google-gemini / cookbook

Examples and guides for using the Gemini API

Jupyter Notebook 16,404 2,457 Updated Feb 9, 2026

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,804 2,053 Updated Nov 19, 2024

google-deepmind / deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,685 2,842 Updated Feb 5, 2026

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,839 1,712 Updated Feb 29, 2024

google-research / vision_transformer

Jupyter Notebook 12,284 1,437 Updated Jan 30, 2026

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,141 1,005 Updated Feb 7, 2026

alembics / disco-diffusion

Jupyter Notebook 7,432 1,103 Updated Jul 9, 2023

facebookresearch / DensePose

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Jupyter Notebook 7,154 1,312 Updated Jan 18, 2023

google / automl

Google Brain AutoML

Jupyter Notebook 6,451 1,463 Updated Mar 2, 2025

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,427 1,230 Updated Jul 30, 2024

bentrevett / pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,675 1,365 Updated Jan 20, 2024

salesforce / BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,669 755 Updated Aug 5, 2024

tensorflow / tpu

Reference models and tools for Cloud TPUs.

Jupyter Notebook 5,267 1,762 Updated Feb 5, 2026

google-research / simclr

SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners

Jupyter Notebook 4,452 660 Updated May 22, 2023

facebookresearch / LASER

Language-Agnostic SEntence Representations

Jupyter Notebook 3,658 462 Updated May 2, 2024

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,352 208 Updated May 19, 2025

facebookresearch / vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Jupyter Notebook 3,293 331 Updated Mar 3, 2024

google / brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 3,055 326 Updated Feb 6, 2026

rinongal / textual_inversion

Jupyter Notebook 3,047 286 Updated Feb 27, 2023

ai-forever / Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 2,819 315 Updated May 1, 2024

hooram / ownphotos

Self hosted alternative to Google Photos

Jupyter Notebook 2,773 226 Updated Dec 7, 2022

MubertAI / Mubert-Text-to-Music

A simple notebook demonstrating prompt-based music generation via Mubert API

Jupyter Notebook 2,740 233 Updated May 4, 2023

rom1504 / clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,724 239 Updated Aug 15, 2025

switchablenorms / DeepFashion2

DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf

Jupyter Notebook 2,560 379 Updated Jan 28, 2025

google-research-datasets / Objectron

Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the came…

Jupyter Notebook 2,321 264 Updated Jul 20, 2022

Romain Beaumont rom1504

Organizations

Starred repositories

Minecraft