rom1504

Organizations

@webtorrent @camomile-project @SpockBotMC @PrismarineJS @ProtoDef-io @MephisTools

Starred repositories

81 stars written in Jupyter Notebook

A latent text-to-image diffusion model

Jupyter Notebook 72,364 10,584 Updated Jun 18, 2024

Google Research

Jupyter Notebook 37,244 8,326 Updated Feb 6, 2026

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,529 3,903 Updated Jul 23, 2024

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 29,842 13,247 Updated Jun 13, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 20,202 2,877 Updated Aug 18, 2024

Examples and guides for using the Gemini API

Jupyter Notebook 16,404 2,457 Updated Feb 9, 2026

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,804 2,053 Updated Nov 19, 2024

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,685 2,842 Updated Feb 5, 2026

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,839 1,712 Updated Feb 29, 2024
Jupyter Notebook 12,284 1,437 Updated Jan 30, 2026

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,141 1,005 Updated Feb 7, 2026
Jupyter Notebook 7,432 1,103 Updated Jul 9, 2023

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Jupyter Notebook 7,154 1,312 Updated Jan 18, 2023

Google Brain AutoML

Jupyter Notebook 6,451 1,463 Updated Mar 2, 2025

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,427 1,230 Updated Jul 30, 2024

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,675 1,365 Updated Jan 20, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,669 755 Updated Aug 5, 2024

Reference models and tools for Cloud TPUs.

Jupyter Notebook 5,267 1,762 Updated Feb 5, 2026

SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners

Jupyter Notebook 4,452 660 Updated May 22, 2023

Language-Agnostic SEntence Representations

Jupyter Notebook 3,658 462 Updated May 2, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,352 208 Updated May 19, 2025

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Jupyter Notebook 3,293 331 Updated Mar 3, 2024

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 3,055 326 Updated Feb 6, 2026
Jupyter Notebook 3,047 286 Updated Feb 27, 2023

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 2,819 315 Updated May 1, 2024

Self hosted alternative to Google Photos

Jupyter Notebook 2,773 226 Updated Dec 7, 2022

A simple notebook demonstrating prompt-based music generation via Mubert API

Jupyter Notebook 2,740 233 Updated May 4, 2023

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,724 239 Updated Aug 15, 2025

DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf

Jupyter Notebook 2,560 379 Updated Jan 28, 2025

Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the came…

Jupyter Notebook 2,321 264 Updated Jul 20, 2022