Skip to content
View joanrod's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report joanrod

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…

Python 4,394 246 Updated Nov 7, 2025

GroundCUA

Python 125 14 Updated Mar 24, 2026

Generating figures from research papers, using textual captions from the paper.

Python 43 4 Updated Jul 17, 2023

Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer…

Python 10 1 Updated Jan 30, 2023

Karras et al. (2022) diffusion models for PyTorch

Python 2,587 400 Updated Feb 12, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,081 33,053 Updated Apr 29, 2026

Fast and memory-efficient exact attention

Python 23,587 2,664 Updated Apr 29, 2026

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Python 523 39 Updated Jan 27, 2024

Home of StarCoder: fine-tuning & inference!

Python 7,514 532 Updated Feb 27, 2024
Python 7,826 527 Updated Apr 14, 2024

Ongoing research training transformer models at scale

Python 394 52 Updated Aug 20, 2024

Denoising Diffusion Probabilistic Models from scratch

Python 1 Updated Jun 2, 2023
Python 30 Updated May 27, 2023

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Python 1,941 107 Updated Jan 12, 2025

Video editing with Python

Python 14,579 2,053 Updated Mar 7, 2026

Let us control diffusion models!

Python 33,846 3,015 Updated Feb 25, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,214 1,104 Updated Nov 18, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,642 1,338 Updated Apr 28, 2026
Jupyter Notebook 328 27 Updated Sep 20, 2022

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,436 205 Updated Feb 7, 2026

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 33,358 3,994 Updated Mar 25, 2026

v objective diffusion inference code for PyTorch.

Python 719 105 Updated Nov 29, 2022

A microservice that interacts and fetches information about the most important companies of the world

Kotlin 3 1 Updated Mar 6, 2023

Course content and resources for the AIAIART course.

Jupyter Notebook 567 47 Updated Nov 3, 2022

Model API for GALACTICA

Python 1 Updated Nov 18, 2022

PyTorch package for the discrete VAE used for DALL·E.

Python 10,867 1,885 Updated Jan 31, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,482 1,223 Updated Jul 30, 2024

High-fidelity performance metrics for generative models in PyTorch

Python 1,178 87 Updated Feb 17, 2026

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from V…

Python 83 2 Updated Jan 30, 2023
Next