Skip to content
View ogkalu2's full-sized avatar

Block or report ogkalu2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
9 stars written in Jupyter Notebook
Clear filter

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,951 4,683 Updated Aug 19, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,602 556 Updated Nov 10, 2025

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 2,819 316 Updated May 1, 2024

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Jupyter Notebook 1,713 130 Updated Jan 29, 2024
Jupyter Notebook 948 71 Updated Jun 24, 2025
Jupyter Notebook 791 74 Updated Aug 7, 2024

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Jupyter Notebook 485 37 Updated Oct 30, 2023

(CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Jupyter Notebook 200 16 Updated Jul 13, 2025

This is a Korean OCR Python code using the Pororo library

Jupyter Notebook 86 32 Updated May 24, 2023