Stars
9 stars written in Jupyter Notebook
🔊 Text-Prompted Generative Audio Model
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Kandinsky 2 — multilingual text2image latent diffusion model
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
(CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Korean OCR code in Python using the Pororo library