- Hong Kong
Stars
A latent text-to-image diffusion model
[NeurIPS 2024 Best Paper Award][GPT beats diffusion] [scaling laws in visual generation] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focuse…
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Official implementation of "Designing an Encoder for StyleGAN Image Manipulation" (SIGGRAPH 2021) https://arxiv.org/abs/2102.02766
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
A tiny 1000-line LLVM-based numeric specializer for scientific Python code.
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
Unofficial PyTorch implementation of "Face Identity Disentanglement via Latent Space Mapping" (https://arxiv.org/abs/2005.07728), using StyleGAN2 instead of StyleGAN
Official code for "Seeing Faces in Things: A Model and Dataset for Pareidolia" ECCV 2024