Stars
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
Discovering Interpretable GAN Controls [NeurIPS 2020]
Official implementation of "Designing an Encoder for StyleGAN Image Manipulation" (SIGGRAPH 2021) https://arxiv.org/abs/2102.02766
Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744
Projecting images to latent space with StyleGAN2.
Blend Between Multiple Images in JupyterLab.
This repository contains State of the Art Tokenizer, Language model and Classifier for Urdu, which is one of the Official Languages of India and spoken in various states of India.
Morphosis is a web application designed for performing edits to faces, utilizing StyleGAN2 and Pixel2Style2Pixel
Wav2Vec2 speech model transfer learned on the Urdu language
Can the V-JEPA2 model be used as a world model?