Highlights
- Pro
Stars
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Minimal reproduction of DeepSeek R1-Zero
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
Python library to manipulate ESC/POS printers
Keras Generative Adversarial Networks
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
A MMU system developed by ANNEX Engineering
Examples of using Python for Twitter social data mining, using the python-twitter-tools framework.
Public repo for Augmented Multiscale Deep InfoMax representation learning
ObamaNet : Photo-realistic lip-sync from audio (Unofficial port)
An extension of the PyTorch library containing various tools for performing deep learning in hyperbolic space.
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
A VGG-based perceptual loss function for PyTorch.
Ambilight via WiFi. Prismatik plugin to support WLED
Semantic Segmentation : Multiclass fine tuning of DeepLabV3 with PyTorch
rezaakb / peft-vit
Forked from bwconrad/vit-finetuneParameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting
Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704
Memory, Attention and Composition (MAC) Network for CLEVR/GQA implemented in PyTorch
Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).