Stars
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
[CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Fast and differentiable MS-SSIM and SSIM for pytorch.
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
Automatically collect and visualize usage statistics in Ubuntu/OSX environments.
An All-MLP solution for Vision, from Google AI
🦋A PyTorch implementation of BigGAN with pretrained weights and conversion scripts.
PyCIL: A Python Toolbox for Class-Incremental Learning
The web framework for building LLM microservices [deprecated]
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
Structured and typehinted GPT responses in Python
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA