Stars
[ICLR 2021] Zero-shot Synthesis with Group-Supervised Learning
Related papers and codes for vision-based robotic grasping
Python books free to read online or download
An easy-to-use Python library for processing and manipulating 3D point clouds and meshes.
A curated list of awesome work on VAEs, disentanglement, representation learning, and generative models.
Google Research
Implementation of 6-DoF GraspNet with TensorFlow and Python. This repo has been tested with Python 2.7 and TensorFlow 1.12.
Release of the YCB-Affordance dataset (CVPR 2020 Oral)
PyTorch package for the discrete VAE used for DALL·E.
Awesome work on object 6 DoF pose estimation
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
Code Repository for Liquid Time-Constant Networks (LTCs)
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
A selection of state-of-the-art research materials on trajectory prediction
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
Replication of the model in "Time-Contrastive Networks: Self-Supervised Learning from Video" for a manipulation task in simulation.
An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller", presented at the Storytelling Workshop, co-located with NAACL 2018.
GLAC Net: GLocal Attention Cascading Network for the Visual Storytelling Challenge
Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling"
Implementation of seq2seq model for Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
Shared Attention for Multi-label Zero-shot Learning, accepted at CVPR 2020
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
This repository contains the exercises from the book "An Introduction to Statistical Learning" and their solutions in Python.
Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer (NeurIPS 2019)