Object representation learning

This repository contains a PyTorch implementation of the Multi-Object Network (MONet). MONet is a model trained to explain a scene in a fixed number of steps, which allows it to reconstruct each object separately, even when it is occluded by other objects.
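To make the "fixed number of steps" concrete, here is a minimal sketch of the recurrent attention scheme as described in the MONet paper. It is not taken from models.py in this repository. At each step a slot claims a fraction alpha of whatever part of the image is still unexplained (the scope), and the last slot takes the remainder, so the masks sum to one at every pixel. The function name stick_breaking_masks and the hard-coded alpha values are illustrative assumptions; in the real model alpha is produced by a learned attention network.

```python
def stick_breaking_masks(alphas):
    """Turn per-step attention values into slot masks for a single pixel.

    alphas: K-1 floats in [0, 1], one per attention step.
    Returns K masks that are non-negative and sum to 1.
    """
    scope = 1.0  # fraction of the pixel that is still unexplained
    masks = []
    for alpha in alphas:
        masks.append(scope * alpha)  # this slot claims part of the scope
        scope *= 1.0 - alpha         # the rest is passed on to later steps
    masks.append(scope)              # the last slot takes whatever remains
    return masks


# Hypothetical attention outputs for one pixel and 5 slots
# (4 attention steps plus the remainder slot).
masks = stick_breaking_masks([0.5, 0.5, 0.2, 0.0])
print(masks)
print(sum(masks))
```

Because every slot only sees the scope left over by earlier slots, the number of steps fixes the maximum number of scene components the model can separate.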

Instructions

1. Set up environment

Create a conda environment with all the requirements (edit environment.yml if you want to change the name of the environment):

conda env create -f environment.yml

Activate the environment (on conda 4.4 or later, conda activate pytorch also works):

source activate pytorch

2. Generate data

We use Sacred both to log experiments and as a command-line interface. To generate the sprites dataset, run the following from the data folder:

python data.py generate_sprites_multi

3. Train model

With the default options, the training script trains MONet with 5 slots, using a VAE with a latent dimension of 10. Training takes around 4 hours on a GPU:

python train.py
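As a rough illustration of what the 5 slots are for, the sketch below blends per-slot reconstructions into a single image, assuming the mixture formulation from the MONet paper (output pixel = sum over slots of mask times component). It is not the code in train.py or models.py; the function name compose and the toy 3-pixel values are hypothetical.

```python
NUM_SLOTS = 5  # default number of slots stated above


def compose(masks, components):
    """Blend per-slot reconstructions into one image.

    masks[k][i]      : weight of slot k at pixel i (sums to 1 over k)
    components[k][i] : slot k's predicted intensity at pixel i
    """
    num_pixels = len(masks[0])
    return [
        sum(masks[k][i] * components[k][i] for k in range(len(masks)))
        for i in range(num_pixels)
    ]


# Toy 3-pixel grayscale image: slot 0 is the background, slot 1 is an
# object covering the last two pixels, and the other slots are unused.
unused = [[0.0, 0.0, 0.0] for _ in range(NUM_SLOTS - 2)]
masks = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]] + unused
components = [
    [0.1, 0.1, 0.1],  # background predicts a value even where it is hidden
    [0.9, 0.9, 0.9],
] + unused

image = compose(masks, components)
print(image)
```

Note that the background component still carries a value at the pixels covered by the object; only its mask is zero there. This is what lets a slot describe an object in full even when part of it is occluded.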

Also check out the notebooks folder for examples with pretrained models.

