phlippe/InvertibleUT

Combining Universal Transformer and Flow-based models

Experiment 1: Invertible Universal Transformer

This experiment tests whether invertible neural networks (i-RevNet, Reversible ResNet) can be combined with the Universal Transformer (UT). Because a reversible block can reconstruct its inputs from its outputs, intermediate activations do not need to be stored during the forward pass. This enables memory-efficient backpropagation and allows us to train the UT on GPUs with less memory.

  • Implement the first version
  • Translate UT parameters to invertible UT parameters (using half of the hidden size per partition)
  • Verify the current implementation
  • Check the parameter setting for attention (might also reduce the channel size)
  • Add a parameter for deciding whether to share the layers or not
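The memory-saving idea above rests on additive coupling: the hidden state is split into two halves that are updated alternately, and the block's exact inverse recovers the inputs from the outputs, so activations need not be cached for backpropagation. A minimal NumPy sketch, where `f` and `g` are placeholder functions standing in for the UT's attention and transition sub-layers (an assumption, not the repository's actual code):

```python
import numpy as np

def forward(x1, x2, f, g):
    """Additive coupling block: y1 = x1 + f(x2), y2 = x2 + g(y1)."""
    y1 = x1 + f(x2)
    y2 = x2 + g(y1)
    return y1, y2

def inverse(y1, y2, f, g):
    """Exact inverse: reconstruct the inputs from the outputs alone."""
    x2 = y2 - g(y1)
    x1 = y1 - f(x2)
    return x1, x2

# Stand-ins for the attention / feed-forward sub-layers.
rng = np.random.default_rng(0)
W_f = rng.standard_normal((4, 4))
W_g = rng.standard_normal((4, 4))
f = lambda h: np.tanh(h @ W_f)
g = lambda h: np.tanh(h @ W_g)

# Split a hidden state of size 8 into two halves of size 4
# (this is the "half of hidden size" translation from the checklist).
x = rng.standard_normal((2, 8))
x1, x2 = x[:, :4], x[:, 4:]

y1, y2 = forward(x1, x2, f, g)
r1, r2 = inverse(y1, y2, f, g)
assert np.allclose(r1, x1) and np.allclose(r2, x2)
```

During training, a framework hook (e.g. a custom autograd function) would discard `x1, x2` after the forward pass and recompute them via `inverse` when gradients are needed, trading extra compute for a constant activation memory footprint.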
