Unsupervised Learning from Video with Deep Neural Embeddings

Please see codes in build_data to prepare different datasets, you need to have kinetics at least to run the training. After that, please see codes in tf_model to train the model and evaluate it. Finally, check show_results.ipynb in notebook folder to see how the training progress can be checked and compared to our training trajectory.

Pretrained weights for VIE-3DResNet (updated 12/31/2020)

Weights can be downloaded at this link.

How to get responses from intermediate layers

Check function test_video_model in script tf_model/generate_resps_from_ckpt.py. The outputs will be stored in a dictionary, with keys like encode_x (x is from 1 to 10). Layer encode_1 is the output of the first pooling layer. The other layers are outputs from the following residual blocks (ResNet18 has 9 residual blocks in total). The output is in shape (batch_size, channels, temporal_dim, spatial_dim, spatial_dim).

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
build_data		build_data
misc		misc
notebook		notebook
pt_loader		pt_loader
tf_model		tf_model
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Unsupervised Learning from Video with Deep Neural Embeddings

Pretrained weights for VIE-3DResNet (updated 12/31/2020)

How to get responses from intermediate layers

About

Uh oh!

Releases

Packages

Languages

remullo/VIE

Folders and files

Latest commit

History

Repository files navigation

Unsupervised Learning from Video with Deep Neural Embeddings

Pretrained weights for VIE-3DResNet (updated 12/31/2020)

How to get responses from intermediate layers

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages