Supervised Grammar Variational Autoencoder

This repository contains the code used in the paper: https://doi.org/10.1021/acs.jcim.1c01573
This code is inspired by the paper Grammar Variational Autoencoder, whose code can be found at https://github.com/mkusner/grammarVAE.

A more comprehensible version of the model, implemented in PyTorch, is being developed here: https://github.com/Monge88/pytorch-sgvae

Creating the dataset

To create the molecular dataset, use:

python make_qm9_dataset_grammar.py

Training the model

When training the model you can specify the number of epochs, batch size, latent space dimension and the property to be used. Call:

python model.py --epochs=100 --batch=256 --latent_dim=56 --property=energy_of_LUMO

The name of the property must match one of the column names in the QM9_STAR.pkl data file. To check the column names in the data file, use:

import pickle

# Load the pickled DataFrame and list its column names.
with open("data/QM9_STAR.pkl", "rb") as f:
    df = pickle.load(f)

print(list(df.columns))
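
Because a misspelled property name only fails once a script is running, it can help to check it first. The following is a minimal sketch, not part of the repository: `check_property` is a hypothetical helper, and the example column list is invented for illustration (only `energy_of_LUMO` appears in this README). In practice, pass `list(df.columns)`.

```python
import difflib


def check_property(name, columns):
    """Return `name` if it is a known column; otherwise raise with suggestions."""
    columns = [str(c) for c in columns]
    if name in columns:
        return name
    # Suggest similarly spelled columns (matching is case-sensitive).
    close = difflib.get_close_matches(name, columns, n=3)
    hint = f" Did you mean: {', '.join(close)}?" if close else ""
    raise ValueError(f"Unknown property {name!r}.{hint}")


# Hypothetical column names for illustration; use list(df.columns) in practice.
example_columns = ["smiles", "energy_of_LUMO", "energy_of_HOMO"]
print(check_property("energy_of_LUMO", example_columns))
```

The check is an exact string comparison, so `energy_of_lumo` would be rejected and the error message would list the closest column names.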

Testing the model

To plot the latent space and to train and test the property prediction model, call:

python encode_decode_qm9.py --epochs=100 --batch=256 --latent_dim=56 --property=energy_of_LUMO

Note that you must pass the same arguments that were used to train the model, because they are part of the weights file name.

Finally, to test the model's prior validity and reconstruction accuracy, call:

python prior_validity.py --epochs=100 --batch=256 --latent_dim=56 --property=energy_of_LUMO
python reconstruction.py --epochs=100 --batch=256 --latent_dim=56 --property=energy_of_LUMO
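
Since every script has to receive the same four arguments, one way to avoid mismatches is to define them once and generate all the command lines from them. The sketch below is not part of the repository: `shared_flags` and `build_commands` are hypothetical helpers, and the default values are simply the ones used in the examples above.

```python
import shlex

# Scripts from this README that take the same four flags.
SCRIPTS = [
    "model.py",
    "encode_decode_qm9.py",
    "prior_validity.py",
    "reconstruction.py",
]


def shared_flags(epochs, batch, latent_dim, prop):
    """Build the flag list that must be identical across all scripts."""
    return [
        f"--epochs={epochs}",
        f"--batch={batch}",
        f"--latent_dim={latent_dim}",
        f"--property={prop}",
    ]


def build_commands(epochs=100, batch=256, latent_dim=56, prop="energy_of_LUMO"):
    """Return one argument list per script, all sharing the same flags."""
    flags = shared_flags(epochs, batch, latent_dim, prop)
    return [["python", script, *flags] for script in SCRIPTS]


for cmd in build_commands():
    print(shlex.join(cmd))
```

Each argument list can be passed to `subprocess.run(cmd, check=True)` from the repository root to run the steps in order.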
