Interpretable Machine Learning Unveils Carbonic Anhydrase Inhibition via Conformal and Counterfactual Prediction

All the codes to reproduce the paper.

Citation

For now, please cite the preprint version.

Contact

Milad Rayka, milad.rayka@yahoo.com
Masoumeh Shams, masoumehshams.gh@gmail.com

Install

1- Clone hca_ml Github repository.

git clone https://github.com/miladrayka/hca_ml.git

2- Change directory to hca_ml and make a new environment from the cheminf_env.yaml file by Mamba package manager:

mamba env create -f cheminf_env.yaml

Usage

To reproduce all results, tables, and figures, uncompress the Data.tar.xz and Results.tar.xz folders and refer to workflow.ipynb.

CAInsight GUI

CAInsight is an interpretable and uncertainty-aware machine learning software designed to predict the activity of human carbonic anhydrase (hCA) isoforms. Specifically, we focus on predicting the activity of three isoforms: hCA II, hCA IX, and hCA XII.

The primary model relies on a Support Vector Machine (SVM) in conjunction with an Extended Connectivity Fingerprint (ECFP). Each hCA isoform has its own SVM-ECFP binary classifier that returns labels indicating whether they are active or inactive. We enhance our models with conformal prediction (CP), which quantifies the uncertainty in our predictions. In this context, CP can return an active label, an inactive label, a combination of both labels, or an empty set, depending on a specified epsilon value. Lastly, we employ counterfactual explainability (see exmol) to enhance the interpretability of our model.

To run CAInsight, change directory to hca_ml, then type the following in the terminal:

streamlit run gui.py

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
__pycache__		__pycache__
Data.tar.xz		Data.tar.xz
LICENSE		LICENSE
Logo.png		Logo.png
README.md		README.md
Results.tar.xz		Results.tar.xz
backend.py		backend.py
cheminf_env.yml		cheminf_env.yml
conformal_prediction.py		conformal_prediction.py
data_split.py		data_split.py
feature_generation.py		feature_generation.py
gin.py		gin.py
gui.py		gui.py
hp_optimization.py		hp_optimization.py
mcnemar_test.py		mcnemar_test.py
neural_network.py		neural_network.py
pipeline.py		pipeline.py
plots.py		plots.py
rdkit_descriptors.txt		rdkit_descriptors.txt
retrieve_data.py		retrieve_data.py
title_logo.png		title_logo.png
train_test.py		train_test.py
utils.py		utils.py
workflow.ipynb		workflow.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Interpretable Machine Learning Unveils Carbonic Anhydrase Inhibition via Conformal and Counterfactual Prediction

Citation

Contact

Install

Usage

CAInsight GUI

Copy Right

About

Uh oh!

Languages

License

miladrayka/hca_ml

Folders and files

Latest commit

History

Repository files navigation

Interpretable Machine Learning Unveils Carbonic Anhydrase Inhibition via Conformal and Counterfactual Prediction

Citation

Contact

Install

Usage

CAInsight GUI

Copy Right

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages