Affinity2Vec: Drug-Target Binding Affinity Prediction Method Developed using Representation Learning, Graph Mining, and Machine Learning

This repositery provides an implementation of Affinity2Vec tool which is described in a research paper:

Scientific Report Journal

Received: 22 June 2021
Accepted: 08 March 2022
Published: 19 March 2022

https://doi.org/10.1038/s41598-022-08787-9

This code is implemented using Python 3.8.

For any qutions please contact the first author:

Maha A. Thafar

Email: maha.thafar@kaust.edu.sa

Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center, Computer (CBRC), King Abdullah University of Science and Technology (KAUST)
Collage of Computers and Information Technology, Taif University (TU).

Getting Started

Prerequisites:

There are several required Python packages to run the code:

gensim
numpy
Scikit-learn
keras
deepchem
protVec
xgboost
pandas

These packages can be installed using pip or conda as the follwoing example

pip install -r requirements.txt

Files Description:

There are 5 folders:

1.Input folder: that includes two folder for 2 datasets include:

Davis dataset,
KIBA dataset,
where each one of these folder has all required data of drug-target binding affinity (in Adjacency matrix format), drug-drug and target-target similarities in (square matrix format), the drugs' SMILES in dictionary format with drugs' IDs, and the proteins' amino-acid sequences in dictionary format with proteins' IDs

2.Embedding folder: that has two folders coressponding for 2 datasets, each folder contains the generated seq2seq embeddings for drugs, and generated ProtVec embeddings for proteins.

3.aupr folder: to convert the data first to binary and then calculate aupr evaluation metric

4.Code_to_generate_Embeddings folder: we add seq2seq model code and ProtVec model code that are necessory to generate the embeddings

5. Predictions Figures folder: These two. figures represent the binding affinities predicted by Affinity2Vec best model vs. actual binding affinity values for Davis and KIBA datasets

6. PDBBind_Refined folder: This folder has all materials related to PDBBind Refined dataset. It also has the generated embeddings for all Compounds' SMILES and Proteins' amino-acide sequences

There are 6 files:

(two main functions, one main for each dataset, and the other functions are same for all datasets which are imported in each main function)

training_functions.py --> for several training and processing functions such as Cosine_similarity, normalization, etc.
pathScores.py --> to calculate and return all meta-path scores for 6 path structures
evaluation.py --> define all evalution metrics used in our experments.
2 main functions, one for each dataset:

Affinity2Vec_Davis.py

Affinity2Vec_KIBA.py

Jupyter noteboook for Affinity2Vec models using PDBBind Refined dataset

Installing:

To get the development environment runining, the code get 2 parameteres from the user which is the dataset name and the model version (the defual dataset is nr) run:

python Affinity2Vec_Davis.py

python Affinity2Vec_KIBA.py

More details:

about the source code that we utilized to generate the drugs' SMILES embedding, please refere to the main source code:

https://github.com/XericZephyr/seq2seq-fingerprint

about proteins' amio-acid sequences embeddings please refere to the main source:

https://github.com/ehsanasgari/Deep-Proteomics

To cite this work:

Thafar, M.A., Alshahrani, M., Albaradei, S. et al. Affinity2Vec: drug-target binding affinity prediction through representation learning, graph mining, and machine learning. Sci Rep 12, 4751 (2022). https://doi.org/10.1038/s41598-022-08787-9

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
EMBED		EMBED
Input		Input
PDBBind_Refined		PDBBind_Refined
Predictions Figures		Predictions Figures
Trained_models		Trained_models
aupr		aupr
Affinity2Vec_Davis.py		Affinity2Vec_Davis.py
Affinity2Vec_Kiba.py		Affinity2Vec_Kiba.py
Affinity2Vec_PDBBind_generate_EMBED_Full_Code.ipynb		Affinity2Vec_PDBBind_generate_EMBED_Full_Code.ipynb
README.md		README.md
evaluation.py		evaluation.py
pathScores_functions.py		pathScores_functions.py
requirements.txt		requirements.txt
training_functions.py		training_functions.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Affinity2Vec: Drug-Target Binding Affinity Prediction Method Developed using Representation Learning, Graph Mining, and Machine Learning

This repositery provides an implementation of Affinity2Vec tool which is described in a research paper:

Getting Started

Prerequisites:

Files Description:

There are 5 folders:

There are 6 files:

Installing:

More details:

To cite this work:

About

Releases

Packages

Languages

MahaThafar/Affinity2Vec

Folders and files

Latest commit

History

Repository files navigation

Affinity2Vec: Drug-Target Binding Affinity Prediction Method Developed using Representation Learning, Graph Mining, and Machine Learning

This repositery provides an implementation of Affinity2Vec tool which is described in a research paper:

Getting Started

Prerequisites:

Files Description:

There are 5 folders:

There are 6 files:

Installing:

More details:

To cite this work:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages