EquiVS

Codes and data for "Employing Molecular Conformations for Ligand-based Virtual Screening with Equivariant Graph Neural Network and Deep Multiple Instance Learning"

Reference

If you make advantage of the EquiVS model and our benchmark dataset proposed in our paper, please consider citing the following in your manuscript:

Gu, Y.; Li, J.; Kang, H.; Zhang, B.; Zheng, S. Employing Molecular Conformations for Ligand-Based Virtual Screening with Equivariant Graph Neural Network and Deep Multiple Instance Learning. Molecules 2023, 28, 5982. https://doi.org/10.3390/molecules28165982

Benchmark Dataset

Download our released dataset files from Google Drive. Then unzip the data folder to this repo. The files are as shown:

data

molecules
The molecular conformer SDF files.

target
The sub-datasets (CSV files) for specific targets, columns include 'Activity type', 'Ligand name', 'SMILES', 'Conformer_path', etc.

molecule_structure.csv
The mapping file for each molecule SMILES and its corresponding conformer path in 'data' folder.

Our bioactivity prediction dataset is curated from "A consensus compound/bioactivity dataset for data-driven drug design and chemogenomics"(https://zenodo.org/record/6398019#.YwuT93ZByUk). The data preprocessing cases can be found in data_preprocessing.ipynb.

EquiVS model

Environment Requirement

torch==1.8.0
dgl==0.5.2

Train or evaluate EquiVS model

Simply reproduce our model:

sh bash.sh

Note that the TARGET_processed folder will be created in data/task/ to save molecular graphs (dgl_graph.bin) and labels (info.pkl).
OR:

python main.py -id {DEVICE_ID} -sp {SAVE_PATH} -se {SEED} -mo {MODE: train OR eval}

For more (general, training, and model) arguments, please see main.py.

The EquiVS model will automatically train on all stored sub-dataset in data/target folder. The training results, models, and predictions will be stored in result/{SAVE_PATH} folder.

Optimal conformer discovery

Please refer the model interpretation case in model_interpret.ipynb.

Contact

We welcome you to contact us (email: yg3191@nyu.edu) for any questions and cooperations.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
README.md		README.md
bash.sh		bash.sh
conformer.py		conformer.py
data_preprocessing.ipynb		data_preprocessing.ipynb
eval.py		eval.py
layer.py		layer.py
main.py		main.py
model.py		model.py
model_interpret.ipynb		model_interpret.ipynb
model_interpret.py		model_interpret.py
model_structure.png		model_structure.png
preprocessing.py		preprocessing.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EquiVS

Reference

Benchmark Dataset

EquiVS model

Environment Requirement

Train or evaluate EquiVS model

Optimal conformer discovery

Contact

About

Releases 1

Packages

Languages

gu-yaowen/EquiVS

Folders and files

Latest commit

History

Repository files navigation

EquiVS

Reference

Benchmark Dataset

EquiVS model

Environment Requirement

Train or evaluate EquiVS model

Optimal conformer discovery

Contact

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages