Skip to content

Latest commit

 

History

History
9 lines (7 loc) · 649 Bytes

README.md

File metadata and controls

9 lines (7 loc) · 649 Bytes

Bioinformatics-Drug-Discovery

  • Collecting the data(from the ChEMBL database) and preprocessing for SARS coronavirus 3C-like proteinase.
  • Exploratory analysis(chemical space analysis in this case) via the Lipinski descriptors to evaluate the druglikeness of a compound.
  • Calculated bioactivity fingerprint descriptors from the PaDEL descriptor and stored them in a data frame for further model building.
  • Regression model building with random forest, scatter plot between experimental and predicted pIC50 values.
  • Comparison of several machine learning models using lazypredict, visualisation and deployment(currently under process).