Published November 29, 2021 | Version v1
Journal article Open

Protein domain-based prediction of drug/compound–target interactions and experimental validation on LIM kinases

  • 1. Hacettepe University
  • 2. Middle East Technical University
  • 3. University College Dublin
  • 4. University of Durham
  • 5. European Bioinformatics Institute
  • 6. University of Chicago

Description

Predictive approaches such as virtual screening have been used in drug discovery with the objective of reducing developmental time and costs. Current machine learning and network-based approaches have issues related to generalization, usability, or model interpretability, especially due to the complexity of target proteins' structure/function, and bias in system training datasets. Here, we propose a new method "DRUIDom" (DRUg Interacting Domain prediction) to identify bio-interactions between drug candidate compounds and targets by utilizing the domain modularity of proteins, to overcome problems associated with current approaches. DRUIDom is composed of two methodological steps. First, ligands/compounds are statistically mapped to structural domains of their target proteins, with the aim of identifying their interactions. As such, other proteins containing the same mapped domain or domain pair become new candidate targets for the corresponding compounds. Next, a million-scale dataset of small molecule compounds, including those mapped to domains in the previous step, are clustered based on their molecular similarities, and their domain associations are propagated to other compounds within the same clusters. Experimentally verified bioactivity data points, obtained from public databases, are meticulously filtered to construct datasets of active/interacting and inactive/non-interacting drug/compound–target pairs (~2.9M data points), and used as training data for calculating parameters of compound–domain mappings, which led to 27,032 high-confidence associations between 250 domains and 8,165 compounds, and a finalized output of ~5 million new compound–protein interactions. DRUIDom is experimentally validated by syntheses and bioactivity analyses of compounds predicted to target LIM-kinase proteins, which play critical roles in the regulation of cell motility, cell cycle progression, and differentiation through actin filament dynamics. We showed that LIMK-inhibitor-2 and its derivatives significantly block the cancer cell migration through inhibition of LIMK phosphorylation and the downstream protein cofilin. One of the derivative compounds (LIMKi-2d) was identified as a promising candidate due to its action on resistant Mahlavu liver cancer cells. The results demonstrated that DRUIDom can be exploited to identify drug candidate compounds for intended targets and to predict new target proteins based on the defined compound–domain relationships. Datasets, results, and the source code of DRUIDom are fully-available at: https://github.com/cansyl/DRUIDom.

Data availability

Datasets, results, and the source code of DRUIDom are fully-available at: https://github.com/cansyl/DRUIDom.

Files

journal.pcbi.1009171.pdf

Files (3.7 MB)

Name Size Download all
Supporting information
md5:bac7373ed4d39727bace9b25d7632c75
511.2 kB Preview Download
Article
md5:75c1cea6fb19b1fcbd3cc29a7a92904b
3.2 MB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pcbi.1009171
Other
oai:uchicago.tind.io:14822

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Medicine