This repository contains code and data for the ASAP Discovery x OpenADMET Antiviral Ligand Poses Challenge submission, focusing on computational docking of potential antiviral compounds against MERS-CoV and SARS-CoV-2 Mpro targets.
- Clone this repository:
git clone https://github.com/fulopjoz/polaris-antiviral-submission.git
cd polaris-antiviral-submission
- Make the setup script executable and run it:
chmod +x setup.sh
./setup.sh
- Run the docking pipeline:
conda activate ezdock
python easydock_pipeline_sub.py
The pipeline will create all necessary directories and output files automatically.
Important Notes:
- If you want to rerun the docking process, you must first delete the existing
.db
files in theoutput_cmd
directory, otherwise EasyDock will use the cached results - With the current configuration (exhaustiveness=320), the docking process takes approximately 3 hours using 30 CPU cores
I developed a docking and pose prediction pipeline using EasyDock with a substructure-based alignment approach. The pipeline performs the following steps:
- Docking: Uses EasyDock with Vina to generate initial poses for both MERS-CoV Mpro and SARS-CoV-2 Mpro targets
- Pose Extraction: Extracts 3D poses from the resulting SQLite databases
- Substructure Alignment: Computes RMSD based on isoquinoline substructure matching against reference ligands
- Filtering and Reranking: Filters and reranks poses based on RMSD values with a tolerance of 0.5Å
- Final Pose Selection: Creates the final pose predictions by matching SMILES from the test set
- Substructure Matching: Uses isoquinoline substructure pattern for alignment
- RMSD Calculation: Implements a quick RMSD calculation based on matched substructure atoms
- Pose Filtering: Selects poses with RMSD < 2.0Å when possible
- Fallback Mechanism: Includes fallback options when no poses meet the RMSD threshold
- RDKit 2024.09.6
- EasyDock 0.4.dev0
- NumPy 1.26.4
- Pandas 2.2.2
The main pipeline script is easydock_pipeline_sub.py
, which can be run to reproduce the entire workflow.
data/
: Contains input SMILES and configuration filesposes/
: Contains output pose files at various stages of processingoutput_cmd/
: Contains docking results databasesligand_poses_reference_structures/
: Contains reference ligand structures for alignment
This work utilizes the following tools and methods:
- Repository: https://github.com/ci-lab-cz/easydock
- License: BSD 3-clause
- Citation: Minibaeva, G., Ivanova, A. & Polishchuk, P. EasyDock: customizable and scalable docking tool. J Cheminform 15, 102 (2023). https://doi.org/10.1186/s13321-023-00772-2
- Citation: Trott O, Olson AJ. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem. 2010 Jan 30;31(2):455-61. doi: 10.1002/jcc.21334. PMID: 19499576; PMCID: PMC3041641.
Jozef Fulop
This project is licensed under the MIT License - see the LICENSE file for details.
This project uses EasyDock which is licensed under the BSD 3-clause license. See the EasyDock repository for details.