Docking At UTMB


What is Docking At UTMB?
Docking At UTMB is a web-resource that provides controlled access to molecular docking software running on a cluster of computers at UTMB. Approved users can login, upload a protein PDB file along with an active-site specification, and dock that protein against one of several small-molecule libraries.
How Can I Get Access to Docking At UTMB?
For temporary access to perform single protein-ligand dockings, login as user "anonymous" (password "anonymous").
To request an individual account to dock against virtual libraries, please e-mail Dr. Stan Watowich at watowich@xray.utmb.edu.
How Do You Suggest Using Docking At UTMB?
Preamble: Computer docking and virtual screening are inexact, but potentially very valuable, tools. This site is intended to provide easy access to researchers wishing to perform small numbers of docking or virtual screening experiments, but who do not have the necessary computer resources and/or computational biology backgrounds. This interface can handle most protein-ligand docking experiments, but there will always be systems where this simple interface will fail. For those cases, researchers will need to develop the necessary expertise or collaborations to perform the experiment.

Suggested Use: Before beginning any virtual screening experiment against their target protein, researchers should (must) validate that the docking software is working correctly for their system. The target protein can frequently be found in the Protein Data Bank as a cocrystal structure with a bound ligand. The ligand should be extracted from the cocrystal structure and used in a single docking experiment against the target protein. If the docked structure predicted by this test does not reproduce the cocrystal structure, there is little point performing a virtual screen with the target protein. This test can identify problems such as incorrect His charging in the active site, incorrect search space, missing cofactors, etc. Manually correct these problem and repeat the docking test.

If the above validation proves successful, researchers can request an individual account (e-mail Dr. Stan Watowich) in order to screen their target protein against either a Maybridge "soluble" drug-like library of 5,163 small molecules, the MayBridge HitFinder library of 14,400 small molecules, or a Chembridge "soluble" drug-like library of 23,311 small molecules. This virtual screening experiment will return predictions of good binding ligands. These ligands can be purchased from Maybridge or Chembridge for wet-bench validation. Success rates (i.e., binders found from experimental testing vs number of computer predtions tested) are quite variable, but can range as high as 40% for some test systems. See the question "How good is the docking software" for enrichment details for a number of test systems.

If researchers have screened the Maybridge and/or Chembridge libraries and experimentally validated the Autodock Vina predictions, they can request access to screen our virtual library of ~600,000 commercially-available drug-like molecules.

What Docking Software is Used?
Autodock Vina [1], written by O. Trott and A. Olson at the Scripps Institute is used to perform the actual docking. This open-source program has been made freely available by the authors.

AutodockTools, written by G. Morris and A. Olson at the Scripps Institute is used to convert proteins and ligands to the format required by AutoDock Vina. This open-source program has been made freely available by the authors.

[1]
AutoDock Vina: Improving the Speed and Accuracy of Docking with a New Scoring Function, Efficient Optimization and Multithreading
O. Trott and A. J. Olson
Journal of Computational Chemistry 31 (2010) 455-461
How Good is the Docking Software?
Before using Autodock Vina for this project it was tested in-house against the Directory of Useful Decoys (DUD) Database [1], a database designed specifically for rigorously testing docking algorithms. The testing protocol was taken from from an earlier paper [2] written at Wyeth Research that tested six other molecular docking programs against the DUD database. The results showed Autodock Vina to be a strong competitor against the other programs, and at the top of the pack in many cases.

[1]
Benchmarking Sets for Molecular Docking
N. Huang, B. K. Shoichet, and John Irwin
J. Med. Chem., 2006, 49(23), pp 6789-6801

[2]
Comparison of Several Molecular Docking Programs: Pose Prediction and Virtual Screening Accuracy
J. B. Cross, D. C. Thompson, B. K. Rai, J. C. Baber, K. Y. Fan, Y. Hu, and C. Humblet
J. Chem. Inf. Model., 2009, 49, 1455-1474
What Parameters Do I Input?
Job Name A single word project name/descriptor.
Protein File Hit the "Choose File" icon and upload a protein file from your computer.
Active Site Enter an {x, y, z} to designate the center of the search volume.
Typically, this coordinate is the center of the protein's active site.
Grid Size Specify, in angstroms, the size of the box around the active site to search within.
Too large a value can lead to inaccurate binding positions outside of the active site.
Database Enter the ligand database to search or upload a single ligand file from your computer.
What Will Docking At UTMB Actually Return to Me?
When used to dock a single ligand, Docking At UTMB will return:

Log A log file describing the tasks performed and noting any warnings or errors.
DockedLigand.pdb A file containing the five best (i.e. lowest docking scores) poses of the docked ligand in PDB format.
DockedLigand.log A file containing the Vina docking scores for the five best poses.
Poses with similar docking scores should be treated as equally likely.
Protein.pdbqt A file containing the target protein in PDBQT format.

When used to dock against one of the provided ligand libraries, Docking At UTMB will return:

Log A log file describing the tasks performed and noting any warnings or errors.
Report A file listing the top 500 binders, sorted such that the best (lowest) score is first.
The ligands are referenced by ZINC codes, and these can be extracted and pasted into the ZINC web site
to return chemical structures and catalog numbers.
TopResults/ A folder containing the top 500 binders, each with their top five poses in PDB format.
Additionally, a separate log file for each binder containing the Vina scores for it's five best poses.
Protein.pdbqt A file containing the target protein in PDBQT format.

What Small-Molecule Libraries are Available?
Maybridge "soluble" drug-like library (5,164 molecules):
Chembridge "soluble" drug-like library (23,311 molecules):
MayBridge HitFinder library (14,400 molecules):
Custom Drug-Like ZINC Subset (597,000 molecules):
Are Any Changes Made to My Protein Before it is Docked?
Before being utilized by Autodock Vina, any uploaded PDB protein is - by necessity - converted to the PDBQT format. This involves stripping all heteroatoms (including waters), re-protonating all atoms, and reassigning Gasteiger charges to all atoms. This conversion is done using tools (prepare_receptor4.py) provided in AutoDockTools. The PDBQT formatted protein is eventually returned as part of the results for your inspection.

If you wish to avoid any of these steps, provide your protein file already in PDBQT format. None of the above processing steps are performed on protein files already in the PDBQT format. Information on how to generate PDBQT files for proteins (including assigning hydrogens to His residues) can be found in the AutoDockTools manual and tutorial.

What Should I Know About the "Upload Single Ligand" Option?
Single ligands can be uploaded in PDB, MOL2, or PDBQT format. Ligands uploaded in PDB or MOL2 format are automatically converted into PDBQT format. This involves re-protonating all atoms, reassigning Gasteiger charges to all atoms, and algorithmically determining which bonds are rotatable. Note that some functional groups, including F, do not seem to be processed by AutoDockTools.

If you wish to avoid the automatic conversion process (e.g., if your ligand generates an error message during conversion or functional groups were not converted correctly), provide your ligand file already in PDBQT format. None of the above processing steps are performed on ligand files already in the PDBQT format. AutoDockTools (see above) can be used to interactively convert ligands to PDBQT format.

Ligand files are limited to 200 total atoms and 15 total rotatable bonds.

How Long will the Docking Take?
Docking times will vary according to the library you select. Currently the Test library takes ~15 minutes, the Maybridge 5K library takes ~12 hours, the MayBridge HitFinder library takes ~2.5 days, the Chembridge 23K library takes ~3 days, and the ZINC library takes roughly 90 days.

We are moving this portal to the supercomputer at the University of Texas (TACC), and this should improve performance by a factor of ~500.