From 849ad35f413e579e656efc8b6d03bedaac21cb34 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:04:54 -0700 Subject: [PATCH 1/9] Fix typo in README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index fa323949..0132032c 100644 --- a/README.md +++ b/README.md @@ -34,7 +34,7 @@ There are no formal releases yet, so here we only note particularly important ch ## Challenge overview -The SAMPL7 phase of challenges currently includes host-guest binding on three systems: A pair of Gibb Deeep Cavity Cavitands (GDCCs), a new "TrimerTrip" molecule from Lyle Isaacs and his group, and a series of cyclodextrin derivatives from Mike Gilson's group. Each host binds one or more guests, and each system involves a total of 9-20 binding free energy calculations. +The SAMPL7 phase of challenges currently includes host-guest binding on three systems: A pair of Gibb Deep Cavity Cavitands (GDCCs), a new "TrimerTrip" molecule from Lyle Isaacs and his group, and a series of cyclodextrin derivatives from Mike Gilson's group. Each host binds one or more guests, and each system involves a total of 9-20 binding free energy calculations. Additional details are provided below. A later phase of SAMPL7 is expected to include logD prediction (hopefully with pKa values provided) for a series of small moleculs in several solvents; data is currently being collected in partnership with GSK. From 4fb919bf4f9e21a1b622375820bc0153575407f3 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:19:53 -0700 Subject: [PATCH 2/9] Add disclaimers/info to README.md --- README.md | 11 ++++++----- 1 file changed, 6 insertions(+), 5 deletions(-) diff --git a/README.md b/README.md index 0132032c..b1aff470 100644 --- a/README.md +++ b/README.md @@ -21,16 +21,17 @@ All three host-guest components of this challenge are now final and launched, th - Submission file formats - GSK logD challenge information as soon as available +## Disclaimers: +- As usual, we make no warranty as to correctness of protonation states, tautomers, conformations and poses provided in these directories. In some cases the most relevant such states may not be known, or multiple states perhaps should be considered. Please exercise caution and due diligence. +- We make an effort to indicate which files are original source files, and which are derived files, so that participants can refer to the original source files to help resolve any uncertainties. We encourage participants to do so. +- While we make every effort to ensure correctness of the files we provide, it is not uncommon for there to be some errors. *Please* sign up for our e-mail list, since if any critical bugs are found, we will e-mail out appropriate announcements. + ## Changes and Data Set Versions ### Release versions +- **Release 0.1** (July 22, 2019): Finalizes all three host-guest systems and provides sdf, mol2 and PDB files for all guests. Fixes several critical bugs, including **fixing several incorrect cyclodextrin-derivative host structure files**, **fixing errors in a draft TrimerTrip structure file**, **fixing the SMILES string for TrimerTrip guest `g15`**, and **finalizing TrimerTrip guest list**. ### Changes not in a release -There are no formal releases yet, so here we only note particularly important changes. -- **Critical bug**: Isaacs/TrimerTrip guest `g15` had an incorrect SMILES string as originally provided. This was corrected in the provided input files on July 18, 2019. -- **Finalized systems**: Finalized details of TrimerTrip hosts from Lyle Isaacs, July 22, 2019. Materials added this date included several modifications to which guests are being considered relative to what had been provided earlier. GDCC and cyclodextrin challenges were finalized at an earlier date. -- **Critical bug**: Isaacs/TrimerTrip host draft 3D structure files (mol2, sdf, PDB) had incorrect host due to a bond perception error in generating the files initially. -- **Critical bug**: Cyclodextrin hosts 9, 24 and 36 had incorrect starting PDB files (and thus host mol2/sdf files as well) due to errors in their construction in the Gilson lab. 2D structures of these compounds were correct, however. This has now been fixed. ## Challenge overview From e74e8d1043c0334ea8eeddee818e7ebdc75de207 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:21:06 -0700 Subject: [PATCH 3/9] Add disclaimer to host-guest README --- host_guest/README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/host_guest/README.md b/host_guest/README.md index 23d4ea58..5b5e8c39 100644 --- a/host_guest/README.md +++ b/host_guest/README.md @@ -4,3 +4,5 @@ - `Isaacs_clip`: Files for the TrimerTrip challenge, on a clip shaped host from Lyle Isaacs which binds a series of guests - `cyclodextrin_derivative`: Files for the Gilson lab's cyclodextrin derivative challenge, on a series of *hosts* binding two different guests. - `scripts`: Scripts used in preparing some of the files present here and doing other processing. + +Note that we have typically selected reasonable protonation states and conformers of the hosts and guests, but these may be controversial, uncertain, or change upon binding, so participants are encouraged to exercise care in selecting which states are modeled. From 077ebef517e5e2a9a586a636d57de895fe1d45db Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:24:12 -0700 Subject: [PATCH 4/9] Remove tentative language from README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index b1aff470..6a108d8b 100644 --- a/README.md +++ b/README.md @@ -46,7 +46,7 @@ One host-guest series is based on the Gibb Deep Cavity Cavitands (GDCCs), famili ### Modified acyclic cucurbituril (TrimerTrip) binding of guests -The Isaacs lab is contributing data on binding of a series of guests to an acyclic cucubituril host, codenamed "TrimerTrip", as detailed in `host_guest/Isaacs_clip` tentatively. Guest selection is still being finalized to ensure good dynamic range, but it will likely include compounds which overlap with the GDCC and cyclodextrin-derivative challenges, with a total of roughly ten complexes being examined. +The Isaacs lab is contributing data on binding of a series of guests to an acyclic cucubituril host, codenamed "TrimerTrip", as detailed in `host_guest/Isaacs_clip`. Guests include compounds which overlap with the GDCC and cyclodextrin-derivative challenges, with a total of roughly 15 complexes being examined. ### The cyclodextrin derivatives challenge From 1f36ad20e2b34546e888e94cb0611618483b5103 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:26:48 -0700 Subject: [PATCH 5/9] Add Isaacs disclaimers --- host_guest/Isaacs_clip/README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/host_guest/Isaacs_clip/README.md b/host_guest/Isaacs_clip/README.md index 471833d8..5a342114 100644 --- a/host_guest/Isaacs_clip/README.md +++ b/host_guest/Isaacs_clip/README.md @@ -11,6 +11,10 @@ Following this, the challenge is expected to launch around June 1, 2019, with ex ![](images/SAMPL7.jpg) +## Disclaimer + +Note that we have typically selected reasonable protonation states and conformers of the host and guests, but these may be controversial, uncertain, or change upon binding, so participants are encouraged to exercise care in selecting which states are modeled. In this case in particular, the host structure was [simulated for a period of time](host_files/README.md) in the absence of any guests prior to deposition, which may or may not mean that additional equilibration is needed in the presence of guests before conducting binding free energy calculations. + # What's here - Hosts: `host_files`: Folder containing PDB, MOL2, SDF and Chemdraw files for the TrimerTrip host. Contains a detailed description of input file generation for the TrimerTrip host. From dc5359cd71b19c7d121d146e710a3211ab75bb55 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:27:29 -0700 Subject: [PATCH 6/9] Add disclaimers --- host_guest/cyclodextrin_derivatives/README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/host_guest/cyclodextrin_derivatives/README.md b/host_guest/cyclodextrin_derivatives/README.md index 7f21fb8e..e7412f33 100644 --- a/host_guest/cyclodextrin_derivatives/README.md +++ b/host_guest/cyclodextrin_derivatives/README.md @@ -33,6 +33,10 @@ Binding in all systems was assayed in 25 mM pH 6.8 sodium phosphate buffer at 27 Compounds exhibiting strange binding patterns have already been removed prior to construction of this set. [Full experimental details are available.](../host_guest_description.md) +### Disclaimers + +Note that we have typically selected reasonable protonation states and conformers of the hosts and guests, but these may be controversial, uncertain, or change upon binding, so participants are encouraged to exercise care in selecting which states are modeled. + # What's here - Hosts: `host_files`: Folder containing beta cyclodextrin plus the eight other cyclodextrin derivative hosts, provided by Katy Kellett and Michael Gilson, initially only in PDB format with all protons present. The Chemdraw files are also provided. MOL2 and SDF files of the hosts were added on 7/18/19. From 4330ba0757b6973e2f8608592e132f5107da9132 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:29:48 -0700 Subject: [PATCH 7/9] Add discussion of original source files for Gibb case --- host_guest/GDCC_and_guests/README.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/host_guest/GDCC_and_guests/README.md b/host_guest/GDCC_and_guests/README.md index f7da368b..201e8040 100644 --- a/host_guest/GDCC_and_guests/README.md +++ b/host_guest/GDCC_and_guests/README.md @@ -34,8 +34,15 @@ Buffer conditions are expected to be 10 mM sodium phosphate, but this will be co For positively charged guests, chloride salts are expected to be used. +### Disclaimers + +The protonation state of the host may in some cases be not completely certain, and participants are also encouraged to carefully select guest protonation states. Protonation states and conformations may or may not match those of the files provided here. + # What's here - Hosts: `host_files`: Folder containing PDB, MOL2 and SDF files for the Octa Acid (OA) host and the exo-OA host. Contains the the jupyter notebook used to generate the MOL2 and SDF files of the guests. - Guests: `guest_files`: Folder containing files for the eight guests, SMILES strings, and the jupyter notebook used to generate PDB, MOL2 and SDF files of the guests. - Images: `images`: Folder containing images of the guest and host structures for this challenge in PDF format. + +**Source files**: +For this portion of the challenge, the original source files were two (provided) host PDB files from Paolo Suating (in the `host_files` directory) and a `SAMPL7_guests.cdx` file from Bruce Gibb (in the `guest_files` directory). All other files here were derived from those files. From 656b05de022c992c4a362eba507545318b0156a0 Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:32:17 -0700 Subject: [PATCH 8/9] Add info on surce files for TrimerTrip --- host_guest/Isaacs_clip/README.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/host_guest/Isaacs_clip/README.md b/host_guest/Isaacs_clip/README.md index 5a342114..1327f4f4 100644 --- a/host_guest/Isaacs_clip/README.md +++ b/host_guest/Isaacs_clip/README.md @@ -20,3 +20,8 @@ Note that we have typically selected reasonable protonation states and conformer - Hosts: `host_files`: Folder containing PDB, MOL2, SDF and Chemdraw files for the TrimerTrip host. Contains a detailed description of input file generation for the TrimerTrip host. - Guests: `guest_files`: Folder containing files for the 15 guests, SMILES strings, and the jupyter notebook used to generate PDB, MOL2 and SDF files of the guests. - Images: `images`: Folder containing an image of the guest and host structures for this challenge in JPG file format. + +**Source files** +In this case the original source files provided were: +- `guest_files/SAMPL7.cdx`, from Lyle Isaacs, providing a 2D structure of the host and all of the guests +- A (not deposited) `.cif` crystallographic file of the host, from which we prepared the host structure and checked it against the 2D structure. From f6c746d681b6998d36dc9070919bcba1e21bdfaf Mon Sep 17 00:00:00 2001 From: David Mobley Date: Mon, 22 Jul 2019 14:36:11 -0700 Subject: [PATCH 9/9] Add description of provided files for cyclodextrin case --- host_guest/cyclodextrin_derivatives/README.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/host_guest/cyclodextrin_derivatives/README.md b/host_guest/cyclodextrin_derivatives/README.md index e7412f33..6c25262c 100644 --- a/host_guest/cyclodextrin_derivatives/README.md +++ b/host_guest/cyclodextrin_derivatives/README.md @@ -42,3 +42,11 @@ Note that we have typically selected reasonable protonation states and conformer - Hosts: `host_files`: Folder containing beta cyclodextrin plus the eight other cyclodextrin derivative hosts, provided by Katy Kellett and Michael Gilson, initially only in PDB format with all protons present. The Chemdraw files are also provided. MOL2 and SDF files of the hosts were added on 7/18/19. - Guests: `guest_files`: Contains MOL2, PDB and SDF files and files containing the isomeric SMILES strings and codenames of the guests. - Images: `images`: Folder containing images of the guest and host structures for this challenge in JPG and PDF format. + +**Source files**: +- PDB files of all hosts from the Gilson lab, which served to generate host sdf and mol2 files +- SMILES strings for the guests, which were used to generate PDB and SDF files for the guests, as well as names + +**Auxiliary files**: +- ChemDraw files of all hosts from the Gilson lab +- mol2 files for the guests from the Gilson lab