| 1 | + |
| 2 | + |
| 3 | +```python |
| 4 | +# Default Libraries |
| 5 | +import os |
| 6 | +import StringIO |
| 7 | + |
| 8 | +# External Libraries |
| 9 | +import pandas as pd |
| 10 | +import numpy as np |
| 11 | +import matplotlib |
| 12 | +import matplotlib.pyplot as plt |
| 13 | +import seaborn as sns |
| 14 | +import networkx as nx |
| 15 | +from IPython.display import FileLink, FileLinks |
| 16 | + |
| 17 | +# Own Libraries |
| 18 | +import rstoolbox # Learn about this library at: http://lpdi-epfl.github.io/RosettaSilentToolbox |
| 19 | + |
| 20 | +import readme |
| 21 | + |
| 22 | +# Global Parameters |
| 23 | +plt.rcParams['svg.fonttype'] = 'none' |
| 24 | +rstoolbox.utils.format_Ipython() |
| 25 | +``` |
| 26 | + |
| 27 | + |
| 28 | + |
| 29 | + |
| 30 | +<style> |
| 31 | +table.dataframe { |
| 32 | + font-family: monospace; |
| 33 | +} |
| 34 | +</style> |
| 35 | + |
| 36 | + |
| 37 | + |
| 38 | +# Design of a RSVF site IV carrier from TOP7 |
| 39 | + |
| 40 | +> This summary does not track all attempts and variations, just the pathway towards obtaining the designs that were experimentally characterized. |
| 41 | + |
| 42 | + |
| 43 | +```python |
| 44 | +# Information obtained directly from scoring TOP7 with Rosetta |
| 45 | +baseline = rstoolbox.io.get_sequence_and_structure(os.path.join('selection', 'baseline', '1qys.pdb')) |
| 46 | +baseline |
| 47 | +``` |
| 48 | + |
| 49 | + |
| 50 | + |
| 51 | + |
| 52 | +<div> |
| 53 | +<style scoped> |
| 54 | + .dataframe tbody tr th:only-of-type { |
| 55 | + vertical-align: middle; |
| 56 | + } |
| 57 | + |
| 58 | + .dataframe tbody tr th { |
| 59 | + vertical-align: top; |
| 60 | + } |
| 61 | + |
| 62 | + .dataframe thead th { |
| 63 | + text-align: right; |
| 64 | + } |
| 65 | +</style> |
| 66 | +<table border="1" class="dataframe"> |
| 67 | + <thead> |
| 68 | + <tr style="text-align: right;"> |
| 69 | + <th></th> |
| 70 | + <th>score</th> |
| 71 | + <th>fa_atr</th> |
| 72 | + <th>fa_rep</th> |
| 73 | + <th>fa_sol</th> |
| 74 | + <th>fa_intra_rep</th> |
| 75 | + <th>fa_intra_sol_xover4</th> |
| 76 | + <th>lk_ball_wtd</th> |
| 77 | + <th>fa_elec</th> |
| 78 | + <th>pro_close</th> |
| 79 | + <th>hbond_sr_bb</th> |
| 80 | + <th>hbond_lr_bb</th> |
| 81 | + <th>hbond_bb_sc</th> |
| 82 | + <th>hbond_sc</th> |
| 83 | + <th>dslf_fa13</th> |
| 84 | + <th>omega</th> |
| 85 | + <th>fa_dun</th> |
| 86 | + <th>p_aa_pp</th> |
| 87 | + <th>yhh_planarity</th> |
| 88 | + <th>ref</th> |
| 89 | + <th>rama_prepro</th> |
| 90 | + <th>time</th> |
| 91 | + <th>description</th> |
| 92 | + <th>sequence_A</th> |
| 93 | + <th>structure_A</th> |
| 94 | + <th>phi_A</th> |
| 95 | + <th>psi_A</th> |
| 96 | + </tr> |
| 97 | + </thead> |
| 98 | + <tbody> |
| 99 | + <tr> |
| 100 | + <th>0</th> |
| 101 | + <td>191.625</td> |
| 102 | + <td>-498.177</td> |
| 103 | + <td>194.188</td> |
| 104 | + <td>320.719</td> |
| 105 | + <td>1.444</td> |
| 106 | + <td>19.121</td> |
| 107 | + <td>-7.686</td> |
| 108 | + <td>-112.63</td> |
| 109 | + <td>0.0</td> |
| 110 | + <td>-29.587</td> |
| 111 | + <td>-31.008</td> |
| 112 | + <td>-2.424</td> |
| 113 | + <td>-4.577</td> |
| 114 | + <td>0.0</td> |
| 115 | + <td>0.935</td> |
| 116 | + <td>296.765</td> |
| 117 | + <td>-8.478</td> |
| 118 | + <td>0.0</td> |
| 119 | + <td>22.146</td> |
| 120 | + <td>30.876</td> |
| 121 | + <td>2.0</td> |
| 122 | + <td>1qys_0001</td> |
| 123 | + <td>DIQVQVNIDDNGKNFDYTYTVTTESELQKVLNELKDYIKKQGAKRVRISITARTKKEAEKFAAILIKVFAELGYNDINVTFDGDTVTVEGQL</td> |
| 124 | + <td>LEEEEEEEELLLLEEEEEEEELLLLHHHHHHHHHHHHHHHHLLLEEEEEEELLLHHHHHHHHHHHHHHHHHLLLLEEEEEEELLEEEEEEEL</td> |
| 125 | + <td>[0.0, -91.4576, -100.939, ...]</td> |
| 126 | + <td>[-165.465, 111.061, 116.221, ...]</td> |
| 127 | + </tr> |
| 128 | + </tbody> |
| 129 | +</table> |
| 130 | +</div> |
| 131 | + |
| 132 | + |
| 133 | + |
| 134 | +## Generation of fragments to guide the folding process |
| 135 | + |
| 136 | +Structure-based fragments to guide the folding were generated by applying the RosettaScript [mkfragments](design/mkfragments.xml) to `1QYS`, the PDB entry containing **TOP7**. |
| 137 | + |
| 138 | +## Making the designs |
| 139 | + |
| 140 | +Building the **TOP7**-derived designs involves grafting RSVF's site IV epitope in the presence of its neutralizing antibody 101F ([motif.pdb](design/motif.pdb)). |
| 141 | + |
| 142 | +The process is split into several steps. |
| 143 | + |
| 144 | +### Folding |
| 145 | + |
| 146 | +Site IV was grafted into **TOP7** by means of the _NubInitioMover_ with the RosettaScript [ffl.xml](design/ffl.xml), run on a [SLURM](https://slurm.schedmd.com/)-based cluster through the submitter script [submit_ffl.sh](design/submit_ffl.sh). |
| 147 | + |
| 148 | +A total of 800 decoys were generated without allowing any sequence mutation in the template residues. The aim of this first run was to explore and find a folded conformation that would: |
| 149 | + |
| 150 | +* carry and stabilize site IV at the backbone level. |
| 151 | +* fold back to a conformation close to **TOP7**, keeping the proper beta pairing. |
| 152 | +* shift the conformation enough to be productive in presenting site IV to 101F (avoiding clashes). |
| 153 | + |
| 154 | +From all the decoys, [87_ffl_twostrand_1qys_Lys_0001](design/87_ffl_twostrand_1qys_Lys_0001.pdb) was selected as the seed for the next step of the process. The values of this decoy in the context of the design population are marked with a red line. |
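As a complement to the visual red-line marker, the position of a selected decoy within the population can also be expressed as a percentile rank. The following is a minimal sketch using only the Python standard library; the decoy names, the score values and the `percentile_rank` helper are hypothetical and are not taken from the silent file or from `rstoolbox`.

```python
from bisect import bisect_left


def percentile_rank(population, value):
    """Return the percentage of population values strictly below `value`."""
    if not population:
        raise ValueError("population must not be empty")
    ordered = sorted(population)
    return 100.0 * bisect_left(ordered, value) / len(ordered)


# Hypothetical scores for a handful of decoys (illustrative values only).
scores = {
    "decoy_a": -250.0,
    "decoy_b": -240.5,
    "decoy_c": -262.3,
    "decoy_d": -231.8,
    "decoy_e": -255.1,
}

selected = "decoy_c"  # hypothetical name standing in for the chosen seed
rank = percentile_rank(list(scores.values()), scores[selected])
print(f"{selected}: {rank:.1f}% of decoys have a lower score")
```

For score terms where lower is better, a low percentile rank means the selected decoy sits at the favorable end of the distribution.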
| 155 | + |
| 156 | + |
| 157 | +```python |
| 158 | +step1 = rstoolbox.io.parse_rosetta_file('design/top7_ffl_folding.minisilent.gz') |
| 159 | +readme.plot_step1(step1) |
| 160 | +``` |
| 161 | + |
| 162 | + |
| 163 | + |
| 164 | + |
| 165 | + |
| 166 | +### Full Design |
| 167 | + |
| 168 | +Full design was carried out by applying the RosettaScript [design_full.xml](design/design_full.xml) to [87_ffl_twostrand_1qys_Lys_0001](design/87_ffl_twostrand_1qys_Lys_0001.pdb), submitted via the SLURM script [submit_design_full.sh](design/submit_design_full.sh). This generated a total of 92 designs, from which [sd_top7_12top7_graft_0001](design/sd_top7_12top7_graft_0001.pdb) (highlighted in red in the distributions) was selected. |
| 169 | + |
| 170 | + |
| 171 | +```python |
| 172 | +fulldes = rstoolbox.io.parse_rosetta_file('design/sd_top7_', {'sequence': 'P'}) |
| 173 | +readme.plot_full(fulldes) |
| 174 | +``` |
| 175 | + |
| 176 | + |
| 177 | + |
| 178 | + |
| 179 | + |
| 180 | +### Part Design |
| 181 | + |
| 182 | +Partial design was carried out by applying the RosettaScript [design_part.xml](design/design_part.xml) to [87_ffl_twostrand_1qys_Lys_0001](design/87_ffl_twostrand_1qys_Lys_0001.pdb), submitted via the SLURM script [submit_design_part.sh](design/submit_design_part.sh). This generated a total of 100 designs, from which [sd_top7_nterm2_45top7_graft_0002](design/sd_top7_nterm2_45top7_graft_0002.pdb) (highlighted in red in the distributions) was selected. |
| 183 | + |
| 184 | + |
| 185 | +```python |
| 186 | +partdes = rstoolbox.io.parse_rosetta_file('design/sd_top7_nterm2_', {'sequence': 'P'}) |
| 187 | +readme.plot_part(partdes) |
| 188 | +``` |
| 189 | + |
| 190 | + |
| 191 | + |
| 192 | + |
| 193 | + |
| 194 | +### Remodel and Resurface |
| 195 | + |
| 196 | +Both selected designs were submitted to REMODEL to shorten the first loop of the structure and fix surface residues in order to improve the quality of the decoys, yielding the final two designs described in the paper. |
| 197 | + |
| 198 | + |
| 199 | +```python |
| 200 | +designs = readme.load_experimental() |
| 201 | +readme.data_plot(designs) |
| 202 | +``` |
| 203 | + |
| 204 | + |
| 205 | + |
| 206 | + |