Last modified by richtesn on 2023/06/15 15:51

From version 8.1
edited by richtesn
on 2021/08/02 13:32
Change comment: Uploaded new attachment "clustering_electrostratics.png", version {1}
To version 10.1
edited by richtesn
on 2021/08/02 13:36
Change comment: There is no comment for this version

Summary

Details

Page properties
Title
... ... @@ -1,1 +1,1 @@
1 -2.2 Compare electrostatic potentials surrounding a set of protein isoforms with multipipsa
1 +2.4 Compare electrostatic potentials surrounding a set of protein isoforms with multipipsa
Content
... ... @@ -9,50 +9,63 @@
9 9  
10 10  == Background ==
11 11  
12 -~* For details on computation of the electrostatic potential see :ref:`electrostatic_potential`
12 +~* For details on computation of the electrostatic potential see [[Calculate the electrostatic potential of a protein from its atomic structure>>doc:Collabs.computation-of-protein-association-rates.d.WebHome||target="_blank"]].
13 13  
14 -~* PIPSA analysis: PIPSA provides a method for quantitatively comparing the three dimensional interaction property fields of a set of structurally similar proteins. The structures of the proteins must be suitably aligned for the comparison to be reasonable. The method was first described in |doi_blomberg|. In this use case, the interaction property we consider is the electrostatic potential. In the PIPSA method, interaction potential fields are discretised on three dimensional grids, and compared at grid points lying within a skin surrounding the proteins. The skin of the protein is defined as the region beginning at a distance  :math:`\sigma` from the van-der-Waals surface of the protein's atoms with a thickness  :math:`\delta` . Commonly used parameters for this skin are :math:`\sigma =3 Angstrom and :math:`\delta = 4 Angstrom. The region to be compared can be further restricted as the intersection of this skin with a sphere or as cone (as shown in the figure below).
14 +~* PIPSA analysis: PIPSA provides a method for quantitatively comparing the three dimensional interaction property fields of a set of structurally similar proteins. The structures of the proteins must be suitably aligned for the comparison to be reasonable. The method was first described in [[ Blomberg et al (1999)>>https://doi.org/10.1002/(SICI)1097-0134(19991115)37:3%3C379::AID-PROT6%3E3.0.CO;2-K]]. In this use case, the interaction property we consider is the electrostatic potential. In the PIPSA method, interaction potential fields are discretised on three dimensional grids, and compared at grid points lying within a skin surrounding the proteins. The skin of the protein is defined as the region beginning at a distance {{formula}}\sigma{{/formula}} from the van-der-Waals surface of the protein's atoms with a thickness  {{formula}}\delta{{/formula}}. Commonly used parameters for this skin are {{formula}}\sigma = 3\ \mathrm\text{\AA}{{/formula}} and {{formula}}\delta = 4 \ \mathrm\text{\AA}{{/formula}}. The region to be compared can be further restricted as the intersection of this skin with a sphere or as cone (as shown in the figure below).
15 15  
16 +(% style="text-align:center" %)
17 +[[image:protein_surface.png]]
18 +
16 16  ~* Similarity measure: The interaction fields of two structures can be compared quantitatively by calculating pairwise similarity indices. One such measure is the
17 17  Hodgkin index, which is given by:
18 18  
19 -where *(p1,p2)(p1,p2)* is the scalar product of the fields surrounding proteins 1 and 2 in the analysis region. They can be calculated as:
22 +(% style="text-align:center" %)
23 +[[image:SI12_equation.png]]
20 20  
21 -~|
25 +where {{formula}}(p1,p2)(p1,p2){{/formula}} is the scalar product of the fields surrounding proteins 1 and 2 in the analysis region. They can be calculated as:
22 22  
23 -where  *i*, *j* and *k* are the three-dimensional spatial coordinates, and  :math:`ϕ_1(i,j,k)ϕ_2(i,j,k)`   is the potential at point  *(i,j,k)*  on the grid for the template protein. The similarity index runs from  −1 , for completely anti-correlated potentials, to  1  for identical potentials. At 0 , there is no correlation between the two potentials. The pairwise similarity can be converted to a distance measure, for use in clustering, using
27 +(% style="text-align:center" %)
28 +[[image:grafik.png||height="78" width="298"]]
24 24  
25 -Input Data
30 +where  //i//, //j// and //k// are the three-dimensional spatial coordinates, and {{formula}}ϕ_1(i,j,k)ϕ_2(i,j,k){{/formula}} is the potential at point  (//i,j,k//)  on the grid for the template protein. The similarity index runs from  −1 , for completely anti-correlated potentials, to  1  for identical potentials. At 0 , there is no correlation between the two potentials. The pairwise similarity can be converted to a distance measure, for use in clustering, using
26 26  
27 -In this use case, we use as our input structure a structure of the catalytic domain of the enzyme adenylyl cyclase 5 (AC5), modelled during the work described in |doi_tong|.
32 +(% style="text-align:center" %)
33 +[[image:D12_equation.png]]
34 +
35 +== Input Data ==
36 +
37 +In this use case, we use as our input structure a structure of the catalytic domain of the enzyme adenylyl cyclase 5 (AC5), modelled during the work described in [[Tong et al (2016)>>https://doi.org/10.1002/prot.25167]].
28 28  The structures of the AC isoforms were created via homology modelling using the same template. The region where there are significant structural differences between the isoforms is in a flexible loop region that was not defined in the template structure. There are also variations in sequence length across AC isoforms in this region.
29 29  
30 -Procedure
40 +== Procedure ==
31 31  
32 32  ~* Structure of AC5 is visualized. The catalytic domain of AC5 is a dimer consisting of two protein chains. In the full structure of AC5 these are two chains connected by a series of transmembrane helices that anchor the protein in the post-synaptic membrane.
33 33  
34 -~* Then the Pdb2Pqr method is used to generate hydrogen atoms in the protein structure. Proteins contain a number of ionisable amino acids, which can exist in different protonation states, depending on the pH of the solution they are in. PDB2PQR can predict the states of these amino acids, at a given pH (defined as 7.4, a normal physiological pH), then add all missing hydrogen atoms to the structure, and assign atomic charges and radii to all atoms. By default, multipipsa assigns charges and radii from the Amber force field.
44 +~* Then the PDB2PQR method is used to generate hydrogen atoms in the protein structure. Proteins contain a number of ionisable amino acids, which can exist in different protonation states, depending on the pH of the solution they are in. PDB2PQR can predict the states of these amino acids, at a given pH (defined as 7.4, a normal physiological pH), then add all missing hydrogen atoms to the structure, and assign atomic charges and radii to all atoms. By default, multipipsa assigns charges and radii from the Amber force field.
35 35  
36 36  ~* The  APBS method is used to solve the linearised Poisson-Boltzmann equation to obtain the electrostatic potential in the dx and UHBD file formats. It also creates a dx file describing the solvent excluded volume of AC5. This is used for visualisation later.
37 37  
38 38  ~* Finally, electrostatic similarity between AC isoforms is computed.
39 39  
40 -Results
50 +== Results ==
41 41  
42 -
43 43  Visualization of the AC5 dimer structure:
44 44  
45 -.. figure:: system_electrostratics_2.png
54 +(% style="text-align:center" %)
55 +[[image:system_electrostratics_2.png]]
46 46  
47 -~|
48 48  
49 49  The MultiPIPSA analysis creates as output an image file showing the pairwise distances between AC isoforms as a 2D heatmap. The results are also clustered using a single linkage hierarchical method. The resulting dendograms are shown along the edges of the heatmap.
50 50  
60 +(% style="text-align:center" %)
61 +[[image:clustering_electrostratics.png]]
51 51  
52 -.. figure:: clustering_electrostratics.png
63 +== Related Jupyter Notebook ==
53 53  
65 +Please note: For technical reasons the notebooks currently are not running live in the collaboratory.
54 54  
67 +{{jupyterlink}}
68 +https://drive.ebrains.eu/lib/dc76fd2c-0817-475a-afd0-c38eb48d2da1/file/JN/Compare_the_electrostatic_potentials_surrounding_a_set_of_protein_isoforms_with_multipipsa.ipynb
69 +{{/jupyterlink}}
55 55  
56 56  
57 -
58 -