[gmx-users] Umbrella Sampling Results and Issues

Agnivo Gosai agnivogromacs14 at gmail.com
Mon Nov 24 19:03:16 CET 2014


Dear Users

I am doing an umbrella sampling MD to calculate the binding energy between
a DNA and a protein. I am following Dr. Lemkuls tutorial for the same.

Firstly I put the complex in a rectangular box and used the
AMBER99SB-Parmbsc0 forcefield for the topology generation. Then I used SD
for minimization , 1 ns restrained NVT equilibration ( V-rescale , T = 300
K) , 1 ns restrained NPT equilibration ( Berendsen couplings in both , ref
Press = 1 bar , however achieved average press = 0.4 bar with very low
error , RMSD ) , 500 ps Steered Molecular Dynamics keeping DNA fixed and
pulling Protein ( Using Nose-Hoover thermostat , Parinello - Rahman
barostat ).

I pull for a total of 5 nm.

This is the relevant mdp entry for the SMD

; Pull code
pull            = umbrella
pull_geometry   = distance  ; simple distance increase
pull_dim        = N N Y     ; Z axis reaction coordinate
pull_start      = yes       ; define initial COM distance > 0
pull_ngroups    = 1         ; applying puling force to protein only
pull_group0     = DNA       ; reference fixed group for pulling
pull_group1     = Protein   ; the molecule to which pulling force is applied
pull_rate1      = 0.01      ; 0.01 nm per ps = 10 nm per ns
pull_k1         = 1000      ; kJ mol^-1 nm^-2 force constant for pulling

After this I extracted the conf.gro files and using the perl/python scripts
from the tutorial extracted 21 windows (0.2 nm spacing) from the SMD
trajectory for the umbrella sampling. In each window I did a 100 ps short
NPT equilibration restraining only the DNA and then an 1 ns umbrella
sampling simulation restraining the DNA with the following mdp entry :

; Pull code
pull            = umbrella
pull_geometry   = distance
pull_dim        = N N Y
pull_start      = yes
pull_ngroups    = 1
pull_group0     = DNA
pull_group1     = Protein
pull_init1      = 0
pull_rate1      = 0.0
pull_k1         = 1000      ; kJ mol^-1 nm^-2
pull_nstxout    = 100      ; every 0.2 ps
pull_nstfout    = 100      ; every 0.2 ps

Then I ran the g_wham tool on the results.

I am attaching the PMF profile and the Histogram for reference :

​
 PMF
<https://docs.google.com/file/d/0B-U8uULVZjfRaHE4STc2QVo3VmM/edit?usp=drive_web>
​​
 HISTOGRAM
<https://docs.google.com/file/d/0B-U8uULVZjfRZ2IxbXZSeTc1QUE/edit?usp=drive_web>
​
As observed , the PMF is having 3 steps for particular reaction coordinates
and at those regions the Histogram is also not showing ideal overlapping
behaviour.

This is the ERROR output of g_wham :

WARNING, no data point in bin 26 (z=3.04567) !
You may not get a reasonable profile. Check your histograms!

The error file is also attached :

​
 WHAM1_ERROR.txt
<https://docs.google.com/file/d/0B-U8uULVZjfRbkllV1hZLXR6MjQ/edit?usp=drive_web>
​
So, my questions are :

Should I decrease the nm spacing to say 0.1 nm around those particular
reaction coordinates where the "no data point in bin" error is coming ? ?
Should I be going for asymmetric windows

OR , should I simultaneously increase my simulation time for umbrella
sampling to say 5 ns from 1 ns ?

Kindly suggest

------------------------------------------------------------------------


Thanks & Regards
Agnivo Gosai
Grad Student, Iowa State University.


More information about the gromacs.org_gmx-users mailing list