[gmx-users] why Blue Gene/Q is so slow?
DeChang Li
li.dc06 at gmail.com
Tue Jul 17 09:00:19 CEST 2012
Dear all,
I am running a 9000 atom system with GBSA (Gromacs 4.5.5) in a
Blue Gene/Q cluster. I got the speed 1.002 ns/day with 8 cores.
However, in my own workstation with 8 cores the same system can reach
nearly 10 ns/day (Intel(R) Xeon(R) CPU E5620 @ 2.40GHz). Can anyone
tell me what's wrong in my simulation? Any suggestion will be
appreciated.
Following is my md.mdp file:
constraints = hbonds
constraint_algorithm = LINCS
lincs_order = 4
comm_mode = Angular
comm_grps = system
integrator = sd
;annealing = single single
;annealing_npoints = 2 2
;annealing_time = 0 500 0 500
;annealing_temp = 200 300 200 300
dt = 0.002 ; ps !
nsteps = 5000000 ; total 5000 ps.
nstcomm = 10
nstcalcenergy = 10
nstxout = 10000 ; collect data every 1 ps
nstenergy = 10000
nstvout = 10000
nstlog = 1000
;nstxtcout = 50000
;xtc_grps = system
nstfout = 0
nstlist = 10
ns_type = grid
pbc = no
rlist = 1.2
coulombtype = cut-off
rcoulomb = 1.2
rvdw = 1.2
fourierspacing = 0.12
fourier_nx = 0
fourier_ny = 0
fourier_nz = 0
pme_order = 4
ewald_rtol = 1e-5
optimize_fft = yes
;energygrps = alpha1 alpha2 alpha3 beta1 beta2 beta3 gamma
;DispCorr = EnerPres
; Berendsen temperature coupling is on in two groups
Tcoupl =
tau_t = 0.5
tc-grps = system
ref_t = 300
; Pressure coupling is on
Pcoupl = no ;berendsen
tau_p = 1.0
compressibility = 4.5e-5
ref_p = 1.0
; Generate velocites is on at 300 K.
gen_vel = yes
gen_temp = 300
gen_seed = -1
implicit_solvent = GBSA
gb_algorithm = OBC
rgbradii = 1.2
sa_surface_tension = 2.25936
Here is the preformace info:
M E G A - F L O P S A C C O U N T I N G
RF=Reaction-Field FE=Free Energy SCFE=Soft-Core/Free Energy
T=Tabulated W3=SPC/TIP3p W4=TIP4p (single or pairs)
NF=No Forces
Computing: M-Number M-Flops % Flops
-----------------------------------------------------------------------------
Generalized Born Coulomb 61.482892 2951.179 0.4
GB Coulomb + LJ 2565.481100 156494.347 19.4
Outer nonbonded loop 152.268546 1522.685 0.2
1,4 nonbonded interactions 116.143224 10452.890 1.3
Born radii (HCT/OBC) 2868.222234 524884.669 64.9
Born force chain rule 2868.222234 43023.334 5.3
NS-Pairs 516.814696 10853.109 1.3
Reset In Box 4.464788 13.394 0.0
CG-CoM 4.482576 13.448 0.0
Bonds 22.174434 1308.292 0.2
Angles 80.586114 13538.467 1.7
Propers 160.742142 36809.951 4.6
Virial 4.636254 83.453 0.0
Update 44.478894 1378.846 0.2
Stop-CM 4.455894 44.559 0.0
Calc-Ekin 44.487788 1201.170 0.1
Lincs 44.951630 2697.098 0.3
Lincs-Mat 261.822552 1047.290 0.1
Constraint-V 44.951630 359.613 0.0
Constraint-Vir 2.251163 54.028 0.0
-----------------------------------------------------------------------------
Total 808731.820 100.0
-----------------------------------------------------------------------------
D O M A I N D E C O M P O S I T I O N S T A T I S T I C S
av. #atoms communicated per step for force: 2 x 660.5
av. #atoms communicated per step for LINCS: 2 x 34.3
Average load imbalance: 1.7 %
Part of the total run time spent waiting due to load imbalance: 1.4 %
R E A L C Y C L E A N D T I M E A C C O U N T I N G
Computing: Nodes Number G-Cycles Seconds %
-----------------------------------------------------------------------
Domain decomp. 8 502 59.421 37.1 0.5
DD comm. load 8 8 0.004 0.0 0.0
Comm. coord. 8 5001 16.575 10.4 0.2
Neighbor search 8 502 136.093 85.1 1.2
Force 8 5001 9744.582 6090.7 88.3
Wait + Comm. F 8 5001 90.905 56.8 0.8
Write traj. 8 2 0.954 0.6 0.0
Update 8 5001 72.936 45.6 0.7
Constraints 8 10002 171.445 107.2 1.6
Comm. energies 8 502 10.427 6.5 0.1
Rest 8 732.742 458.0 6.6
-----------------------------------------------------------------------
Total 8 11036.086 6897.9 100.0
-----------------------------------------------------------------------
Parallel run - timing based on wallclock.
NODE (s) Real (s) (%)
Time: 862.243 862.243 100.0
14:22
(Mnbf/s) (MFlops) (ns/day) (hour/ns)
Performance: 3.047 937.940 1.002 23.946
Finished mdrun on node 0 Tue Jul 17 16:06:48 2012
More information about the gromacs.org_gmx-users
mailing list