[gmx-users] GROMACS 5.1.2 performance

Neha Gandhi n.gandhiau at gmail.com
Wed Dec 14 12:45:02 CET 2016


Dear list,

I think the question on gromacs performance on cpu cluster has been raised
many times in the mailing list. My apologies for reiterating the question.
I am using a system ~80000 atoms with virtual sites (hence timestep of 4
fs). The job hasn't completed yet but it seems that the jobs are running
slower than my experience running with older versions of GROMACS.

Here is the job script
#!/bin/bash -l
#PBS -N sgk
#PBS -l walltime=24:00:00
#PBS -l select=6:ncpus=16:mpiprocs=16:mem=20gb
#PBS -j oe

export OMP_NUM_THREADS=1

module purge
module load gromacs/5.1.2-foss-2016a-hybrid
mpirun -np 96 gmx_mpi mdrun -v -deffnm npt


and the output log indicates load imbalance :

Number of logical cores detected (48) does not match the number reported by
OpenMP (10).
Consider setting the launch configuration manually!

Running on 6 nodes with total 248 logical cores
  Logical cores per node:   40 - 48
Hardware detected on host cl3n073 (the node of MPI rank 0):
  CPU info:
    Vendor: GenuineIntel
    Brand:  Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz
    Family:  6  model: 63  stepping:  2
    CPU features: apic avx avx2 clfsh cmov cx8 cx16 f16c fma htt lahf_lm
mmx msr nonstop_tsc pcid pclmuldq pdcm pdpe1gb popcnt pse rdrnd rdtscp sse2
sse3 sse4.1 sse4.2 ssse3 tdt x2apic
    SIMD instructions most likely to fit this hardware: AVX_256 - AVX2_256
    SIMD instructions selected at GROMACS compile time: AVX_256


DD  step 34999  vol min/aver 0.553  load imb.: force 46.2%  pme mesh/force
0.506

           Step           Time         Lambda
          35000       70.00000        0.00000

   Energies (kJ/mol)
       G96Angle    Proper Dih.  Improper Dih.          LJ-14     Coulomb-14
    9.21820e+03    6.85434e+03    2.03852e+03   -4.52887e+01    8.24856e+04
        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.      Potential
    1.76075e+05   -4.85764e+03   -1.52098e+06    3.15077e+03   -1.24606e+06
    Kinetic En.   Total Energy    Temperature Pres. DC (bar) Pressure (bar)
    2.19009e+05   -1.02705e+06    2.99843e+02   -8.61175e+01   -1.48459e+02
   Constr. rmsd
    2.75565e-05

DD  step 39999  vol min/aver 0.475  load imb.: force 89.0%  pme mesh/force
0.948

           Step           Time         Lambda
          40000       80.00000        0.00000

   Energies (kJ/mol)
       G96Angle    Proper Dih.  Improper Dih.          LJ-14     Coulomb-14
    9.32970e+03    6.83305e+03    2.07167e+03   -2.67360e+01    8.24534e+04
        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.      Potential
    1.78434e+05   -4.86516e+03   -1.52332e+06    3.15600e+03   -1.24593e+06
    Kinetic En.   Total Energy    Temperature Pres. DC (bar) Pressure (bar)
    2.19647e+05   -1.02628e+06    3.00717e+02   -8.63844e+01    7.45115e+01
   Constr. rmsd
    2.80701e-05

DD  step 44999  vol min/aver 0.704  load imb.: force 106.3%  pme mesh/force
0.720

           Step           Time         Lambda
          45000       90.00000        0.00000

   Energies (kJ/mol)
       G96Angle    Proper Dih.  Improper Dih.          LJ-14     Coulomb-14
    9.30472e+03    6.53990e+03    2.08250e+03    1.40282e+01    8.27162e+04
        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.      Potential
    1.77677e+05   -4.87223e+03   -1.52383e+06    3.05237e+03   -1.24731e+06
    Kinetic En.   Total Energy    Temperature Pres. DC (bar) Pressure (bar)
    2.19610e+05   -1.02770e+06    3.00666e+02   -8.66354e+01    4.81709e+01
   Constr. rmsd
    2.77685e-05

Writing checkpoint, step 46075 at Wed Dec 14 21:28:11 2016


DD  step 49999  vol min/aver 0.543  load imb.: force 16.6%  pme mesh/force
0.848

           Step           Time         Lambda
          50000      100.00000        0.00000

   Energies (kJ/mol)
       G96Angle    Proper Dih.  Improper Dih.          LJ-14     Coulomb-14
    9.09212e+03    6.76222e+03    1.95816e+03   -6.36247e+01    8.23450e+04
        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.      Potential
    1.77732e+05   -4.87147e+03   -1.52371e+06    3.15838e+03   -1.24760e+06
    Kinetic En.   Total Energy    Temperature Pres. DC (bar) Pressure (bar)
    2.18646e+05   -1.02895e+06    2.99346e+02   -8.66085e+01    1.49550e+01
   Constr. rmsd
    2.95505e-05

I appreciate your feedback.

Regards,
Neha


More information about the gromacs.org_gmx-users mailing list