[gmx-users] Issue with regression test.

Raymond Arter raymondarter at gmail.com
Wed Aug 1 12:49:26 CEST 2018


Dear All,

I'm building Gromacs 2018.2 and I've run in to a small issue that I would
like feed back on.

One of the regression tests fails with the following entered in to the log
file.

----
Mdrun cannot use the requested (or automatic) number of ranks, retrying
with 8.

Abnormal return value for ' gmx mdrun    -nb cpu   -notunepme >mdrun.out
2>&1' was 1
Retrying mdrun with better settings...

Abnormal return value for ' gmx mdrun -ntmpi 12      -notunepme >mdrun.out
2>&1' was 1
Retrying mdrun with better settings...

Abnormal return value for ' gmx mdrun -ntmpi 6      -notunepme >mdrun.out
2>&1' was -1
FAILED. Check mdrun.out, md.log file(s) in nbnxn_vsite for nbnxn_vsite
Re-running orientation-restraints using CPU-based PME
Re-running pull_geometry_angle using CPU-based PME
Re-running pull_geometry_angle-axis using CPU-based PME
Re-running pull_geometry_dihedral using CPU-based PME
----

However, on the advice of a colleague I run the following command in the
directory:

    gmx mdrun -ntmpi 4 -notunepme

And got the following result:

----
Reading file topol.tpr, VERSION 2018.2 (single precision)
Non-default thread affinity set, disabling internal thread affinity
Can not increase nstlist because verlet-buffer-tolerance is not set or used

Using 4 MPI threads
Using 3 OpenMP threads per tMPI thread

On host ****** 4 GPUs auto-selected for this run.
Mapping of GPU IDs to the 4 GPU tasks in the 4 ranks on this node:
  PP:0,PP:1,PP:2,PP:3

Back Off! I just backed up traj.trr to ./#traj.trr.1#

Back Off! I just backed up ener.edr to ./#ener.edr.1#
starting mdrun 'Protein'
20 steps,      0.1 ps.

step 20 Turning on dynamic load balancing, because the performance loss due
to load imbalance is 3.3 %.

Writing final coordinates.

Back Off! I just backed up confout.gro to ./#confout.gro.1#

 Dynamic load balancing report:
 DLB was turned on during the run due to measured imbalance.
 Average load imbalance: 13.0%.
 The balanceable part of the MD step is 25%, load imbalance is computed
from this.
 Part of the total run time spent waiting due to load imbalance: 3.3%.
 Steps where the load balancing was limited by -rdd, -rcon and/or -dds: X 0
% Y 0 %

NOTE: 6 % of the run time was spent in domain decomposition,
      17 % of the run time was spent in pair search,
      you might want to increase nstlist (this has no effect on accuracy)

               Core t (s)   Wall t (s)        (%)
       Time:        1.932        0.161     1200.0
                 (ns/day)    (hour/ns)
Performance:       56.337        0.426
----

Is this just a problem with the regression test and the build of Gromacs is
fine, or
is there a problem with the build I have done?

Thanks in advance.

R.


More information about the gromacs.org_gmx-users mailing list