[gmx-users] Restarting a REST2 simulation
Joseph, Benjamin Philipp
benjamin.joseph at rwth-aachen.de
Thu Apr 9 11:23:47 CEST 2020
Dear members of the mailing list,
I restarted my replica exchange with solute tempering (REST2) simulation (16 replicas on 20 = 480 cores) with the following command:
srun gmx_mpi mdrun -plumed plumed.dat -s topol.tpr -multidir rep0 rep1 rep2 rep3 rep4 rep5 rep6 rep7 rep8 rep9 rep10 rep11 rep12 rep13 rep14 rep15 -replex 10000 -hrex -cpi state.cpt -append
and am running into problems as the simulation does not run many steps after the restart. I get the following error message:
GROMACS: gmx mdrun, version 2018.3
Executable: /usr/local/software/jureca/Stages/Devel-2018b/software/GROMACS/2018.3-intel-para-2018b-plumed/bin/gmx_mpi
Data prefix: /usr/local/software/jureca/Stages/Devel-2018b/software/GROMACS/2018.3-intel-para-2018b-plumed
Working dir: /p/scratch/cias-5/joseph1/new/S_1/sys1/neu/topos
Command line:
gmx_mpi mdrun -plumed plumed.dat -s topol.tpr -multidir rep0 rep1 rep2 rep3 rep4 rep5 rep6 rep7 rep8 rep9 rep10 rep11 rep12 rep13 rep14 rep15 -replex 10000 -hrex -cpi state.cpt -append
simulation part is not equal for all subsystems
subsystem 0: 4
subsystem 1: 4
subsystem 2: 4
subsystem 3: 4
subsystem 4: 4
subsystem 5: 4
subsystem 6: 4
subsystem 7: 4
subsystem 8: 4
subsystem 9: 3
subsystem 10: 3
subsystem 11: 4
subsystem 12: 3
subsystem 13: 4
subsystem 14: 4
subsystem 15: 4
-------------------------------------------------------
Program: gmx mdrun, version 2018.3
Source file: src/gromacs/mdlib/main.cpp (line 115)
MPI rank: 0 (out of 480)
-------------------------------------------------------
Program: gmx mdrun, version 2018.3
Source file: src/gromacs/mdlib/main.cpp (line 115)
MPI rank: 90 (out of 480)
Fatal error:
-------------------------------------------------------
...
-------------------------------------------------------
Program: gmx mdrun, version 2018.3
Source file: src/gromacs/mdlib/main.cpp (line 115)
MPI rank: 300 (out of 480)
Fatal error:
The 16 subsystems are not compatible
When I look at every md.log file they all stopped at the same simulation step and so I do not understand why they all are not at the same simulation part. Thanks a lot in advance for your help!
Best regards,
Benjamin
More information about the gromacs.org_gmx-users
mailing list