[gmx-users] Problems with REMD in Gromacs 4.6.3

gigo gigo at ibb.waw.pl
Fri Jul 12 04:59:44 CEST 2013


Dear GMXers,
With Gromacs 4.6.2 I was running REMD with 144 replicas. The replicas were 
separate MPI processes of course (OpenMPI 1.6.4), and I ran each replica 
on 4 cores with OpenMP. The cluster runs Torque and is built of 12-core 
nodes, so I used the following script:

#!/bin/tcsh -f
#PBS -S /bin/tcsh
#PBS -N test
#PBS -l nodes=48:ppn=12
#PBS -l walltime=300:00:00
#PBS -l mem=288Gb
#PBS -r n
cd $PBS_O_WORKDIR
mpiexec -np 144 --loadbalance mdrun_mpi -v -cpt 20 -multi 144 -ntomp 4 \
    -replex 2000
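
(For completeness: the OpenMP thread count can also be set through the 
environment instead of -ntomp, which should be equivalent here, although 
I have only run the -ntomp variant myself. In tcsh that would be:

setenv OMP_NUM_THREADS 4
mpiexec -np 144 --loadbalance mdrun_mpi -v -cpt 20 -multi 144 -replex 2000

)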

It was working just great with 4.6.2, but it does not work with 4.6.3. The 
new version was compiled with the same options in the same environment. 
Mpiexec spreads the replicas evenly over the cluster and each replica forks 
4 threads, but only one of those threads uses any CPU. The log files end 
right after the literature citations. Some empty energy and trajectory 
files are created, but nothing is ever written to them.
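
In case it helps, this is roughly how I checked the per-thread CPU usage 
on a compute node (standard Linux tools; <PID> stands for the process id 
of one mdrun_mpi rank):

top -H -p <PID>                      # one line per thread
ps -L -o pid,lwp,psr,pcpu -p <PID>   # per-thread %CPU

All 4 threads of a replica are listed, but only one of them shows any 
non-zero %CPU.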
Please let me know if you have any immediate suggestion on how to make 
it work (perhaps based on some difference between the two versions), or 
whether I should file a bug report with all the technical details.
Best Regards,

Grzegorz Wieczorek

PS. I am sending this message for the third time; it did not appear on 
the list the previous two times. Just in case, sorry for the spam.


