[gmx-users] Segmentation fault error from mdrun

Mark Abraham Mark.Abraham at anu.edu.au
Wed Dec 7 23:08:13 CET 2011


On 8/12/2011 7:36 AM, rainy908 wrote:
> Hi,
>
> I encounter the following error when trying to execute mdrun:
>
> # Running Gromacs: read TPR and write output to /gpfs disk
>   $MPIRUN  $MDRUN -v -nice 0 -np $NSLOTS \
>   -s n12_random_50_protein_all.tpr \
>   -o n12_random_50_protein_all.trr \
>   -c n12_random_50_protein_all.gro \
>   -g n12_random_50_protein_all.log \
>   -x n12_random_50_protein_all.xtc \
>   -e n12_random_50_protein_all.edr

You can save yourself some typing with mdrun -deffnm. Also note that 
mdrun -np does nothing for GROMACS 4.x.

>
> Error:
>
> [compute-0-7:12377] Failing at address: 0x7159fd0
> [compute-0-30:07435] [ 1] mdrun [0x761971]
> [compute-0-30:07435] *** End of error message ***
> [compute-0-29:15535] [ 0] /lib64/libpthread.so.0 [0x39df60e7c0]
> [compute-0-29:15535] [ 1] mdrun [0x761d60]
> [compute-0-29:15535] *** End of error message ***
> [compute-1-29:19799] [ 0] /lib64/libpthread.so.0 [0x33aac0e7c0]
> [compute-1-29:19799] [ 1] mdrun [0x762065]
> [compute-1-29:19799] *** End of error message ***
> [compute-0-29:15537] [ 0] /lib64/libpthread.so.0 [0x39df60e7c0]
> [compute-0-29:15537] [ 1] mdrun [0x762065]
> [compute-0-29:15537] *** End of error message ***
> [compute-0-29:15536] [ 0] /lib64/libpthread.so.0 [0x39df60e7c0]
> [compute-0-29:15536] [ 1] mdrun [0x762065]
> [compute-0-29:15536] *** End of error message ***
> [compute-1-31:11981] [ 0] /lib64/libpthread.so.0 [0x374f00e7c0]
> [compute-1-31:11981] [ 1] mdrun [0x761d60]
> [compute-1-31:11981] *** End of error message ***
> [compute-1-31:11982] [ 0] /lib64/libpthread.so.0 [0x374f00e7c0]
> [compute-1-31:11982] [ 1] mdrun [0x761960]
> [compute-1-31:11982] *** End of error message ***
> [compute-0-29:15538] [ 0] /lib64/libpthread.so.0 [0x39df60e7c0]
> [compute-0-29:15538] [ 1] mdrun [0x761960]
> [compute-0-29:15538] *** End of error message ***
> [compute-0-7:12377] [ 0] /lib64/libpthread.so.0 [0x387c60e7c0]
> [compute-0-7:12377] [ 1] mdrun [0x729641]
> [compute-0-7:12377] *** End of error message ***
> [compute-1-29:19796] [ 0] /lib64/libpthread.so.0 [0x33aac0e7c0]
> [compute-1-29:19796] [ 1] mdrun [0x762065]
> [compute-1-29:19796] *** End of error message ***
> [compute-1-31.local][[50630,1],32][btl_tcp_frag.c:216:mca_btl_tcp_frag_recv] mca_btl_tcp_frag_recv: readv failed: Connection reset by peer (104)
> --------------------------------------------------------------------------
> mpirun noticed that process rank 35 with PID 32477 on node compute-1-8.local exited on signal 11 (Segmentation fault).
> --------------------------------------------------------------------------
>
> This is a parallel job that caused segmentation fault on compute-1-8, thus causing the entire job to fail.

You need to look at stderr, stdout (some of which are above) and the 
.log file to find out what GROMACS thought caused the crash. You also 
need to use an mpi-enabled mdrun.

Mark



More information about the gromacs.org_gmx-users mailing list