[gmx-users] GPU job often stopped

Albert mailmd2011 at gmail.com
Mon Apr 29 15:32:51 CEST 2013


On 04/29/2013 03:31 PM, Szilárd Páll wrote:
> The segv indicates that mdrun crashed and not that the machine was
> restarted. The GPU detection output (both on stderr and log) should
> show whether ECC is "on" (and so does the nvidia-smi tool).
>
> Cheers,
> --
> Szilárd

Yes, ECC was on for all four Tesla K20m cards selected for the run:


Reading file heavy.tpr, VERSION 4.6.1 (single precision)
Using 4 MPI threads
Using 8 OpenMP threads per tMPI thread

5 GPUs detected:
   #0: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible
   #1: NVIDIA GeForce GTX 650, compute cap.: 3.0, ECC:  no, stat: compatible
   #2: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible
   #3: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible
   #4: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible

4 GPUs user-selected for this run: #0, #2, #3, #4
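
For anyone who wants to check this across many logs, the ECC state of the selected GPUs could be pulled out with a small script along these lines. It is only a sketch: the regular expressions assume the 4.6.1 detection output exactly as pasted above, and the names parse_gpus and parse_selected are made up for illustration, not part of GROMACS.

```python
import re

# Detection block copied from the log above.
SAMPLE = """\
5 GPUs detected:
   #0: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible
   #1: NVIDIA GeForce GTX 650, compute cap.: 3.0, ECC:  no, stat: compatible
   #2: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible
   #3: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible
   #4: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible

4 GPUs user-selected for this run: #0, #2, #3, #4
"""

# One detected-GPU line; the format is assumed from the output above.
GPU_LINE = re.compile(
    r"#(?P<id>\d+):\s*(?P<name>.+?),\s*compute cap\.:\s*(?P<cc>[\d.]+),"
    r"\s*ECC:\s*(?P<ecc>yes|no),\s*stat:\s*(?P<stat>\w+)"
)


def parse_gpus(text):
    """Return {gpu_id: {"name", "cc", "ecc", "stat"}} for each detected GPU."""
    gpus = {}
    for m in GPU_LINE.finditer(text):
        gpus[int(m.group("id"))] = {
            "name": m.group("name"),
            "cc": m.group("cc"),
            "ecc": m.group("ecc") == "yes",
            "stat": m.group("stat"),
        }
    return gpus


def parse_selected(text):
    """Return the GPU ids listed on the 'user-selected' line, if present."""
    m = re.search(r"GPUs? user-selected for this run:\s*(.*)", text)
    return [int(x) for x in re.findall(r"#(\d+)", m.group(1))] if m else []


if __name__ == "__main__":
    gpus = parse_gpus(SAMPLE)
    for gpu_id in parse_selected(SAMPLE):
        info = gpus[gpu_id]
        print(f"GPU #{gpu_id} ({info['name']}): ECC {'on' if info['ecc'] else 'off'}")
```

To run it on a real log, read the file contents (md.log or the captured stderr) into a string and pass that in place of SAMPLE.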



