[gmx-users] GXM 512 allocation issue
Carlo Camilloni
carlo.camilloni at gmail.com
Mon Jul 4 14:16:07 CEST 2016
Dear GMX users,
the last week I began using a GPU cluster and I am now getting an error I have never seen before:
85 Source code file: /homea/hmu24/hmu240/Codes/gromacs/src/gromacs/utility/smalloc.c, line: 227
86
87 Fatal error:
88 Not enough memory. Failed to realloc 323880 bytes for sort->sort2, sort->sort2=67733e20
89 (called from file /homea/hmu24/hmu240/Codes/gromacs/src/gromacs/domdec/domdec.cpp, line 9215)
90 For more information and tips for troubleshooting, please check the GROMACS
I am using GMX 5.1.2, these are the details of the architecture from the log file
(i am running the simulation using 4 MPI + 6 OpenMP threads using 4 GPUs)
the errors is trigger randomly.
Is this also happening to others?
Precision: single
38 Memory model: 64 bit
39 MPI library: MPI
40 OpenMP support: enabled (GMX_OPENMP_MAX_THREADS = 32)
41 GPU support: enabled
42 OpenCL support: disabled
43 invsqrt routine: gmx_software_invsqrt(x)
44 SIMD instructions: AVX2_256
45 FFT library: fftw-3.3.4-sse2-avx
46 RDTSCP usage: enabled
47 C++11 compilation: disabled
48 TNG support: enabled
49 Tracing support: disabled
50 Built on: Tue 21 Jun 21:34:57 CEST 2016
51 Built by: hmu240 at jrl03 [CMAKE]
52 Build OS/arch: Linux 3.10.0-327.18.2.el7.x86_64 x86_64
53 Build CPU vendor: GenuineIntel
54 Build CPU brand: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz
55 Build CPU family: 6 Model: 63 Stepping: 2
56 Build CPU features: aes apic avx avx2 clfsh cmov cx8 cx16 f16c fma htt lahf_lm mmx msr nonstop_tsc pcid pclmuldq pdcm pdpe1gb popcnt pse rdrnd rdtscp sse2 sse3 sse4.1 sse4.2 ssse3 tdt x2apic
57 C compiler: /usr/bin/cc GNU 4.8.5
58 C compiler flags: -march=core-avx2 -Wextra -Wno-missing-field-initializers -Wno-sign-compare -Wpointer-arith -Wall -Wno-unused -Wunused-value -Wunused-parameter -O3 -DNDEBUG -funroll-all-loop s -fexcess-precision=fast -Wno-array-bounds
59 C++ compiler: /usr/bin/c++ GNU 4.8.5
60 C++ compiler flags: -march=core-avx2 -Wextra -Wno-missing-field-initializers -Wpointer-arith -Wall -Wno-unused-function -O3 -DNDEBUG -funroll-all-loops -fexcess-precision=fast -Wno-array-bound s
61 Boost version: 1.55.0 (internal)
62 CUDA compiler: /usr/local/software/jureca/Stages/2016a/software/CUDA/7.5.18-iccifort-2015.3.187-GCC-4.9.3-2.25/bin/nvcc nvcc: NVIDIA (R) Cuda compiler driver;Copyright (c) 2005-2015 NVIDIA Corp oration;Built on Tue_Aug_11_14:27:32_CDT_2015;Cuda compilation tools, release 7.5, V7.5.17
63 CUDA compiler flags:-gencode;arch=compute_20,code=sm_20;-gencode;arch=compute_30,code=sm_30;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_37,code=sm_37;-gencode;arch=compute_50,code=sm_5 0;-gencode;arch=compute_52,code=sm_52;-gencode;arch=compute_52,code=compute_52;-use_fast_math;; ;-march=core-avx2;-Wextra;-Wno-missing-field-initializers;-Wpointer-arith;-Wall;-Wno-unused-function;- O3;-DNDEBUG;-funroll-all-loops;-fexcess-precision=fast;-Wno-array-bounds;
64 CUDA driver: 7.50
65 CUDA runtime: 7.50
66
67
68 Number of logical cores detected (48) does not match the number reported by OpenMP (6).
69 Consider setting the launch configuration manually!
70
71 Running on 1 node with total 24 cores, 48 logical cores, 4 compatible GPUs
72 Hardware detected on host jrc0489 (the node of MPI rank 0):
73 CPU info:
74 Vendor: GenuineIntel
75 Brand: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz
76 Family: 6 model: 63 stepping: 2
77 CPU features: aes apic avx avx2 clfsh cmov cx8 cx16 f16c fma htt lahf_lm mmx msr nonstop_tsc pcid pclmuldq pdcm pdpe1gb popcnt pse rdrnd rdtscp sse2 sse3 sse4.1 sse4.2 ssse3 tdt x2apic
78 SIMD instructions most likely to fit this hardware: AVX2_256
79 SIMD instructions selected at GROMACS compile time: AVX2_256
80 GPU info:
81 Number of GPUs detected: 4
82 #0: NVIDIA Tesla K80, compute cap.: 3.7, ECC: yes, stat: compatible
83 #1: NVIDIA Tesla K80, compute cap.: 3.7, ECC: yes, stat: compatible
84 #2: NVIDIA Tesla K80, compute cap.: 3.7, ECC: yes, stat: compatible
85 #3: NVIDIA Tesla K80, compute cap.: 3.7, ECC: yes, stat: compatible
Best regards,
Carlo
More information about the gromacs.org_gmx-users
mailing list