[gmx-developers] GPU memory issue?

Justin A. Lemkul jalemkul at vt.edu
Mon Mar 12 14:20:37 CET 2012


Hello all,

We have a new GPU workstation equipped with a Tesla C2075.  I installed Gromacs 
4.5.5 and tried to run the DHFR benchmark but got a fatal error that the 
pre-simulation memory test had failed.  This was the first time anything had 
been run on the GPU, so it struck me as odd.  I saw the discussion on this list 
a few days ago regarding memory exhaustion, so I used Szilard's script to see 
what was going on.  I got the following:

[2012-03-12 08:37:36]  Memory usage: virt      10736	res       4920
[2012-03-12 08:37:46]  Memory usage: virt     148428	res      71188
[2012-03-12 08:37:56]  Memory usage: virt     148428	res      71188
...
[2012-03-12 08:52:18]  Memory usage: virt     148428	res      71188
[2012-03-12 08:52:28]  Memory usage: virt     148428	res      71188
[2012-03-12 08:52:38]  Memory usage: virt     149348	res      72256
[2012-03-12 08:52:48]  Memory usage: virt     149348	res      72256
[2012-03-12 08:52:58]  Memory usage: virt     149348	res      72256
[2012-03-12 08:53:08]  Memory usage: virt     149348	res      72256
[2012-03-12 08:53:18]  Memory usage: virt     149348	res      72256
[2012-03-12 08:53:28]  Memory usage: virt     149348	res      72256
...
(and onward to completion)

Is this increase in memory usage indicative of a problem?  If so, is there 
anything that can be done?  I imagine that the bump in the beginning is simply 
an effect of the simulation starting up, or is that not correct?

Looking at the observables in the simulation, there seems to be nothing wrong - 
no abnormal structures or RMSD spikes, energy terms are very stable, etc. 
Should I be looking at other things?  The error that comes up about failing the 
memory test seems to indicate that I should expect catastrophic problems, so I'd 
like to fully investigate any issues before we invest serious time in doing data 
collection.

I would also note that on the Gromacs website, it states that the Tesla C2075 is 
supported, but it is not present in the hard-coded list.  Thus I have to use 
"force-device=yes" in my mdrun-gpu command.  I suppose this is something to be 
fixed, if it hasn't already been.

-Justin

-- 
========================================

Justin A. Lemkul
Ph.D. Candidate
ICTAS Doctoral Scholar
MILES-IGERT Trainee
Department of Biochemistry
Virginia Tech
Blacksburg, VA
jalemkul[at]vt.edu | (540) 231-9080
http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin

========================================



More information about the gromacs.org_gmx-developers mailing list