[gmx-users] Gromacs GPU system question

Mirco Wahab mirco.wahab at chemie.tu-freiberg.de
Sun Jun 23 00:05:20 CEST 2013


On 22.06.2013 22:18, Mare Libero wrote:
> The vendor I contacted was pushing for one of the
> high end i7 processors with hyper-threading. But from what I can read,
> most of the MD software don't make any use of it. So, using a the
> multi-cores AMD (like your  FX-8350) can be a cheaper and more
> advantageous option.

Your vendor is, in my opinion, right. The AMD consumer multicores
(Piledriver) aren't actually eight-core cpus, but rather similar
to 4 core cpus (they are called 'modules').

For testing a user-defined potential, I once compiled performance
figures over a range of actual commodity hardware (available to me).
These are all workstations and usually overclocked somehow by the
students (but only if there's no crash at all in a year ;-)
This is all *without GPU*, only the plain and raw CPU processing
power for Gromacs is checked for (last column).

  - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

  Test case:
  Two coarse-grained implicit-solvent vesicles bumping into each other
  SD integrator
  480,000 particles
  Box (110nm)³
  User-defined potential (rc=0.8225nm)
  dt=0.020ps
   --------------------------------------------
     CPU            	Arch	Cores	ns/day
   --------------------------------------------
   - X6/1090T;3.3GHz	SSE2	6C/6T	19.130
   - FX-8350;4.5GHz	AVX_FMA	4M/8T	34.175
   - i7/2600K;4.2GHz	AVX_256	4C/8T	39.073
   - i7/3770K;4.4GHz	AVX_265	4C/8T	41.931
   - i7/3930K;4.2GHz	AVX_256 6c/12T	56.891

  - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

You can see here, for CPU performance, you can't
really choose anything different from the 6-core i7/3930K.
It costs some bucks more than the 4-core-CPUs but will run
significantly faster the time you use it.

> Most of what we do is protein-protein interactions and protein stability
> studies with explicit water/ions. One of our projects now has <100,000
> atoms in a 100 Ang water box (7,800 protein atoms + 67,000 water). It's
> difficult to be more specific on the parameters since each project is
> different, but in general we do not deviate much from a standard NPT run.

10nm box/75K atoms is not very large. I guess you'd use a time step
of 0.002 ps and a united atom model + spc or spc/e water? 100ns/day
seem possible with any GPU from GTX-660 or higher. If you buy a mighty
GPU (Titan), the question will be: can your n-core-CPU saturate such a
fast GPU monster? A good compromise would be, probably, the GTX-780
which is a slightly reduced Titan for half the price and all options
open.

my € 0.02

M.




More information about the gromacs.org_gmx-users mailing list