[gmx-developers] Re: GROMACS 4.6 pre-release: new algorithms, parallelization schemes, & GPU acceleration

Mirco Wahab mirco.wahab at chemie.tu-freiberg.de
Mon Mar 5 23:19:05 CET 2012


Am 05.03.2012 22:42, schrieb Berk Hess:
> All you need to add to make the gpu work, if you compiled with gpu
> support is:
> to your mdp file:
> cutoff-scheme = verlet

OK, lets see. I removed (from mdp) completely
     vdw_type
     coulombtype

and inserted:
     cutoff-scheme = verlet

then, generated a new tpr,
     ...
     "Set rlist to 1.37 nm, buffer size 0.17 nm"
     ...
and

     c:\> mdrun -nt 1 -nb gpu -v

and *VOILA*, I can't believe this, it *does* work
- Win7/x64
- Cuda4.1
- VS2010/SP1
- fftw3f
- gsl/gslcblas
- libxml2

A first test set of ~ 1 million particles (most MARTINI W,
~10,000 lipids) will run on a i7/920/(@3GHz) at about 11.5 GFlops
with "*-nt 8*" (15.9 ns/day). On a stock GT-550 in the same
machine (*-nt 1 -nb gpu*), it reaches  14.4 GFlops (23.5 ns/day).

If I got some more time, I'll test the build against CUDA 4.0,
according to Szilárds hint (possible performance drop in 4.1).

I'm really impressed.

Thanks & regards

M.



More information about the gromacs.org_gmx-developers mailing list