[gmx-users] Gromacs on an IBM Power8
D.J.Baker at soton.ac.uk
Fri Sep 30 15:54:38 CEST 2016
Thank you for your detailed reply, and for the attached paper. I'm heading in the right direction -- I have upgraded to gcc v5.1 and installed fftw v3.3.5 with VSX support turned on. On the other hand I'm not that experienced with gromacs, and I would appreciate some advice on using the "gmx mdrun" command, please. In other words I would be keen to try out your suggestions, however I'm floundering a bit re putting sensible mdrun commands together. For the moment I have simply enabled SMT=8 on our 24 core machine and used the "obvious choice" of flags. That is...
mdrun -ntomp 8 -ntmpi 24 ....
So, for example, that gives me circa 29 ns/day using the 2 K40 cards, however as you note, the above command is sub-optimal for both the cpu and the gpu runs. If you could please give me some examples re your recommendations then that would be appreciated.
From: gromacs.org_gmx-users-bounces at maillist.sys.kth.se [mailto:gromacs.org_gmx-users-bounces at maillist.sys.kth.se] On Behalf Of Szilárd Páll
Sent: Wednesday, September 28, 2016 6:41 PM
To: Discussion list for GROMACS users <gmx-users at gromacs.org>
Cc: gromacs.org_gmx-users at maillist.sys.kth.se
Subject: Re: [gmx-users] Gromacs on an IBM Power8
- Use the recently released fftw 3.3.5, it has VSX support (contributed by us, that is by Erik Lindahl)
- Use at least gcc 5.x, it's significantly faster.
- 8 threads/core will never be optimal, 2-4 is best; currently the best way to do that is to use -pinstride
- 1 rank/core is rarely optimal (with GPUs), IIRC from when I last ran on the 2x12c IBM machines, 6-8 ranks were a good balance.
Finally, when it comes to performance, Intel and AVX2 will be hard to beat with the current Power8 chips, but with GPUs combined, we've showed such machines match/beat Haswell, e.g see http://on-demand.gputechconf.com/gtc/2015/presentation/S5504-Szilard-Pall.pdf
On Wed, Sep 28, 2016 at 4:04 PM, Baker D.J. <D.J.Baker at soton.ac.uk> wrote:
> I would appreciate advice on running gromacs on a Power8 machine, please. I recently downloaded and installed gromacs v2016 on one of our IBM Power8 machines. I understand that this version of gromacs has support for Power8 machines and it certainly performs a great deal better than earlier versions. So that is progress. On the other hand I wonder if the Power8 could do better. I used one of the gromacs benchmarks (case A) supplied by PRACE and ran the following command on the Power8..
> gmx mdrun -ntomp 8 -ntmpi 24 -s ion_channel.tpr -maxh 0.50 -resethway
> -noconfout -nsteps 10000 -g logfile -nb cpu
> In other words there are 24 cores and I turned on SMT=8. Using that configuration and the above command the performance is about the same as on a 16 core Sandybridge node...
> Performance in ns/day
> SandyBridge (16 cores) - 11.23
> Power8 (24 cores) -- 11.82
> Power8 (using 2 gpus) -- 28.34
> That's compiling with the GNU compiler v4.9.1. Should I be able to do better on the Power8 with a different/later compiler and/or different runtime settings?
> Best regards,
> Gromacs Users mailing list
> * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/GMX-Users_List before posting!
> * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
> * For (un)subscribe requests visit
> https://maillist.sys.kth.se/mailman/listinfo/gromacs.org_gmx-users or send a mail to gmx-users-request at gromacs.org.
Gromacs Users mailing list
* Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/GMX-Users_List before posting!
* Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
* For (un)subscribe requests visit
https://maillist.sys.kth.se/mailman/listinfo/gromacs.org_gmx-users or send a mail to gmx-users-request at gromacs.org.
More information about the gromacs.org_gmx-users