[gmx-developers] mknb doesn't work on BlueGene with --enable-ppc-sqrt in single precision
Mark Abraham
Mark.Abraham at anu.edu.au
Thu Sep 3 02:08:55 CEST 2009
Roland Schulz wrote:
> Hi,
>
> on an email thread started by me on March 11 subject "Bluegene Kernel" (see
> below) Mathias said that the generic kernel should not be slower than the
> double hummer so this problem shouldn't matter.
Actually he said it the other way around, which will certainly be true :-)
> (Not saying that we shouldn't fix it just that is might not be important)
Sure - as it stands, configuration fails in the given case, and this
should be fixed correctly so that correct non-optimized kernels are
available in all cases.
Mark
> Roland
>
>
> ---------- Forwarded message ----------
> From: Erik Lindahl <lindahl at cbr.su.se>
> Date: Fri, Mar 13, 2009 at 8:58 AM
> Subject: Re: Bluegene Kernel
> To: Mathias PUETZ <mpuetz at de.ibm.com>
> Cc: Roland Schulz <roland at utk.edu>, "mark.abraham" <mark.abraham at anu.edu.au>,
> roland at rschulz.eu
>
>
> Hi,
>
> With the new kernel structure in the head branch it will be quite fine to
> have separate kernel sets for
>
> power4
> power5
> power6
> power7
> ...
> BG
>
> As long as you
>
> 1) Tell me exactly what options should we use for each architecture
> 2) Help test that it works :-)
>
>
> One question - since we now try to avoid generating kernels at build time
> (lots of reasons, coding simplicity being one of them), can you say in
> general whether xlf or xlc is faster?
>
> Cheers,
>
> Erik
>
>
> On Mar 13, 2009, at 1:28 PM, Mathias PUETZ wrote:
>
> Hi Roland,
>> each of the BlueGene "double hummer" kernels shouldn't be slower than the
>> generic ones,
>> so I think it's a more academic issue for the pair-force kernels. I had
>> checked the performance
>> of each kernel. I think the worst case was, that one or two double hummer
>> kernels ran 5% slower
>> than the generated ones.
>> However, enable-ppc-sqrt=1 also affects all other SQRT calculations in the
>> rest of the code,
>> that has not been specifically optimized, so you might get a minor
>> performance hit there,
>> if you can no longer specify --enable-ppc-sqrt=1.
>>
>> I don't quite understand, why PPC5 would be required regarding the SQRT
>> calculations.
>> FRSQRTE instructions is available since PPC4, but only Power6 (or higher)
>> and BlueGene offer enough
>> bits of precision to use just a single Newton-Raphson iteration to bring it
>> up to single precision.
>>
>> Viele Grüsse / Best regards,
>> Dr. Mathias Pütz
>>
>> IT Specialist for Application Performance
>>
>> Deep Computing - Strategic Growth Business
>> IBM Systems & Technology Group
>>
>> e-mail: mpuetz at de.ibm.com
>> mobile: + 49-(0)160-7120602
>> fax: + 49-(0)6131-84-6660
>>
>> Anschrift:
>> IBM Deutschland GmbH
>> Department B513
>> Hechtsheimer Str. 2 / Building 12
>> 55131 Mainz
>> Germany
>>
>> IBM Deutschland GmbH
>> Vorsitzender des Aufsichtsrats: Erich Clementi
>> Geschäftsführung: Martin Jetter (Vorsitzender), Reinhard Reschke, Christoph
>> Grandpierre, Matthias Hartmann, Thomas Fell, Michael Diemer
>> Sitz der Gesellschaft: Stuttgart
>> Registergericht: Amtsgericht Stuttgart, HRB 14562 WEEE-Reg.-Nr. DE 99369940
>>
>>
>>
>>
>> Roland Schulz
>> <roland at utk.edu>
>> Sent by: To
>> roland at rschulz.eu "mark.abraham"
>> <mark.abraham at anu.edu.au>, Mathias
>> PUETZ/Germany/IBM at IBMDE
>> 03/11/2009 10:03 cc
>> PM
>> Subject
>> Bluegene Kernel
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> Hi Mark, Hi Matthias,
>>
>> I'm writing you because you have both edited the "Gromacs Bluegene"
>> wiki page and both recommend to use
>> --enable-ppc-sqrt[=1] --enable-fortran --enable-bluegene
>>
>> I'm asking because the ppc-sqrt does not work on Bluegene anymore,
>> because Erik changed the single/double conversion so that it requires
>> PPC5. Should we add another option to the mknb to be PPC4 backward
>> compatible? Or is this not necssarry because Matthias' instrinsic
>> kernel is always faster?
>>
>> So my question:
>> Did you compare the performance of the intrinsic kernel with the
>> fortran and C kernel on Bluegene? Do you think the C/Fortran kernel
>> are thus needed on Bluegene? Or should we just change the compile
>> recommendation to not use -enable-ppc-sqrt so that it compiles again?
>> If the C/Fortran Kernel are not used anyhow this would not make a
>> difference, I think.
>>
>> Thanks
>> Roland
>>
>>
>> --
>> ORNL/UT Center for Molecular Biophysics cmb.ornl.gov
>> 865-241-1537, ORNL PO BOX 2008 MS6309
>>
>>
>>
> ------------
> Erik Lindahl <lindahl at cbr.su.se> Backup: <erik.lindahl at gmail.com>
> Associate Professor, Computational Structural Biology
> Center for Biomembrane Research, Dept. Biochemistry & Biophysics
> Stockholm University, SE-106 91 Stockholm, Sweden
> Tel: +46(0)8164675 Mobile: +46(0)703844534 Fax: mail a PDF instead
>
>
>
>
>
>
>
>
>
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> gmx-developers mailing list
> gmx-developers at gromacs.org
> http://lists.gromacs.org/mailman/listinfo/gmx-developers
> Please don't post (un)subscribe requests to the list. Use the
> www interface or send it to gmx-developers-request at gromacs.org.
More information about the gromacs.org_gmx-developers
mailing list