[gmx-users] Hyper-threading Gromacs 5.0.1
Carsten Kutzner
ckutzne at gwdg.de
Thu Sep 11 08:29:44 CEST 2014
On 11 Sep 2014, at 05:03, Johnny Lu <johnny.lu128 at gmail.com> wrote:
> Does writing frequently (like position, velocity, energy, log every 10
> timesteps) significantly slow down the simulation?
Probably yes, have a look at the md.log output file
and how much percent is spent in write traj.
Carsten
>
> On Wed, Sep 10, 2014 at 10:31 PM, Johnny Lu <johnny.lu128 at gmail.com> wrote:
>
>> tried that, and the result was:
>>
>> Reading file npt.tpr, VERSION 5.0.1 (double precision)
>> Changing nstlist from 10 to 40, rlist from 1 to 1.028
>>
>> The number of OpenMP threads was set by environment variable
>> OMP_NUM_THREADS to 48
>> Using 1 MPI thread
>> Using 48 OpenMP threads
>>
>> WARNING: Oversubscribing the available 24 logical CPU cores with 48
>> threads.
>> This will cause considerable performance loss!
>>
>> -------------------------------------------------------
>> Program mdrun_d, VERSION 5.0.1
>> Source code file:
>> /export/data1/kho/software/gromacs5.1/gromacs-5.0.1/src/gromacs/mdlib/nbnxn_search.c,
>> line: 2577
>>
>> Fatal error:
>> 48 OpenMP threads were requested. Since the non-bonded force buffer
>> reduction is prohibitively slow with more than 32 threads, we do not allow
>> this. Use 32 or less OpenMP threads.
>> For more information and tips for troubleshooting, please check the GROMACS
>> website at http://www.gromacs.org/Documentation/Errors
>> -------------------------------------------------------
>>
>> so. i guess that is not a good idea.
>>
>> On Wed, Sep 10, 2014 at 9:55 PM, Johnny Lu <johnny.lu128 at gmail.com> wrote:
>>
>>> that 8-20% performance increase was for gromacs 4.6.5
>>>
>>> On Wed, Sep 10, 2014 at 9:52 PM, Johnny Lu <johnny.lu128 at gmail.com>
>>> wrote:
>>>
>>>> Is it a good idea to use 48 OpenMP thread, under 1 MPI thread on 24 Xeon
>>>> Processors?
>>>>
>>>> The mail list say such practice give about 8-20% performance increase
>>>>
>>>> Should I try g_tune_pme when I searched for "imbalance" in the log file
>>>> and found nothing (24 OMP thread under 1 MPI thread on 24 Xeon Processor)?
>>>> Or is that done automatically?
>>>>
>>>> Does gromacs support double precision calculation on GPU if the hardware
>>>> supports that?
>>>>
>>>> The optimize fft option is also obsolete.
>>>>
>>>> Thanks again.
>>>>
>>>
>>>
>>
> --
> Gromacs Users mailing list
>
> * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/GMX-Users_List before posting!
>
> * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
>
> * For (un)subscribe requests visit
> https://maillist.sys.kth.se/mailman/listinfo/gromacs.org_gmx-users or send a mail to gmx-users-request at gromacs.org.
--
Dr. Carsten Kutzner
Max Planck Institute for Biophysical Chemistry
Theoretical and Computational Biophysics
Am Fassberg 11, 37077 Goettingen, Germany
Tel. +49-551-2012313, Fax: +49-551-2012302
http://www.mpibpc.mpg.de/grubmueller/kutzner
http://www.mpibpc.mpg.de/grubmueller/sppexa
More information about the gromacs.org_gmx-users
mailing list