[gmx-users] Re: Re: Re: Question about parallazing Gromacs
Cesar Araujo
cesar.araujo at oulu.fi
Thu Sep 14 21:48:04 CEST 2006
> ------------------------------
>
> Message: 7
> Date: Thu, 14 Sep 2006 09:27:45 +0200
> From: "Qiao Baofu" <qiaobf at gmail.com>
> Subject: Re: [gmx-users] Re: Question about parallazing Gromacs (Qiao
> Baofu)
> To: "Discussion list for GROMACS users" <gmx-users at gromacs.org>
> Message-ID:
> <6a91f07b0609140027u68ac3085tb01120f19b854545 at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi,
>
> Thanks. I have test different cpus. Our institute has two clusters: one is
> each node has 4 cpu (A), one is one node has only 1 cpu (B). I made
> different tests on the two clusters and my local computer using the same
> system. See the following result:
>
> A (For 1 hour)
> # of cpus ; MD steps
> 4 finished (200000steps for 26:21)
> 8 finished (200000steps for 40:57)
> 12 87950
> 20 42749
> 44 5962 !!!!!
> B (For 1 hour)
> # of cpu ; MD steps
> 1 156991 for 56:12
> 2 179820
> 3 200,000 for 54:20
> 4 200,000 for 51:12
> c. Local(single cpu), 200000 steps For 1h52:38
>
> One can see that
> 1. On cluster A, one nodes(4 cpu) is just as 4 times fast as my local
> computer.
> 2. More than one nodes will decrease the performancs the gromacs,
> 3. On cluster B, the more cpu used, the faster gromacs runs. But the
> difference of speed is not apparent.
> 4. Cluster B with 4 cpus is slow as half as that Cluster A with 1 node (4
> cpus)
>
> I wonder if anyone can tell the bottlenack: the hardware on the cluster or
> gromacs?
>
>
Well, probably it is a combination of bottlenecks in both software and
hardware.
Both of them should be optimized to guarantee the best performance. From the
hardware side the delays in inter-node communication across the network can
be the reason. But also, it can be related with software issues related to
the operating
system and network configuration. I think that the first thing to do is work
in cluster
setup to achieve the best performance for your hardware configuration.
In addition, take into account that inter-node communication between two
CPU's
inside the same machine should be faster than between two CPU's across the
network.
>From the software side (Gromacs) the algorithms should be optimized with
regard
to the parallel programming model. David said in a previous answer that they
have detected some problems in the software implementation and they are
working
to solve that issue in the next version of Gromacs.
Regards,
-----------------------------------------------------------
Cesar Araujo, Lic. of Chemistry
Department of Molecular Endocrynology
Oulu University Hospital
FIN-90029 OYS, OULU, FINLAND
phone: +358 8 3155632
e-mail: cesar.araujo at oulu.fi
More information about the gromacs.org_gmx-users
mailing list