[gmx-developers] parallel make problems
Mark Abraham
mark.j.abraham at gmail.com
Wed Jun 19 14:19:54 CEST 2013
On Wed, Jun 19, 2013 at 12:20 AM, Szilárd Páll <szilard.pall at cbr.su.se>wrote:
> On Tue, Jun 18, 2013 at 7:15 PM, Mark Abraham <mark.j.abraham at gmail.com>
> wrote:
> >
> >
> >
> > On Tue, Jun 18, 2013 at 6:47 PM, Roland Schulz <roland at utk.edu> wrote:
> >>
> >> On Tue, Jun 18, 2013 at 11:05 AM, Mark Abraham <
> mark.j.abraham at gmail.com>
> >> wrote:
> >> >
> >> >
> >> >
> >> > On Mon, Jun 17, 2013 at 7:59 PM, Roland Schulz <roland at utk.edu>
> wrote:
> >> >>
> >> >> On Mon, Jun 17, 2013 at 1:10 PM, Mark Abraham
> >> >> <mark.j.abraham at gmail.com>
> >> >> wrote:
> >> >> >
> >> >> >
> >> >> >
> >> >> > On Mon, Jun 17, 2013 at 6:16 PM, Manuel Nuno Melo <m.n.melo at rug.nl
> >
> >> >> > wrote:
> >> >> >>
> >> >> >> Hi,
> >> >> >>
> >> >> >> I have also had linking problems when making in parallel. In my
> case
> >> >> >> they
> >> >> >> could be traced back to the option to let GMX download/build its
> own
> >> >> >> fftw
> >> >> >> (-DGMX_BUILD_OWN_FFTW=ON).
> >> >> >>
> >> >> >> It seems that only one of make's threads starts building fftw,
> while
> >> >> >> the
> >> >> >> others go ahead building/linking GMX. Since fftw compilation is
> not
> >> >> >> ready by
> >> >> >> the time it is needed, GMX linking is botched.
> >> >> >
> >> >> >
> >> >> > Yes, Rossen first showed this to me. I don't know if the underlying
> >> >> > issue is
> >> >> > that the dependency cannot be described properly, or that we're not
> >> >> > doing it
> >> >> > properly. If it's a problem, people are welcome to contribute a
> fix!
> >> >> > :-)
> >> >> It was working in https://gerrit.gromacs.org/#/c/1675/12. You then
> >> >> changed how the dependency works in patch set 13. You never replied
> to
> >> >> Christophs comment why this was changed (at least I can't find a
> >> >> reply). Do you remember?
> >> >
> >> >
> >> > I couldn't remember, but gerrit can - I never published a series of
> >> > responses I made back then, sorry. Now published at
> >> > https://gerrit.gromacs.org/1675
> >> >
> >> >> Otherwise I can change it back as 12 did it
> >> >> and it should work again.
> >> >
> >> >
> >> > It might do, but as I said in those secret drafts the form of patch 12
> >> > doesn't work on cmake 2.8.7 because of a bug there in
> >> > add_library(...GLOBAL)
> >> > (and I suspect is probably too global, anyway, but this probably does
> no
> >> > harm?).
> >> >
> >> > So I'm still not sure there's a convenient solution that works in all
> >> > cases.
> >> > Compromising the smooth running of a parallel make for someone
> >> > downloading
> >> > FFTW seems like the most low-impact problem of the set we could choose
> >> > to
> >> > have.
> >>
> >> Probably true. Just doesn't give a good first impression of us to new
> >> users.
> >> I think we should also consider for the future whether we really want
> >> to support ~11 unmaintained version of cmake (including for all our
> >> optional features). Downloading cmake is no big deal. They have
> >> binaries to download. And cmake doesn't fix any version but for the
> >> most recent version. So it seems odd that we try to maintain
> >> workarounds for the last ~11 versions which are all unmaintained by
> >> the cmake developers. That seems like it is going to stay a really
> >> annoying maintenance task.
> >
> >
> > True. Now that we've shown it is a PITA for the developers to work
> around a
> > handful of known issues with various 2.8.x point releases of CMake, it
> > sounds reasonable to me that we pick a late-model CMake 2.8.x as the
> > requirement for GROMACS 5. That could open the door to an alternative
> > implementation for self-built FFTW.
>
> I agree that it is annoying having to work around CMake issues. At the
> same time, I think it would be a rather "user-unfriendly" move to
> require a very late version of CMake. As a user, it is fair to expect
> that building GROMACS is as hassle-free as possible.
Right. But I think all of the CMake issues have arisen while trying to make
building GROMACS as hassle-free as possible. Here are just some of the
known issues:
1) we need something in 2.8.2 in order to download the regression tests via
CMake
2) 2.8.10 updated FindCUDA and changed the behaviour with respect to
setting the host compiler for nvcc
3) 2.8.8 provides CMAKE_<LANG>_COMPILER_VERSION, for which we currently
provide duplicate functionality in at least two places; this is mostly used
for supporting tests for known versions of compilers that have missing or
broken functionality
4) we sometimes fall back on finding MPI, which didn't work well before
2.8.5 (and find_file() has a minor bug that was fixed in 2.8.10)
5) can't check an MD5 sum of the downloaded FFTW library before 2.8.3
6) one way to link the downloaded FFTW library doesn't work on 2.8.7
Those are all things that we are trying to do to make a hassle-free GROMACS
build experience. In some of the above cases we issue a fatal error and
suggest a CMake upgrade anyway.
The principle has led the GROMACS devs to do a whole pile of "extra" work
implementing those checks, and future work reading and maintaining that
code. This is not sustainable. For GROMACS 4.6, we compromised on CMake 2.8
between developer convenience and the possibility users would not need to
install cmake, which we did before realising we'd be encouraging about half
the users to update to a real compiler. Getting cmake at the same time is
not a big deal. Requiring 2.8.10 for GROMACS 5 lets get rid of a lot of
nonsense and go back to writing MD code.
Hence, having to
> download CMake as a first step of the installation process will
> probably lead to many users not updating (early) to newer GROMACS
> versions.
>
I'd much rather spend devs time writing documentation of their awesome new
features, so that users know they can do cool science with the new version
if they get a new CMake version (which they might re-use anyway). The needs
of the few and the needs of the many are being traded off here. It's not
like we're charging money selling shrink-wrapped software. :-)
In general, the issue is the way CMake development introduces changes
> in minor versions which affect behaviour. This can easily break
> fragile code in the build system. I don't have a good suggestion to
> overcome such problems, but I think that the choice of required
> minimum CMake version should depend on what versions provide the major
> Linux OS-es.
>
RHEL ships with a 2.6 (which drives CentOS and Scientific Linux). EPEL
provides a 2.8.9, though.
Current Ubuntu LTS (precise) has 2.8.7, but the next looks like it would
have at least 2.8.11. Current stable has 2.8.10.
Fedora 19 has 2.8.10.
macports has 2.8.10
So it seems to me that requiring CMake 2.8.10 for a Feb 2014 release is
quite reasonable.
Mark
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://maillist.sys.kth.se/pipermail/gromacs.org_gmx-developers/attachments/20130619/b2f59c98/attachment.html>
More information about the gromacs.org_gmx-developers
mailing list