Re: domain decomposition error, exit codes: 137

From: Sami Niemi <saniem_at_utu.fi>
Date: Thu, 24 Jul 2008 00:17:36 +0100

Hi Cameron and Luca,

Thank you very much Cameron for pointing out the thread. Seems that
the suggestion solved my problem. At least I got Gadget running with
1024 PEs.

Thanks for Luca as well. Your comment cleared the matter further.


Cheers,
Sami


On Jul 23, 2008, at 3:43 PM, Cameron McBride wrote:

> Hello Sami,
>
> This sounds like it could be the same issue we had last
> October. In our case, Volker suggested a fix which worked fine
> for many nodes (more than 1500).
>
> The suggestion is:
> http://www.mpa-garching.mpg.de/galform/gadget/gadget-list/0177.html
>
> And the start of the thread is:
> http://www.mpa-garching.mpg.de/galform/gadget/gadget-list/0173.html
>
> Cameron
>
> Sami Niemi wrote (23 Jul 2008 10:18 EDT):
>>
>> Dear Gadget-2 Users,
>>
>> I ran into a domain decomposition error while trying to run Gadget-2
>> with 1024 cores. Here's a copy what the log says (shortened):
>> ..
>> Total number of particles : 0134217728
>>
>> allocated 0.0762939 Mbyte for ngb search.
>>
>> Allocated 16.6029 MByte for BH-tree. 64
>>
>> domain decomposition...
>> _pmii_daemon(SIGCHLD): PE 996 exit signal Killed
>>
>> .. (goes on and on with all PEs)
>>
>> _pmii_daemon(SIGCHLD): PE 888 exit signal Killed
>> Application 57082 exit codes: 137
>> Application 57082 exit signals: Killed
>>
>> I have run the same IC file with 512 and 64 cores without any
>> problems
>> so the problem must be related to the number of PEs. However, I did
>> not change any Makefile or parameter file options when I moved from
>> 512 (or 64) to 1024 cores, so everything in these files should be ok.
>>
>> The IC file I'm running contains 512**3 particles in a 20/h Mpc box
>> on
>> side, but I doubt it matters.
>>
>>
>> Thanks in advance,
>> Sami
>>
>>
>>
>> _______________________
>> Sami-Matias Niemi
>> Student Support Astronomer
>> Nordic Optical Telescope
>>
>> sami_at_not.iac.es
>> +34 662 535 441
>> +34 922 425 424
>>
>> http://users.utu.fi/saniem/
>> http://saniem.deviantart.com
>> ----------------------------------------
>>
>>
>>
>>
>> -----------------------------------------------------------
>>
>> If you wish to unsubscribe from this mailing, send mail to
>> minimalist_at_MPA-Garching.MPG.de with a subject of: unsubscribe
>> gadget-list
>> A web-archive of this mailing list is available here:
>> http://www.mpa-garching.mpg.de/gadget/gadget-list
Received on 2008-07-24 01:17:39

This archive was generated by hypermail 2.3.0 : 2023-01-10 10:01:30 CET