domain decomposition error, exit codes: 137

From: Sami Niemi <saniem_at_utu.fi>
Date: Wed, 23 Jul 2008 15:18:30 +0100

Dear Gadget-2 Users,

I ran into a domain decomposition error while trying to run Gadget-2
with 1024 cores. Here's a copy what the log says (shortened):
...
Total number of particles : 0134217728

allocated 0.0762939 Mbyte for ngb search.

Allocated 16.6029 MByte for BH-tree. 64

domain decomposition...
_pmii_daemon(SIGCHLD): PE 996 exit signal Killed

... (goes on and on with all PEs)

_pmii_daemon(SIGCHLD): PE 888 exit signal Killed
Application 57082 exit codes: 137
Application 57082 exit signals: Killed

I have run the same IC file with 512 and 64 cores without any problems
so the problem must be related to the number of PEs. However, I did
not change any Makefile or parameter file options when I moved from
512 (or 64) to 1024 cores, so everything in these files should be ok.

The IC file I'm running contains 512**3 particles in a 20/h Mpc box on
side, but I doubt it matters.


Thanks in advance,
Sami



_______________________
Sami-Matias Niemi
Student Support Astronomer
Nordic Optical Telescope

sami_at_not.iac.es
+34 662 535 441
+34 922 425 424

http://users.utu.fi/saniem/
http://saniem.deviantart.com
----------------------------------------
Received on 2008-07-23 16:18:33

This archive was generated by hypermail 2.3.0 : 2023-01-10 10:01:30 CET