什么原因使得“on signal 9 (Killed)”
-
大家好,计算后发现求解器的log在算第一个时间的时候就出现了这样的情况:
Primary job terminated normally, but 1 process returned a non-zero exit code. Per user-direction, the job has been aborted. -------------------------------------------------------------------------- [joann:06935] Read -1, expected 5328, errno = 3 [joann:06936] Read -1, expected 5400, errno = 3 [joann:06926] Read -1, expected 5400, errno = 3 [joann:06931] Read -1, expected 10368, errno = 3 [joann:06932] Read -1, expected 5616, errno = 3 [joann:06933] Read -1, expected 6984, errno = 3 -------------------------------------------------------------------------- mpirun noticed that process rank 0 with PID 0 on node joann exited on signal 9 (Killed).
对于这个问题有点迷茫,特来请教各位一下如何解决这个问题?
-
-
@bestucan 在 什么原因使得“on signal 9 (Killed)” 中说:
cfd-online.com/Forums/openfoam-solving/111374-mpirun-error-signall-9-killed.html
可能是计算机内存不够用或者内存越界之类的
您好,看到您发的帖子,我想请教一下,我最近也遇到“on signal 9 (killed)”这个问题,目前使用的服务器内存128G的,网格也才400多万,我看运行的时候内存占用不过10%左右,但是还是会出现上述情况,有点迷惑,您发的链接我也都看过,尝试了很多方法仍然没解决。看您提到了内存越界,想咨询一下如果是这个原因,那么应该怎么解决呢,多谢。
-
log文件最后是这样的
Time = 460.8
Courant Number mean: 0.114989 max: 0.387001
GAMG: Solving for p, Initial residual = 0.047574, Final residual = 0.000426877, No Iterations 12
time step continuity errors : sum local = 8.93845e-09, global = 6.90083e-17, cumulative = 5.40449e-11
Pressure gradient source: uncorrected Ubar = 0.045, pressure gradient = 7.45872e-06
GAMG: Solving for p, Initial residual = 0.0392888, Final residual = 0.000366504, No Iterations 13
time step continuity errors : sum local = 7.68053e-09, global = 6.901e-17, cumulative = 5.40449e-11
Pressure gradient source: uncorrected Ubar = 0.045, pressure gradient = 7.4512e-06
GAMG: Solving for p, Initial residual = 0.00325348, Final residual = 8.77001e-07, No Iterations 65
time step continuity errors : sum local = 1.82218e-11, global = 6.90109e-17, cumulative = 5.4045e-11
Pressure gradient source: uncorrected Ubar = 0.045, pressure gradient = 7.45132e-06
ExecutionTime = 2997.06 s ClockTime = 3053 sTime = 460.9
Courant Number mean: 0.11499 max: 0.386534Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
mpirun noticed that process rank 4 with PID 0 on node ps exited on signal 9 (Killed).