Name | famous_u295_1999_200_006635764_4 |
Workunit | 6839136 |
Created | 10 Jun 2010, 11:26:01 UTC |
Sent | 18 Jul 2010, 7:28:54 UTC |
Report deadline | 17 Oct 2010, 14:56:05 UTC |
Received | 11 Sep 2010, 15:15:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1297049 |
Run time | 9 days 13 hours 7 min 30 sec |
CPU time | 10 days 6 hours 27 min 14 sec |
Validate state | Invalid |
Credit | 4,385.28 |
Device peak FLOPS | 1.73 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> uire lockfile (-154) - waiting 35s (31110): Can't acquire lockfile (-154) - exiting (31111): Can't acquire lockfile (-154) - waiting 35s (31111): Can't acquire lockfile (-154) - exiting (31112): Can't acquire lockfile (-154) - waiting 35s (31112): Can't acquire lockfile (-154) - exiting (31113): Can't acquire lockfile (-154) - waiting 35s (31113): Can't acquire lockfile (-154) - exiting (31144): Can't acquire lockfile (-154) - waiting 35s (31144): Can't acquire lockfile (-154) - exiting (31318): Can't acquire lockfile (-154) - waiting 35s (31318): Can't acquire lockfile (-154) - exiting (31319): Can't acquire lockfile (-154) - waiting 35s (31319): Can't acquire lockfile (-154) - exiting (31320): Can't acquire lockfile (-154) - waiting 35s (31320): Can't acquire lockfile (-154) - exiting (31331): Can't acquire lockfile (-154) - waiting 35s (31331): Can't acquire lockfile (-154) - exiting (31332): Can't acquire lockfile (-154) - waiting 35s (31332): Can't acquire lockfile (-154) - exiting (31333): Can't acquire lockfile (-154) - waiting 35s (31333): Can't acquire lockfile (-154) - exiting (31334): Can't acquire lockfile (-154) - waiting 35s (31334): Can't acquire lockfile (-154) - exiting (31335): Can't acquire lockfile (-154) - waiting 35s (31335): Can't acquire lockfile (-154) - exiting (31509): Can't acquire lockfile (-154) - waiting 35s (31509): Can't acquire lockfile (-154) - exiting (31510): Can't acquire lockfile (-154) - waiting 35s (31510): Can't acquire lockfile (-154) - exiting (31511): Can't acquire lockfile (-154) - waiting 35s (31511): Can't acquire lockfile (-154) - exiting (31517): Can't acquire lockfile (-154) - waiting 35s (31517): Can't acquire lockfile (-154) - exiting (31518): Can't acquire lockfile (-154) - waiting 35s (31518): Can't acquire lockfile (-154) - exiting (31519): Can't acquire lockfile 
(-154) - waiting 35s (31519): Can't acquire lockfile (-154) - exiting (31520): Can't acquire lockfile (-154) - waiting 35s (31520): Can't acquire lockfile (-154) - exiting (31521): Can't acquire lockfile (-154) - waiting 35s (31521): Can't acquire lockfile (-154) - exiting (31695): Can't acquire lockfile (-154) - waiting 35s (31695): Can't acquire lockfile (-154) - exiting (31696): Can't acquire lockfile (-154) - waiting 35s (31696): Can't acquire lockfile (-154) - exiting (31697): Can't acquire lockfile (-154) - waiting 35s (31697): Can't acquire lockfile (-154) - exiting (31708): Can't acquire lockfile (-154) - waiting 35s (31708): Can't acquire lockfile (-154) - exiting (31709): Can't acquire lockfile (-154) - waiting 35s (31709): Can't acquire lockfile (-154) - exiting (31710): Can't acquire lockfile (-154) - waiting 35s (31710): Can't acquire lockfile (-154) - exiting (31711): Can't acquire lockfile (-154) - waiting 35s (31711): Can't acquire lockfile (-154) - exiting (31712): Can't acquire lockfile (-154) - waiting 35s (31712): Can't acquire lockfile (-154) - exiting (31713): Can't acquire lockfile (-154) - waiting 35s (31713): Can't acquire lockfile (-154) - exiting (31887): Can't acquire lockfile (-154) - waiting 35s (31887): Can't acquire lockfile (-154) - exiting (31888): Can't acquire lockfile (-154) - waiting 35s (31888): Can't acquire lockfile (-154) - exiting (31894): Can't acquire lockfile (-154) - waiting 35s (31894): Can't acquire lockfile (-154) - exiting (31895): Can't acquire lockfile (-154) - waiting 35s (31895): Can't acquire lockfile (-154) - exiting (31896): Can't acquire lockfile (-154) - waiting 35s (31896): Can't acquire lockfile (-154) - exiting (31897): Can't acquire lockfile (-154) - waiting 35s (31897): Can't acquire lockfile (-154) - exiting (31898): Can't acquire lockfile (-154) - waiting 35s (31898): Can't acquire lockfile (-154) - exiting (31899): Can't acquire lockfile (-154) - waiting 35s (31899): Can't acquire lockfile (-154) - 
exiting (32073): Can't acquire lockfile (-154) - waiting 35s (32073): Can't acquire lockfile (-154) - exiting (32074): Can't acquire lockfile (-154) - waiting 35s (32074): Can't acquire lockfile (-154) - exiting (32085): Can't acquire lockfile (-154) - waiting 35s (32085): Can't acquire lockfile (-154) - exiting (32086): Can't acquire lockfile (-154) - waiting 35s (32086): Can't acquire lockfile (-154) - exiting (32089): Can't acquire lockfile (-154) - waiting 35s (32089): Can't acquire lockfile (-154) - exiting (32090): Can't acquire lockfile (-154) - waiting 35s (32090): Can't acquire lockfile (-154) - exiting (29599): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5531): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9599): Can't acquire lockfile (-154) - waiting 35s (8195): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (9599): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (11496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (16444): Can't acquire lockfile (-154) - waiting 35s (15013): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17843): Can't acquire lockfile (-154) - waiting 35s (16444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (19279): Can't acquire lockfile (-154) - waiting 35s (17843): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (19279): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (23564): Can't acquire lockfile (-154) - waiting 35s (22992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(23564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24990): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (31722): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (8817): Can't acquire lockfile (-154) - waiting 35s (7491): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (8817): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (10543): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (14623): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (17024): Can't acquire lockfile (-154) - waiting 35s (15568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (18427): Can't acquire lockfile (-154) - waiting 35s (17024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (19863): Can't acquire lockfile (-154) - waiting 35s (18427): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (21267): Can't acquire lockfile (-154) - waiting 35s (19863): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (22694): Can't acquire lockfile (-154) - waiting 35s (21267): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(22694): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (30875): Can't acquire lockfile (-154) - waiting 35s (29720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (30875): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (3978): Can't acquire lockfile (-154) - waiting 35s (2809): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5401): Can't acquire lockfile (-154) - waiting 35s (5401): Can't acquire lockfile (-154) - exiting (5409): Can't acquire lockfile (-154) - waiting 35s (5409): Can't acquire lockfile (-154) - exiting (5509): Can't acquire lockfile (-154) - waiting 35s (5509): Can't acquire lockfile (-154) - exiting (5510): Can't acquire lockfile (-154) - waiting 35s (5510): Can't acquire lockfile (-154) - exiting (5516): Can't acquire lockfile (-154) - waiting 35s (5516): Can't acquire lockfile (-154) - exiting (5517): Can't acquire lockfile (-154) - waiting 35s (5517): Can't acquire lockfile (-154) - exiting (5691): Can't acquire lockfile (-154) - waiting 35s (5691): Can't acquire lockfile (-154) - exiting (5692): Can't acquire lockfile (-154) - waiting 35s (5692): Can't acquire lockfile (-154) - exiting (5723): Can't acquire lockfile (-154) - waiting 35s (5723): Can't acquire lockfile (-154) - exiting (5724): Can't acquire lockfile (-154) - waiting 35s (5724): Can't acquire lockfile (-154) - exiting (5725): Can't acquire lockfile (-154) - waiting 35s (5725): Can't acquire lockfile (-154) - exiting (5726): Can't acquire lockfile (-154) - waiting 35s (5726): Can't acquire lockfile (-154) - exiting (5737): Can't acquire lockfile (-154) - waiting 35s (5737): Can't acquire lockfile (-154) - exiting (5738): Can't acquire lockfile (-154) - waiting 35s (5738): Can't acquire lockfile (-154) - exiting (5912): Can't acquire lockfile (-154) - waiting 35s (5912): Can't acquire 
lockfile (-154) - exiting (5913): Can't acquire lockfile (-154) - waiting 35s (5913): Can't acquire lockfile (-154) - exiting (5914): Can't acquire lockfile (-154) - waiting 35s (5914): Can't acquire lockfile (-154) - exiting (5915): Can't acquire lockfile (-154) - waiting 35s (5915): Can't acquire lockfile (-154) - exiting (5916): Can't acquire lockfile (-154) - waiting 35s (5916): Can't acquire lockfile (-154) - exiting (5917): Can't acquire lockfile (-154) - waiting 35s (5917): Can't acquire lockfile (-154) - exiting (5923): Can't acquire lockfile (-154) - waiting 35s (5923): Can't acquire lockfile (-154) - exiting (5924): Can't acquire lockfile (-154) - waiting 35s (5924): Can't acquire lockfile (-154) - exiting (6098): Can't acquire lockfile (-154) - waiting 35s (6098): Can't acquire lockfile (-154) - exiting (6099): Can't acquire lockfile (-154) - waiting 35s (6099): Can't acquire lockfile (-154) - exiting (6100): Can't acquire lockfile (-154) - waiting 35s (6100): Can't acquire lockfile (-154) - exiting (6101): Can't acquire lockfile (-154) - waiting 35s (6101): Can't acquire lockfile (-154) - exiting (6102): Can't acquire lockfile (-154) - waiting 35s (6102): Can't acquire lockfile (-154) - exiting (6103): Can't acquire lockfile (-154) - waiting 35s (6103): Can't acquire lockfile (-154) - exiting (6114): Can't acquire lockfile (-154) - waiting 35s (6114): Can't acquire lockfile (-154) - exiting (6115): Can't acquire lockfile (-154) - waiting 35s (6115): Can't acquire lockfile (-154) - exiting (6289): Can't acquire lockfile (-154) - waiting 35s (6289): Can't acquire lockfile (-154) - exiting (6290): Can't acquire lockfile (-154) - waiting 35s (6290): Can't acquire lockfile (-154) - exiting (6291): Can't acquire lockfile (-154) - waiting 35s (6291): Can't acquire lockfile (-154) - exiting (6292): Can't acquire lockfile (-154) - waiting 35s (6292): Can't acquire lockfile (-154) - exiting (6293): Can't acquire lockfile (-154) - waiting 35s (6293): Can't acquire 
lockfile (-154) - exiting (6294): Can't acquire lockfile (-154) - waiting 35s (6294): Can't acquire lockfile (-154) - exiting (6300): Can't acquire lockfile (-154) - waiting 35s (6300): Can't acquire lockfile (-154) - exiting (6301): Can't acquire lockfile (-154) - waiting 35s (6301): Can't acquire lockfile (-154) - exiting (6475): Can't acquire lockfile (-154) - waiting 35s (6475): Can't acquire lockfile (-154) - exiting (6476): Can't acquire lockfile (-154) - waiting 35s (6476): Can't acquire lockfile (-154) - exiting (6477): Can't acquire lockfile (-154) - waiting 35s (6477): Can't acquire lockfile (-154) - exiting (6478): Can't acquire lockfile (-154) - waiting 35s (6478): Can't acquire lockfile (-154) - exiting (6479): Can't acquire lockfile (-154) - waiting 35s (6479): Can't acquire lockfile (-154) - exiting (6480): Can't acquire lockfile (-154) - waiting 35s (6480): Can't acquire lockfile (-154) - exiting (6491): Can't acquire lockfile (-154) - waiting 35s (6491): Can't acquire lockfile (-154) - exiting (6492): Can't acquire lockfile (-154) - waiting 35s (6492): Can't acquire lockfile (-154) - exiting (6643): Can't acquire lockfile (-154) - waiting 35s (6643): Can't acquire lockfile (-154) - exiting (6737): Can't acquire lockfile (-154) - waiting 35s (3978): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6737): Can't acquire lockfile (-154) - exiting (6876): No heartbeat from core client for 30 sec - exiting (8274): Can't acquire lockfile (-154) - waiting 35s CPDN Monitor - No 'heartbeat' from BOINC... (8274): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (10186): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (11263): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (18013): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (20742): Can't acquire lockfile (-154) - waiting 35s (19310): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (20742): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (23605): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (24901): Can't acquire lockfile (-154) - waiting 35s (24901): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (31664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6026): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (9292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (19245): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1 Model crash detected, will try to restart... (7750): Can't acquire lockfile (-154) - waiting 35s (6448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (7750): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16456, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16456, iMonCtr=1 Model crash detected, will try to restart... (17223): Can't acquire lockfile (-154) - waiting 35s (16456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (17223): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11243, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11243, iMonCtr=1 Model crash detected, will try to restart... (11243): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (12546): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22299, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22299, iMonCtr=1 Model crash detected, will try to restart... (22299): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23429, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23429, iMonCtr=1 Model crash detected, will try to restart... (23429): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29011, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29011, iMonCtr=1 Model crash detected, will try to restart... (29011): No heartbeat from core client for 30 sec - exiting (30348): Can't acquire lockfile (-154) - waiting 35s CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30348, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30348, iMonCtr=1 Model crash detected, will try to restart... (30348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3633, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3633, iMonCtr=1 Model crash detected, will try to restart... (4789): Can't acquire lockfile (-154) - waiting 35s (3633): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4789, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4789, iMonCtr=1 Model crash detected, will try to restart... (7573): Can't acquire lockfile (-154) - waiting 35s (4789): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7573, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7573, iMonCtr=1 Model crash detected, will try to restart... (7573): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17036, iMonCtr=1 Model crash detected, will try to restart... 
BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17036, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 (17036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20340, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20340, iMonCtr=1 Model crash detected, will try to restart... (20340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21106, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21106, iMonCtr=1 Model crash detected, will try to restart... (21106): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27284, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27284, iMonCtr=1 Model crash detected, will try to restart... (28050): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (28050): Can't acquire lockfile (-154) - exiting (28058): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (28058): Can't acquire lockfile (-154) - exiting (28161): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (28161): Can't acquire lockfile (-154) - exiting (28162): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (28162): Can't acquire lockfile (-154) - exiting (28168): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (28168): Can't acquire lockfile (-154) - exiting (28169): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (28169): Can't acquire lockfile (-154) - exiting Suspended CPDN Monitor - Suspend request from BOINC... 
(28170): Can't acquire lockfile (-154) - waiting 35s (28170): Can't acquire lockfile (-154) - exiting (28344): Can't acquire lockfile (-154) - waiting 35s (28344): Can't acquire lockfile (-154) - exiting (28345): Can't acquire lockfile (-154) - waiting 35s (28345): Can't acquire lockfile (-154) - exiting (28346): Can't acquire lockfile (-154) - waiting 35s (28346): Can't acquire lockfile (-154) - exiting (28347): Can't acquire lockfile (-154) - waiting 35s (28347): Can't acquire lockfile (-154) - exiting (28348): Can't acquire lockfile (-154) - waiting 35s (28348): Can't acquire lockfile (-154) - exiting (28349): Can't acquire lockfile (-154) - waiting 35s (28349): Can't acquire lockfile (-154) - exiting (28360): Can't acquire lockfile (-154) - waiting 35s (28360): Can't acquire lockfile (-154) - exiting (28361): Can't acquire lockfile (-154) - waiting 35s (28361): Can't acquire lockfile (-154) - exiting (28535): Can't acquire lockfile (-154) - waiting 35s (28535): Can't acquire lockfile (-154) - exiting (28536): Can't acquire lockfile (-154) - waiting 35s (28536): Can't acquire lockfile (-154) - exiting (28537): Can't acquire lockfile (-154) - waiting 35s (28537): Can't acquire lockfile (-154) - exiting (28538): Can't acquire lockfile (-154) - waiting 35s (28538): Can't acquire lockfile (-154) - exiting (28539): Can't acquire lockfile (-154) - waiting 35s (28539): Can't acquire lockfile (-154) - exiting (28540): Can't acquire lockfile (-154) - waiting 35s (28540): Can't acquire lockfile (-154) - exiting (28546): Can't acquire lockfile (-154) - waiting 35s (28546): Can't acquire lockfile (-154) - exiting (28547): Can't acquire lockfile (-154) - waiting 35s (28547): Can't acquire lockfile (-154) - exiting (28548): Can't acquire lockfile (-154) - waiting 35s (28548): Can't acquire lockfile (-154) - exiting (28722): Can't acquire lockfile (-154) - waiting 35s (28722): Can't acquire lockfile (-154) - exiting (28723): Can't acquire lockfile (-154) - waiting 35s (28723): 
Can't acquire lockfile (-154) - exiting (28724): Can't acquire lockfile (-154) - waiting 35s (28724): Can't acquire lockfile (-154) - exiting (28725): Can't acquire lockfile (-154) - waiting 35s (28725): Can't acquire lockfile (-154) - exiting (28726): Can't acquire lockfile (-154) - waiting 35s (28726): Can't acquire lockfile (-154) - exiting (28737): Can't acquire lockfile (-154) - waiting 35s (28737): Can't acquire lockfile (-154) - exiting (28738): Can't acquire lockfile (-154) - waiting 35s (28738): Can't acquire lockfile (-154) - exiting (28739): Can't acquire lockfile (-154) - waiting 35s (28739): Can't acquire lockfile (-154) - exiting (28913): Can't acquire lockfile (-154) - waiting 35s (28913): Can't acquire lockfile (-154) - exiting (28914): Can't acquire lockfile (-154) - waiting 35s (28914): Can't acquire lockfile (-154) - exiting (28915): Can't acquire lockfile (-154) - waiting 35s (28915): Can't acquire lockfile (-154) - exiting (28916): Can't acquire lockfile (-154) - waiting 35s (28916): Can't acquire lockfile (-154) - exiting (28917): Can't acquire lockfile (-154) - waiting 35s (28917): Can't acquire lockfile (-154) - exiting (28918): Can't acquire lockfile (-154) - waiting 35s (28918): Can't acquire lockfile (-154) - exiting (28924): Can't acquire lockfile (-154) - waiting 35s (28924): Can't acquire lockfile (-154) - exiting (28925): Can't acquire lockfile (-154) - waiting 35s (28925): Can't acquire lockfile (-154) - exiting (28926): Can't acquire lockfile (-154) - waiting 35s (28926): Can't acquire lockfile (-154) - exiting (29100): Can't acquire lockfile (-154) - waiting 35s (29100): Can't acquire lockfile (-154) - exiting (29101): Can't acquire lockfile (-154) - waiting 35s (29101): Can't acquire lockfile (-154) - exiting (29102): Can't acquire lockfile (-154) - waiting 35s (29102): Can't acquire lockfile (-154) - exiting (29103): Can't acquire lockfile (-154) - waiting 35s (29103): Can't acquire lockfile (-154) - exiting (29104): Can't 
acquire lockfile (-154) - waiting 35s (29104): Can't acquire lockfile (-154) - exiting (29115): Can't acquire lockfile (-154) - waiting 35s (29115): Can't acquire lockfile (-154) - exiting (29116): Can't acquire lockfile (-154) - waiting 35s (29116): Can't acquire lockfile (-154) - exiting (29117): Can't acquire lockfile (-154) - waiting 35s (29117): Can't acquire lockfile (-154) - exiting (29316): Can't acquire lockfile (-154) - waiting 35s (29316): Can't acquire lockfile (-154) - exiting (29536): Can't acquire lockfile (-154) - waiting 35s (29536): Can't acquire lockfile (-154) - exiting (29537): Can't acquire lockfile (-154) - waiting 35s (29537): Can't acquire lockfile (-154) - exiting (29538): Can't acquire lockfile (-154) - waiting 35s (29538): Can't acquire lockfile (-154) - exiting (29539): Can't acquire lockfile (-154) - waiting 35s (29539): Can't acquire lockfile (-154) - exiting (29545): Can't acquire lockfile (-154) - waiting 35s (29545): Can't acquire lockfile (-154) - exiting (29546): Can't acquire lockfile (-154) - waiting 35s (29546): Can't acquire lockfile (-154) - exiting (29547): Can't acquire lockfile (-154) - waiting 35s (29547): Can't acquire lockfile (-154) - exiting (29548): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (29548): Can't acquire lockfile (-154) - exiting (29752): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (29752): Can't acquire lockfile (-154) - exiting (29753): Can't acquire lockfile (-154) - waiting 35s (29753): Can't acquire lockfile (-154) - exiting (29754): Can't acquire lockfile (-154) - waiting 35s (29754): Can't acquire lockfile (-154) - exiting (29755): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (29755): Can't acquire lockfile (-154) - exiting (29766): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... 
(29766): Can't acquire lockfile (-154) - exiting (29767): Can't acquire lockfile (-154) - waiting 35s (29767): Can't acquire lockfile (-154) - exiting (29768): Can't acquire lockfile (-154) - waiting 35s (29768): Can't acquire lockfile (-154) - exiting (29769): Can't acquire lockfile (-154) - waiting 35s (29769): Can't acquire lockfile (-154) - exiting (29943): Can't acquire lockfile (-154) - waiting 35s (29943): Can't acquire lockfile (-154) - exiting (29944): Can't acquire lockfile (-154) - waiting 35s (29944): Can't acquire lockfile (-154) - exiting (29945): Can't acquire lockfile (-154) - waiting 35s (29945): Can't acquire lockfile (-154) - exiting (29946): Can't acquire lockfile (-154) - waiting 35s (29946): Can't acquire lockfile (-154) - exiting (29947): Can't acquire lockfile (-154) - waiting 35s (29947): Can't acquire lockfile (-154) - exiting (29953): Can't acquire lockfile (-154) - waiting 35s (29953): Can't acquire lockfile (-154) - exiting (29954): Can't acquire lockfile (-154) - waiting 35s (29954): Can't acquire lockfile (-154) - exiting (29955): Can't acquire lockfile (-154) - waiting 35s (29955): Can't acquire lockfile (-154) - exiting (30129): Can't acquire lockfile (-154) - waiting 35s (30129): Can't acquire lockfile (-154) - exiting (30130): Can't acquire lockfile (-154) - waiting 35s (30130): Can't acquire lockfile (-154) - exiting (30131): Can't acquire lockfile (-154) - waiting 35s (30131): Can't acquire lockfile (-154) - exiting (30132): Can't acquire lockfile (-154) - waiting 35s (30132): Can't acquire lockfile (-154) - exiting (30133): Can't acquire lockfile (-154) - waiting 35s (30133): Can't acquire lockfile (-154) - exiting (30144): Can't acquire lockfile (-154) - waiting 35s (30144): Can't acquire lockfile (-154) - exiting (30145): Can't acquire lockfile (-154) - waiting 35s (30145): Can't acquire lockfile (-154) - exiting (30146): Can't acquire lockfile (-154) - waiting 35s (30146): Can't acquire lockfile (-154) - exiting (30147): 
Can't acquire lockfile (-154) - waiting 35s (30147): Can't acquire lockfile (-154) - exiting (30321): Can't acquire lockfile (-154) - waiting 35s (30321): Can't acquire lockfile (-154) - exiting (30322): Can't acquire lockfile (-154) - waiting 35s (30322): Can't acquire lockfile (-154) - exiting (30323): Can't acquire lockfile (-154) - waiting 35s (30323): Can't acquire lockfile (-154) - exiting (30324): Can't acquire lockfile (-154) - waiting 35s (30324): Can't acquire lockfile (-154) - exiting (30330): Can't acquire lockfile (-154) - waiting 35s (30330): Can't acquire lockfile (-154) - exiting (30331): Can't acquire lockfile (-154) - waiting 35s (30331): Can't acquire lockfile (-154) - exiting (30332): Can't acquire lockfile (-154) - waiting 35s (30332): Can't acquire lockfile (-154) - exiting (30333): Can't acquire lockfile (-154) - waiting 35s (30333): Can't acquire lockfile (-154) - exiting (30507): Can't acquire lockfile (-154) - waiting 35s (30507): Can't acquire lockfile (-154) - exiting (30508): Can't acquire lockfile (-154) - waiting 35s (30508): Can't acquire lockfile (-154) - exiting (30509): Can't acquire lockfile (-154) - waiting 35s (30509): Can't acquire lockfile (-154) - exiting (30510): Can't acquire lockfile (-154) - waiting 35s (30510): Can't acquire lockfile (-154) - exiting (30511): Can't acquire lockfile (-154) - waiting 35s (30511): Can't acquire lockfile (-154) - exiting (30522): Can't acquire lockfile (-154) - waiting 35s (30522): Can't acquire lockfile (-154) - exiting (30523): Can't acquire lockfile (-154) - waiting 35s (30523): Can't acquire lockfile (-154) - exiting (30524): Can't acquire lockfile (-154) - waiting 35s (30524): Can't acquire lockfile (-154) - exiting (30878): Can't acquire lockfile (-154) - waiting 35s (30878): Can't acquire lockfile (-154) - exiting (27284): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30943, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30943, iMonCtr=1 Model crash detected, will try to restart... (30943): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1461, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1461, iMonCtr=1 Model crash detected, will try to restart... (1461): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (2541): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15714, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15714, iMonCtr=1 Model crash detected, will try to restart... (15714): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22546, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22546, iMonCtr=1 Model crash detected, will try to restart... 
(22546): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (22952): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1788, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1788, iMonCtr=1 Model crash detected, will try to restart... (4126): Can't acquire lockfile (-154) - waiting 35s (1788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (4126): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21834, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21834, iMonCtr=1 Model crash detected, will try to restart... (21834): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (27722): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8122, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8122, iMonCtr=1 Model crash detected, will try to restart... (11572): Can't acquire lockfile (-154) - waiting 35s (8122): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11572, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11572, iMonCtr=1 Model crash detected, will try to restart... (11572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (19552): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18563, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18563, iMonCtr=1 Model crash detected, will try to restart... (9413): Can't acquire lockfile (-154) - waiting 35s (18563): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9413, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9413, iMonCtr=1 Model crash detected, will try to restart... (11088): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (11088): Can't acquire lockfile (-154) - exiting (11097): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... (11097): Can't acquire lockfile (-154) - exiting (9413): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11200, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11200, iMonCtr=1 Model crash detected, will try to restart... (12732): Can't acquire lockfile (-154) - waiting 35s (11200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12732, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12732, iMonCtr=1 Model crash detected, will try to restart... (12732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (14419): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14419, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14419, iMonCtr=1 Model crash detected, will try to restart... (17638): Can't acquire lockfile (-154) - waiting 35s (14419): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17638, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17638, iMonCtr=1 Model crash detected, will try to restart... (25588): Can't acquire lockfile (-154) - waiting 35s (17638): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (25588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): 
No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (25588): No heartbeat from core client for 30 sec - exiting (10680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (10786): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (23778): No heartbeat from core client for 30 sec - exiting (7098): Can't acquire lockfile (-154) - waiting 35s Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (19008): Can't acquire lockfile (-154) - waiting 35s (19635): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (9553): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14862, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14862, iMonCtr=1 Model crash detected, will try to restart... (15430): Can't acquire lockfile (-154) - waiting 35s (14862): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (15430): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (4817): Can't acquire lockfile (-154) - waiting 35s (483): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (17473): Can't acquire lockfile (-154) - waiting 35s (12069): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (17473): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (30029): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting (30029): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... (32272): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... (23411): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting (23411): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... (23632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting (23632): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... (12765): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (17123): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... (7409): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
(7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (7409): No heartbeat from core client for 30 sec - exiting (5755): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (5999): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6219): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6262): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (6624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (2314): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
(12296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (12522): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (12546): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 21 - Return code = 1 Model crashed: READ_FLH: I/O error tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 21 - Return code = 1 Model crashed: READ_FLH: I/O error tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 21 - Return code = 1 Model crashed: READ_FLH: I/O error tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 21 - Return code = 1 Model crashed: READ_FLH: I/O error tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 21 - Return code = 1 Model crashed: READ_FLH: I/O error tmp/pipe_dummy BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 21 - Return code = 1 Model crashed: READ_FLH: I/O error tmp/pipe_dummy Sorry, too many model crashes! :-( (12983): called boinc_finish </stderr_txt> ]]> |
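The stderr text above repeats a small set of messages: lockfile failures, missed heartbeats, model restarts, and finally the BUFFIN read errors that precede "too many model crashes". A minimal sketch for tallying them is given here, assuming the stderr text has been saved as a plain string. The function name `tally_stderr` and the category labels are hypothetical; the message fragments are copied from the log itself.

```python
import re
from collections import Counter

# Message fragments taken verbatim from the stderr output; labels are illustrative.
PATTERNS = {
    "lockfile": re.compile(r"Can't acquire lockfile \(-154\)"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_crash": re.compile(r"Model crash detected"),
    "read_failed": re.compile(r"BUFFIN: Read Failed"),
}


def tally_stderr(text: str) -> Counter:
    """Count how often each known message fragment occurs in the stderr text."""
    counts = Counter()
    for label, pattern in PATTERNS.items():
        counts[label] = len(pattern.findall(text))
    return counts


# Short sample assembled from messages that appear in the log above.
sample = (
    "(28170): Can't acquire lockfile (-154) - waiting 35s "
    "(28170): Can't acquire lockfile (-154) - exiting "
    "(27284): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
    "BUFFIN: Read Failed: Inappropriate ioctl for device"
)
print(tally_stderr(sample))
```

Running the same function over the full stderr text would show which message dominates, without reading the dump by eye.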
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 Sep 2010 13:35:35 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,329,146 | 884,127 | 0.6652 |
11 Sep 2010 11:53:45 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,319,786 | 877,886 | 0.6652 |
11 Sep 2010 10:07:46 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,310,426 | 871,635 | 0.6652 |
11 Sep 2010 08:22:30 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,301,066 | 865,397 | 0.6651 |
11 Sep 2010 06:35:27 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,291,706 | 859,152 | 0.6651 |
11 Sep 2010 04:54:26 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,282,346 | 852,908 | 0.6651 |
11 Sep 2010 03:08:23 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,272,986 | 846,665 | 0.6651 |
11 Sep 2010 01:24:32 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,263,626 | 840,426 | 0.6651 |
10 Sep 2010 23:23:34 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,254,266 | 834,182 | 0.6651 |
10 Sep 2010 21:37:27 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,244,906 | 827,962 | 0.6651 |
10 Sep 2010 19:51:28 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,235,546 | 821,743 | 0.6651 |
10 Sep 2010 18:06:50 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,226,186 | 815,514 | 0.6651 |
10 Sep 2010 16:25:14 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,216,826 | 809,278 | 0.6651 |
10 Sep 2010 14:41:29 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,207,466 | 803,036 | 0.6651 |
10 Sep 2010 12:56:24 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,198,106 | 796,827 | 0.6651 |
10 Sep 2010 11:12:57 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,188,746 | 790,616 | 0.6651 |
10 Sep 2010 09:29:07 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,179,386 | 784,371 | 0.6651 |
10 Sep 2010 07:45:08 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,170,026 | 778,113 | 0.6650 |
10 Sep 2010 06:00:18 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,160,666 | 771,881 | 0.6650 |
10 Sep 2010 04:14:36 | 779809 | 11429629 | famous_u295_1999_200_006635764_4 | 1,151,306 | 765,646 | 0.6650 |
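
The Average (sec/TS) column appears to be CPU Time (sec) divided by Timestep, rounded to four decimal places. The page does not state this, so treat it as an assumption. The sketch below checks it against three rows from the table; the function name `average_sec_per_timestep` is hypothetical.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, average listed in the table)
rows = [
    (1_329_146, 884_127, 0.6652),
    (1_301_066, 865_397, 0.6651),
    (1_151_306, 765_646, 0.6650),
]

for timestep, cpu_time, listed in rows:
    computed = average_sec_per_timestep(cpu_time, timestep)
    print(f"timestep={timestep} computed={computed:.4f} listed={listed:.4f}")
```

The computed value matches the listed one for each of these three rows, which is consistent with the assumed formula.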
©2024 climateprediction.net