Opened 17 years ago
Closed 16 years ago
#205 closed Defect (fixed)
Can't rename current state file to previous state file
Reported by: | Ageless | Owned by: | charlief |
---|---|---|---|
Priority: | Major | Milestone: | Undetermined |
Component: | Client - Daemon | Version: | |
Keywords: | Cc: |
Description
As reported by zombie67 on the BOINC_dev forums: http://boinc.berkeley.edu/dev/forum_thread.php?id=1827
I regularly have BOINC lock up. During the lock up, CPU usage drops to 0. After about a minute, it resumes with the following error message. Upon resuming, the tasks continue to crunch until complete like normal. FWIW, it does not cause the tasks to abort, or cause validation errors.
Can't rename client_state_next.xml to client_state.xml; check file and directory permissions |
[error] Couldn't write state file: system rename |
Tue May 22 09:44:47 2007|SETI@home|Task 05mr05aa.16368.15714.348568.3.118_6 exited with zero status but no 'finished' file Tue May 22 09:44:47 2007|SETI@home|If this happens repeatedly you may need to reset the project.
The last two lines are repeated, once for each task running (8 times total for my machine).
The permissions look correct to me. I have tried resetting with no luck. Any ideas?
[hulk:/Library/Application Support/BOINC Data] zombie% ls -la
total 17720
drwxrwxr-x 32 boinc_ma boinc_ma 1088 May 22 09:50 .
drwxrwxr-x 20 root admin 680 May 12 15:17 ..
-rw-rw-r-- 1 boinc_ma boinc_ma 12292 May 14 16:49 .DS_Store
-rw-rw-r-- 1 boinc_ma boinc_ma 2097 May 22 08:09 account_setiathome.berkeley.edu.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 13959 May 18 07:19 all_projects_list.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 238100 Apr 26 02:24 ca-bundle.crt
-rw-rw-r-- 1 boinc_ma boinc_ma 1622746 May 22 09:50 client_state.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 1622746 May 22 09:49 client_state_prev.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 260 May 17 14:42 get_project_config.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 2037 May 21 12:16 global_prefs.xml
-rw-rw---- 1 boinc_ma boinc_ma 32 May 12 15:18 gui_rpc_auth.cfg
-rw-rw-r-- 1 boinc_ma boinc_ma 14701 May 18 07:25 job_log_abcbeta.math.leidenuniv.nl.txt
-rw-rw-r-- 1 boinc_ma boinc_ma 186 May 14 09:44 job_log_boinc.bakerlab.org_rosetta.txt
-rw-rw-r-- 1 boinc_ma boinc_ma 8133 May 18 08:23 job_log_dist.ist.tugraz.at_cape5.txt
-rw-rw-r-- 1 boinc_ma boinc_ma 141525 May 22 09:45 job_log_setiathome.berkeley.edu.txt
dr-xr-x--- 38 boinc_ma boinc_ma 1292 May 8 23:25 locale
-rw-rw-r-- 1 boinc_ma boinc_ma 0 May 12 15:18 lockfile
-rw-rw-r-- 1 boinc_ma boinc_ma 139 May 17 14:42 lookup_account.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 5964 May 22 09:45 lookup_website.html
-rw-rw-r-- 1 boinc_ma boinc_ma 69 May 18 04:49 master_setiathome.berkeley.edu.xml
drwxrwxr-x 4 boinc_ma boinc_ma 136 May 18 08:25 projects
-rw-rw-r-- 1 boinc_ma boinc_ma 5395 May 22 08:47 sched_reply_setiathome.berkeley.edu.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 249360 May 22 08:47 sched_request_setiathome.berkeley.edu.xml
drwxrwxr-x 19 boinc_ma boinc_ma 646 May 22 08:16 slots
-rw-rw-r-- 1 boinc_ma boinc_ma 3723 May 22 08:48 statistics_setiathome.berkeley.edu.xml
-rw-rw-r-- 1 boinc_ma boinc_ma 0 May 12 15:18 stderrdae.txt
-rw-rw-r-- 1 boinc_ma boinc_ma 2180603 May 22 08:08 stderrgui.txt
-rw-rw-r-- 1 boinc_ma boinc_ma 2097163 May 21 07:43 stdoutdae.old
-rw-rw-r-- 1 boinc_ma boinc_ma 781017 May 22 09:47 stdoutdae.txt
-rw-rw-r-- 1 boinc_ma boinc_ma 268 May 22 06:23 stdoutgui.txt
dr-xr-x--- 4 boinc_ma boinc_ma 136 May 18 08:05 switcher
-rw-rw-r-- 1 boinc_ma boinc_ma 4862 May 22 08:08 time_stats_log
Change History (7)
comment:1 Changed 17 years ago by
comment:2 Changed 17 years ago by
zombie67 is testing 5.9.12 now.
It doesn't look like the fix worked...
But now I am beginning to think there are two different problems, maybe not related.
1) BOINC periodically locking up for about a minute at a time. Haven't seen this happen with 5.9.12 yet. But I have only been running it for a little while.
2) "Can't rename client_state_next.xml to client_state.xml"
Starting BOINC client version 5.9.12 for i686-apple-darwin |
log flags: task, file_xfer, sched_ops |
Libraries: libcurl/7.16.1 OpenSSL/0.9.7l zlib/1.2.3 |
Data directory: /Library/Application? Support/BOINC Data |
Wed May 23 18:50:43 2007|SETI@home|Found app_info.xml; using anonymous platform
Processor: 8 GenuineIntel? Intel(R) Xeon(R) CPU X5365 @ 3.00GHz [x86 Family 6 Model 15 Stepping 7] |
Processor features: FPU VME DE PSE TSC MSR PAE MCE CX8 APIC SEP MTRR PGE MCA CMOV PAT PSE36 CLFSH DS ACPI MMX FXSR SSE SSE2 SS HTT TM SSE3 MON DSCPL VMX EST TM2 MNI CX16 TPR |
Memory: 4.00 GB physical, 628.77 GB virtual |
Disk: 698.32 GB total, 628.52 GB free |
Version change (5.9.11 -> 5.9.12) |
Wed May 23 18:50:44 2007|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 3360245; location: work; project prefs: default
General prefs: from http://www.worldcommunitygrid.org/ (last modified 2007-05-20 21:24:06) |
Host location: work |
General prefs: using separate prefs for work |
Preferences limit memory usage when active to 4055.04MB |
Preferences limit memory usage when idle to 4055.04MB |
Preferences limit disk usage to 93.13GB |
Running CPU benchmarks |
Benchmark results: |
Number of CPUs: 8 |
2434 floating point MIPS (Whetstone) per CPU |
6777 integer MIPS (Dhrystone) per CPU |
Wed May 23 18:51:46 2007|SETI@home|Restarting task 16ja05ab.25838.11154.586070.3.134_2 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 18mr05aa.11342.17728.990892.3.112_0 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 27fe05aa.20439.24976.884652.3.1_0 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 27fe05aa.20439.22530.117328.3.32_0 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 27fe05aa.20439.24976.884652.3.132_1 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 27fe05aa.20439.24976.884652.3.108_0 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 27fe05aa.20439.24976.884652.3.109_0 using setiathome_enhanced version 513 Wed May 23 18:51:46 2007|SETI@home|Restarting task 27fe05aa.20439.24976.884652.3.238_0 using setiathome_enhanced version 513 Wed May 23 18:52:04 2007|SETI@home|Sending scheduler request: Requested by user Wed May 23 18:52:04 2007|SETI@home|Reporting 7 tasks Wed May 23 18:52:25 2007|SETI@home|Scheduler RPC succeeded [server version 509] Wed May 23 18:52:25 2007|SETI@home|Deferring communication for 11 sec Wed May 23 18:52:25 2007|SETI@home|Reason: requested by project Wed May 23 19:04:36 2007|SETI@home|Computation for task 16ja05ab.25838.11154.586070.3.134_2 finished Wed May 23 19:04:36 2007|SETI@home|Starting 27fe05aa.20439.24976.884652.3.233_0 Wed May 23 19:04:37 2007|SETI@home|Starting task 27fe05aa.20439.24976.884652.3.233_0 using setiathome_enhanced version 513 Wed May 23 19:04:39 2007|SETI@home|[file_xfer] Started upload of file 16ja05ab.25838.11154.586070.3.134_2_0 Wed May 23 19:04:41 2007|SETI@home|[file_xfer] Finished upload of file 16ja05ab.25838.11154.586070.3.134_2_0 Wed May 23 19:04:41 2007|SETI@home|[file_xfer] Throughput 29322 bytes/sec Wed May 23 19:08:42 2007|SETI@home|Sending scheduler request: Requested by user Wed May 23 19:08:42 2007|SETI@home|Reporting 1 tasks Wed May 23 19:09:02 2007|SETI@home|Scheduler RPC succeeded [server version 509] Wed May 23 19:09:02 2007|SETI@home|Deferring communication for 11 sec Wed May 23 19:09:02 2007|SETI@home|Reason: requested by project Wed May 23 19:14:18 2007|SETI@home|Computation for task 18mr05aa.11342.17728.990892.3.112_0 finished Wed May 23 19:14:18 2007|SETI@home|Starting 27fe05aa.20439.24976.884652.3.244_2 Wed May 23 19:14:18 2007|SETI@home|Starting task 27fe05aa.20439.24976.884652.3.244_2 using setiathome_enhanced version 513 Wed May 23 19:14:20 2007|SETI@home|[file_xfer] Started upload of file 18mr05aa.11342.17728.990892.3.112_0_0 Wed May 23 19:14:23 2007|SETI@home|[file_xfer] Finished upload of file 18mr05aa.11342.17728.990892.3.112_0_0 Wed May 23 19:14:23 2007|SETI@home|[file_xfer] Throughput 36473 bytes/sec
Can't rename client_state_next.xml to client_state.xml; check file and directory permissions |
[error] Couldn't write state file: system rename |
Wed May 23 19:27:51 2007|SETI@home|Computation for task 27fe05aa.20439.24976.884652.3.1_0 finished Wed May 23 19:27:51 2007|SETI@home|Starting 16ja05ab.25838.15234.517322.3.89_1 Wed May 23 19:27:51 2007|SETI@home|Starting task 16ja05ab.25838.15234.517322.3.89_1 using setiathome_enhanced version 513 Wed May 23 19:27:53 2007|SETI@home|[file_xfer] Started upload of file 27fe05aa.20439.24976.884652.3.1_0_0 Wed May 23 19:27:57 2007|SETI@home|[file_xfer] Finished upload of file 27fe05aa.20439.24976.884652.3.1_0_0 Wed May 23 19:27:57 2007|SETI@home|[file_xfer] Throughput 7808 bytes/sec
Can't rename client_state_next.xml to client_state.xml; check file and directory permissions |
[error] Couldn't write state file: system rename |
comment:3 Changed 17 years ago by
Resolution: | → fixed |
---|---|
Status: | new → closed |
Should be fixed in 5.9.12. -David
comment:4 Changed 17 years ago by
Resolution: | fixed |
---|---|
Status: | closed → reopened |
http://boinc.berkeley.edu/dev/forum_thread.php?id=1853
AdrianX - WinXP, 5.8.15, reports:
06/06/2007 17:31:01|Docking@Home|Reporting 1 tasks
06/06/2007 17:31:06|Docking@Home|Scheduler RPC succeeded [server version 509]
06/06/2007 17:31:06|Docking@Home|Deferring communication for 11 sec
06/06/2007 17:31:06|Docking@Home|Reason: requested by project
Can't rename current state file to previous state file; The process cannot access the file because it is being used by another process. (0x20) |
06/06/2007 17:58:13|QMC@HOME|Sending scheduler request: Requested by user
06/06/2007 17:58:13|QMC@HOME|(not requesting new work or reporting completed tasks)
06/06/2007 17:58:18|QMC@HOME|Scheduler RPC succeeded [server version 509]
06/06/2007 17:58:18|QMC@HOME|Scheduler RPC succeeded [server version 509]
06/06/2007 17:58:18|QMC@HOME|Deferring communication for 7 sec
06/06/2007 17:58:18|QMC@HOME|Reason: requested by project
Can't rename current state file to previous state file; The process cannot access the file because it is being used by another process. (0x20) |
06/06/2007 19:03:02|QMC@HOME|Sending scheduler request: To fetch work
06/06/2007 19:03:02|QMC@HOME|Requesting 736 seconds of new work
06/06/2007 19:03:08|QMC@HOME|Scheduler RPC succeeded [server version 509]
06/06/2007 19:03:08|QMC@HOME|Deferring communication for 7 sec
06/06/2007 19:03:08|QMC@HOME|Reason: requested by project
06/06/2007 19:03:10|QMC@HOME|[file_xfer] Started download of file qmc.in.two_470_peptidexp.1833
... again, but all still seems to function. client_state and client_state_prev xml files both have time stamps of 20:20 so they are obviously updating.
comment:5 Changed 17 years ago by
Summary: | Macintosh: Error: Can't rename client_state_next.xml → Can't rename current state file to previous state file |
---|
comment:6 Changed 17 years ago by
Owner: | changed from davea to charlief |
---|---|
Status: | reopened → new |
comment:8 Changed 16 years ago by
Resolution: | → fixed |
---|---|
Status: | new → closed |
This message has appeared from time to time but can never be reproduced. As noted by Ageless above, it does no harm because it usually works on a retry, and even if all the retries fail, it will usually work the next time the state file is updated. The only real problem was that it unnecessarily alarmed users. So I made the following change for version 6.1.10 to fix that:
Charlie Mar 10 2008
- Client: Rework write_state_file logic to always retry if there is an
error writing or renaming the state file, and to show warning messages Can't rename / Can't write state file only if it failed all retries or if the state_debug log flag is set. These messages have appeared from time to time and worry users, but they never seemed to signal any real problems, probably because the subsequent retry succeeded.
I also deleted the last entry, which was SPAM.
Reported to Charlie Fenton. A fix was included for 5.9.12 by David Anderson.