Opened 15 years ago

Closed 14 years ago

#896 closed Defect (fixed)

Client issues 'empty' RPC requesting zero work

Reported by: Richard Haselgrove Owned by: davea
Priority: Minor Milestone: 6.12
Component: Client - Work Fetch Policy Version: 6.6.28
Keywords: Cc:

Description (last modified by Nicolas)

Description If a client has a CUDA work shortfall, but no CUDA project is fetchable, it may issue a CPU work fetch instead. If there is no CPU shortfall, the work fetch is issued for 0.00 seconds.

For example:

17/04/2009 11:21:51 [wfd] CPU: shortfall 0.00 nidle 0.00 est. delay 0.00
17/04/2009 11:21:51 [wfd] CUDA: shortfall 21420.16 nidle 0.00 est. delay 0.00
17/04/2009 11:21:51 Einstein@Home chosen: CPU minor shortfall
17/04/2009 11:21:51 Einstein@Home [wfd] request: CPU (0.00 sec, 0) CUDA (0.00 sec, 0)

Reproducibility Difficult. Only observed during SETI server crashes, but observed twice during different outages - once with client v6.6.23, and several times with client v6.6.28

Suspect code area Function WORK_FETCH::choose_project() in work_fetch.cpp

There is no "none of the above" escape path to cancel the work fetch if no project with the desired resource is fetchable.

Reporting history On boinc_alpha:

17 Apr 2009, "Work fetch bug in v6.6.23" (with log)
23 Apr 2009, "Fw: Work fetch bug in v6.6.23" (analysis with JM7)
10 May 2009, "Work fetch bug continues in v6.6.28" (7 instances logged)

Change History (4)

comment:1 Changed 15 years ago by Nicolas

Anything a project could do to intentionally cause this problem, and help reproduce?

comment:2 Changed 15 years ago by Richard Haselgrove

Nobody noticed (or if they noticed, they were too polite to mention) that I typed v6.6.28 wrongly.

Observed again today, with v6.6.31, and no SETI server crash (though the feeder/scheduler is under some extra stress because they've squandered all the Astropulse jobs).

comment:3 Changed 14 years ago by Nicolas

Description: modified (diff)

Fixed version number.

comment:4 Changed 14 years ago by davea

Resolution: fixed
Status: newclosed

This is probably fixed in 6.10.24; reopen if not.

Note: See TracTickets for help on using tickets.