= Applications that use coprocessors =

BOINC supports applications that use coprocessors.
The supported coprocessor types (as of [18892]) are NVIDIA and ATI GPUs.

The BOINC client probes for coprocessors and reports them in scheduler requests.
It runs an app only if enough coprocessor instances are available.

You can develop your application using any programming system, e.g.
CUDA (for NVIDIA), Brook+ (for ATI), or OpenCL.

== Command-line arguments ==

Some hosts have multiple GPUs.
When your application is run by BOINC, it will be passed a command-line argument
{{{
--device N
}}}
where N is the device number of the GPU that is to be used.
If your application uses multiple GPUs,
it will be passed multiple --device arguments, e.g.
{{{
--device 0 --device 3
}}}
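
Your application must read these arguments and select the corresponding device before doing any GPU work. The following is a minimal sketch of such parsing (the argument handling here is illustrative, not a BOINC API); a CUDA app would then pass the chosen device number to cudaSetDevice() before making any other CUDA calls:
{{{
#include <cstdlib>
#include <cstring>
#include <vector>

int main(int argc, char** argv) {
    std::vector<int> devices;
    for (int i = 1; i < argc; i++) {
        // collect each "--device N" pair passed by the client
        if (!strcmp(argv[i], "--device") && i + 1 < argc) {
            devices.push_back(atoi(argv[++i]));
        }
    }
    if (devices.empty()) devices.push_back(0);  // no argument: assume device 0
    // select devices[0] here, e.g. with cudaSetDevice(devices[0]),
    // then do the actual computation
    return 0;
}
}}}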

== Deploying a coprocessor app ==

When you deploy a coprocessor app you must specify:

 * its hardware and software requirements
 * an estimate of what fraction of a CPU it will use
 * an estimate of its performance on individual hosts

This information is specified in an
[AppPlan application planning function] that you link into your scheduler.
Specifically, you must:

 * Choose a "plan class" name for your program, say "cuda" (see below).
 * Create an [UpdateVersions app version], specifying its plan class as "cuda".
 * Edit the function '''app_plan()''' in '''sched/sched_customize.cpp''' so that it contains a clause for your plan class.

The default '''app_plan()''' contains a clause for plan class '''cuda'''.
We will explain its logic; you may need to modify it for your CUDA app.

First, we check whether the host has an NVIDIA GPU:
{{{
int app_plan(SCHEDULER_REQUEST& sreq, char* plan_class, HOST_USAGE& hu) {
    ...
    if (!strcmp(plan_class, "cuda")) {
        COPROC_CUDA* cp = (COPROC_CUDA*)sreq.coprocs.lookup("CUDA");
        if (!cp) {
            ...
            return PLAN_REJECT_CUDA_NO_DEVICE;
        }
}}}

Check the compute capability (1.0 or better):
{{{
        int v = (cp->prop.major)*100 + cp->prop.minor;
        if (v < 100) {
            ...
            return PLAN_REJECT_CUDA_VERSION;
        }
}}}

Check the CUDA runtime version.
As of client version 6.10, all clients report the CUDA runtime version
(cp->cuda_version); use that if it's present.
In 6.8 and earlier, the CUDA runtime version isn't reported.
Windows clients report the driver version,
from which the CUDA version can be inferred;
Linux clients don't report the driver version,
so we don't know what the CUDA version is.
{{{
        // for CUDA 2.3, we need to check the CUDA runtime version.
        // Old BOINC clients report the display driver version;
        // newer ones report the CUDA runtime version.
        //
        if (!strcmp(plan_class, "cuda23")) {
            if (cp->cuda_version) {
                if (cp->cuda_version < 2030) {
                    return PLAN_REJECT_CUDA_VERSION;
                }
            } else if (cp->display_driver_version) {
                if (cp->display_driver_version < PLAN_CUDA23_MIN_DRIVER_VERSION) {
                    return PLAN_REJECT_CUDA_VERSION;
                }
            } else {
                return PLAN_REJECT_CUDA_VERSION;
            }
        }
}}}

Check for the amount of video RAM:
{{{
        if (cp->prop.dtotalGlobalMem < PLAN_CUDA_MIN_RAM) {
            if (config.debug_version_select) {
                ...
            }
            return PLAN_REJECT_CUDA_MEM;
        }
}}}

Estimate the FLOPS:
{{{
        hu.flops = cp->flops_estimate();
}}}
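
'''flops_estimate()''' is supplied by BOINC's COPROC_CUDA class, which computes a peak-FLOPS figure from the device properties. For intuition only, an estimate of this general shape can be built from the CUDA device properties; the constants below (cores per multiprocessor, FLOPs per core per clock) are illustrative assumptions, not BOINC's exact values:
{{{
#include <cuda_runtime.h>

// Illustrative sketch only, not BOINC's implementation:
// rough peak FLOPS from clock rate and multiprocessor count.
double cuda_peak_flops_sketch(const cudaDeviceProp& prop) {
    return prop.clockRate * 1e3        // clockRate is in kHz; convert to Hz
        * prop.multiProcessorCount     // number of multiprocessors
        * 8                            // assumed cores per multiprocessor
        * 2;                           // assumed FLOPs per core per clock
}
}}}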

Estimate its CPU usage:
{{{
        // assume we'll need 0.5% as many CPU FLOPS as GPU FLOPS
        // to keep the GPU fed.
        //
        double x = (hu.flops*0.005)/sreq.host.p_fpops;
        hu.avg_ncpus = x;
        hu.max_ncpus = x;
}}}

Indicate the number of GPUs used.
Typically this will be 1.
If your application uses only a fraction X<1 of the GPU's processors
and a fraction Y<1 of its video RAM,
report the number of GPUs as max(X, Y).
In this case BOINC will attempt to run multiple jobs per GPU if possible.
{{{
        hu.ncudas = 1;
}}}
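
For example, if each job is expected to use at most half of the GPU's processors and half of its video RAM (illustrative figures), you could report max(0.5, 0.5) = 0.5 GPUs, and BOINC may then run two such jobs on one device:
{{{
        hu.ncudas = 0.5;    // each job uses at most half a GPU
}}}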

Return 0 to indicate that the application can be run on the host:
{{{
        return 0;
}}}