if (!strcmp(plan_class, "cuda")) {
    COPROC_CUDA* cp = (COPROC_CUDA*)sreq.coprocs.lookup("CUDA");
    if (!cp) {
        if (config.debug_version_select) {
            log_messages.printf(MSG_NORMAL,
                "[version] Host lacks CUDA coprocessor for plan class cuda\n"
            );
        }
        add_no_work_message("Your computer has no NVIDIA GPU");
        return false;
    }
}}}

To define a new CUDA plan class, add a new clause to '''app_plan_cuda()''' using the '''cuda_check()''' helper.
For example:
{{{
if (!strcmp(plan_class, "cuda23")) {
    if (!cuda_check(c, hu,
        100,        // minimum compute capability (1.0)
        200,        // max compute capability (2.0)
        2030,       // min CUDA version (2.3)
        19500,      // min display driver version (195.00)
        384*MEGA,   // min video RAM
        1.,         // # of GPUs used (may be fractional, or an integer > 1)
        .01,        // fraction of FLOPS done by the CPU
        .21         // estimated GPU efficiency (actual/peak FLOPS)
    )) {
        return false;
    }
}
}}}

To define a new ATI/CAL plan class, add a new clause to '''app_plan_ati()'''.
For example:
{{{
if (!strcmp(plan_class, "ati14")) {
    if (!ati_check(c, hu,
        1004000,    // min display driver version (10.4)
        false,      // require libraries named "ati", not "amd"
        384*MEGA,   // min video RAM
        1.,         // # of GPUs used (may be fractional, or an integer > 1)
        .01,        // fraction of FLOPS done by the CPU
        .21         // estimated GPU efficiency (actual/peak FLOPS)
    )) {
        return false;
    }
}
}}}

To define a new OpenCL plan class, add a new clause to '''app_plan_opencl()'''.
For example:
{{{
if (!strcmp(plan_class, "opencl_nvidia_101")) {
    return opencl_check(
        c, hu,
        101,        // OpenCL version (1.1)
        256*MEGA,   // min video RAM
        1,          // # of GPUs used
        .1,         // fraction of FLOPS done by the CPU
        .21         // estimated GPU efficiency (actual/peak FLOPS)
    );
}
}}}

These helpers bundle checks like the ones walked through in the rest of this section, shown here written out by hand for the 'cuda' plan class.
Check the compute capability (1.0 or better):

{{{
int v = (cp->prop.major)*100 + cp->prop.minor;
if (v < 100) {
    if (config.debug_version_select) {
        log_messages.printf(MSG_NORMAL,
            "[version] CUDA version %d < 1.0\n", v
        );
    }
    add_no_work_message(
        "Your NVIDIA GPU lacks the needed compute capability"
    );
    return false;
}
}}}
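For example, a GPU with compute capability 1.3 reports prop.major = 1 and prop.minor = 3, giving v = 103, which passes this test; a device below capability 1.0 would give v < 100 and be rejected.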
Check the CUDA runtime version.
As of client version 6.10, all clients report the CUDA runtime version (cp->cuda_version); use that if it's present.
In 6.8 and earlier, the CUDA runtime version isn't reported.
Windows clients report the display driver version, from which the CUDA version can be inferred;
Linux clients don't report the driver version, so the CUDA version is unknown there.

{{{
// for CUDA 2.3, we need to check the CUDA RT version.
// Old BOINC clients report display driver version;
// newer ones report CUDA RT version
//
if (!strcmp(plan_class, "cuda23")) {
    if (cp->cuda_version) {
        if (cp->cuda_version < 2030) {
            add_no_work_message("CUDA version 2.3 needed");
            return false;
        }
    } else if (cp->display_driver_version) {
        if (cp->display_driver_version < PLAN_CUDA23_MIN_DRIVER_VERSION) {
            sprintf(buf, "NVIDIA display driver %d or later needed",
                PLAN_CUDA23_MIN_DRIVER_VERSION
            );
            add_no_work_message(buf);
            return false;
        }
    } else {
        add_no_work_message("CUDA version 2.3 needed");
        return false;
    }
}
}}}
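Note that this hand-written logic corresponds to the min CUDA version (2030) and min display driver version (19500) arguments passed to cuda_check() in the 'cuda23' example above.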
Check the amount of video RAM:

{{{
if (cp->prop.dtotalGlobalMem < PLAN_CUDA_MIN_RAM) {
    if (config.debug_version_select) {
        log_messages.printf(MSG_NORMAL,
            "[version] CUDA mem %.0f < %.0f\n",
            cp->prop.dtotalGlobalMem, PLAN_CUDA_MIN_RAM
        );
    }
    sprintf(buf,
        "Your NVIDIA GPU has insufficient memory (need %.0fMB)",
        PLAN_CUDA_MIN_RAM/MEGA
    );
    add_no_work_message(buf);
    return false;
}
}}}
Estimate the FLOPS:

{{{
hu.flops = cp->flops_estimate();
}}}
Estimate its CPU usage:

{{{
// assume we'll need 0.5% as many CPU FLOPS as GPU FLOPS
// to keep the GPU fed.
//
double x = (hu.flops*0.005)/sreq.host.p_fpops;
hu.avg_ncpus = x;
hu.max_ncpus = x;
}}}
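As a worked example with hypothetical numbers: if hu.flops is 100 GFLOPS and the host benchmarks at 5 GFLOPS per CPU (sreq.host.p_fpops), then x = (100e9 * 0.005) / 5e9 = 0.1, so each job reserves a tenth of a CPU.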
Indicate the number of GPUs used. Typically this will be 1.
If your application uses only a fraction X < 1 of the GPU's processors,
and a fraction Y < 1 of its video RAM, report the number of GPUs as max(X, Y);
BOINC will then attempt to run multiple jobs per GPU where possible (see the sketch below).

{{{
hu.ncudas = 1;
}}}
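For the fractional case, here is a minimal sketch (the 0.5 and 0.25 figures are hypothetical):

{{{
// hypothetical app: uses about half the GPU's processors (X = 0.5)
// and a quarter of its video RAM (Y = 0.25);
// report max(X, Y), so BOINC may run two such jobs per GPU
hu.ncudas = 0.5;
}}}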
Return true to indicate that the application can be run on the host:

{{{
return true;
}}}
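Putting the steps together, a complete clause might look like the following minimal sketch. It is assembled from the excerpts above rather than copied from BOINC; 'mycuda' is a hypothetical plan class name, and buf, sreq, hu, and the other names are assumed to be declared as in the surrounding function.

{{{
if (!strcmp(plan_class, "mycuda")) {
    // require an NVIDIA GPU
    COPROC_CUDA* cp = (COPROC_CUDA*)sreq.coprocs.lookup("CUDA");
    if (!cp) {
        add_no_work_message("Your computer has no NVIDIA GPU");
        return false;
    }
    // require compute capability 1.0 or better
    int v = (cp->prop.major)*100 + cp->prop.minor;
    if (v < 100) {
        add_no_work_message("Your NVIDIA GPU lacks the needed compute capability");
        return false;
    }
    // require a minimum amount of video RAM
    if (cp->prop.dtotalGlobalMem < PLAN_CUDA_MIN_RAM) {
        add_no_work_message("Your NVIDIA GPU has insufficient memory");
        return false;
    }
    // estimate FLOPS, and the CPU fraction needed to keep the GPU fed
    hu.flops = cp->flops_estimate();
    double x = (hu.flops*0.005)/sreq.host.p_fpops;
    hu.avg_ncpus = x;
    hu.max_ncpus = x;
    hu.ncudas = 1;      // one whole GPU per job
    return true;
}
}}}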