Changes between Version 10 and Version 11 of RemoteJob

Timestamp: Jan 11, 2011, 3:45:28 AM
Author: tonig
Comment: --

RemoteJob

-                      v10
+                      v11
 = Remote job submission =
+== Introduction and disclaimer ==
 A group from Universitat Pompeu Fabra
 …
 to indicate the user of the RBoinc client and their machine.
-The system (Perl-based) is in boinc/rboinc/.
 '''Warning: this system has been used only by its developers.
 It will take some work to get it working on other projects.'''
 Powerpoint slides describing the system are
+ * Powerpoint slides describing the system are
 [http://boinc.berkeley.edu/rboinc.pdf here].
+For details please see the paper T. Giorgino, M. J. Harvey and G. De Fabritiis, ''Distributed computing as a virtual supercomputer: Tools to run and manage large-scale BOINC simulations'', Comp. Phys. Commun. 181, 1402 (2010).  [[http://boinc.berkeley.edu/rboinc.pdf pdf]]
+ * For details please see the paper T. Giorgino, M. J. Harvey and G. De Fabritiis, ''Distributed computing as a virtual supercomputer: Tools to run and manage large-scale BOINC simulations'', Comp. Phys. Commun. 181, 1402 (2010).  [[http://boinc.berkeley.edu/rboinc.pdf pdf]]
+ * For client instructions see http://www.multiscalelab.org/utilities/RemoteBoinc
 == Summary ==
+(This section to be fixed.)
+The software should be fairly self-explanatory, but installation may be tricky. Here's a general overview.
+RBoinc is composed of the following main components:
+. Client scripts, which are used by the scientists to submit and retrieve jobs. They are boinc_retrieve and boinc_submit.
+. Server CGIs, which handle files and interface with the rest of Boinc. They are called boinc_retrieve_server and boinc_submit_server.
+. Various (optional) monitoring scripts, which generate nightly reports, statistics, and the like.
+    * boinc_retrieve_server and boinc_submit_server run as CGI. The former also handles all administrative requests (stop, purge).
+    * boinc_retrieve and boinc_submit are the client components (as above, the former also handles admin requests)
+    * Exchange of files between client and server is done through WEBDAV http extensions (a scratch area needs to be set up for this)
+The software should be fairly self-explanatory, but installation may be tricky. The system is in boinc/rboinc/. Here's a general overview:
+    * You will need an Apache web server on the Boinc server (either the existing one, or a separate process). This instance will serve:
+          * the RBOINC cgi-scripts, e.g. at http://YOURSERVER:8383/rboinc_cgi
+          * a scratch area for temporary file exchange, exposed via WEBDAV, e.g. at  http://YOURSERVER:8383/DAV
     * WU naming is important and is enforced as follows: NNN-UUU_GGG-XX-YY-RNDzzzz where
           * NNN is the name of the workunit (sub-group)
 …
           * YY is the total n. of steps
           * zzzz is a random number (not actually needed)
+    * WUs are kept in a "workflow_directory",  a subdir of the project dir,  as per slide 22 of the Powerpoint.
+    * Inside each dir a "process" bash file is created, which is executed by the assimilator with the name of the assimilated WU as its argument. It will create_work the next step for execution.
+    * The main reason for using perl is that  I preferred to use the XML::Simple module for (un-) xml-ing data structures over the network - it was useful for adding features on the fly keeping backwards compatibility
+    * I implemented basic functions for authentication, but this is not finished yet
+    * file storage is optimized through hardlinking and pooling. Network transfers are not (but they could be)
+    * WUs are kept in a "workflow_directory",  a subdir of the project dir,  as per slide 22 of the Powerpoint. Inside each dir a "process" bash file is created, which is executed by the assimilator with the name of the assimilated WU as its argument. It will create_work the next step for execution.
+    * File storage is optimized through hardlinking and pooling. (Network transfers are not yet)
+    * Warning: authentication is not done yet (do secure the RBoinc port by firewall rules)
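The naming convention above can be checked mechanically. The following is a minimal sketch in Python, not part of RBoinc (which is written in Perl). The function name `parse_wu_name` and the regular expression are hypothetical. It assumes that no field contains `-` or `_`, that YY and zzzz are written as decimal digits, and it names each field after its placeholder rather than guessing at a meaning.

```python
import re

# Hypothetical pattern for NNN-UUU_GGG-XX-YY-RNDzzzz.
# Assumption: fields never contain '-' or '_'; YY and zzzz are digits.
WU_NAME_RE = re.compile(
    r"^(?P<nnn>[^-_]+)-(?P<uuu>[^-_]+)_(?P<ggg>[^-_]+)"
    r"-(?P<xx>[^-_]+)-(?P<yy>\d+)-RND(?P<zzzz>\d+)$"
)


def parse_wu_name(wu_name):
    """Split a WU name into its fields, or return None if it does not match."""
    match = WU_NAME_RE.match(wu_name)
    if match is None:
        return None
    fields = match.groupdict()
    # YY is the total number of steps, so expose it as an integer.
    fields["yy"] = int(fields["yy"])
    return fields
```

For instance, `parse_wu_name("abc-u1_g1-3-10-RND1234")` yields a dictionary with `nnn` set to `"abc"` and `yy` set to `10`, while a name that does not follow the convention yields `None`.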
+== Annotating the WU template files ==
+Both client and server are composed of Perl scripts (respectively command-line and cgi-bin). The main reason for using the Perl language is that I liked the XML::Simple module for (un-) xml-ing data structures over the network - which helped rapid development.
+== Client-side instructions ==
+Instructions on using the client scripts are temporarily hosted at http://www.multiscalelab.org/utilities/RemoteBoinc .
+Client Perl scripts need to be unpacked to some client-visible installation directory. Make sure your Perl installation fulfills the dependencies (use ''cpan'' or your distribution's package manager if not).
+For details on the chaining mechanism, please see the paper T. Giorgino, M. J. Harvey and G. De Fabritiis, ''Distributed computing as a virtual supercomputer: Tools to run and manage large-scale BOINC simulations'', Comp. Phys. Commun. 181, 1402 (2010).  [[http://boinc.berkeley.edu/rboinc.pdf pdf]].
+== Server-side instructions ==
+The main steps to install the RBoinc server components are:
+ * Set up or adapt an instance of the Apache web server on the boinc server (or change the boinc one) to serve the rboinc cgi and DAV paths. See the ''apache.conf'' example file provided with the distribution. We shall assume that Apache will serve at http://YOUR_SERVER:8383/rboinc_cgi
+ * Copy the rboinc ''server'' scripts into the cgi directory, and edit the configuration file to suit your site setup.
+ * You may want to revise the ''process'' script. It is invoked every time a WU is complete, to perform the submission of the next chain step.
+ * Customize the WU and result template files, as directed below. This will RBoinc-enable Boinc ''applications'' of your choice.
+ * If desired, install the SQL stored procedures (monitoring components).
+=== Annotating the WU template files ===
 First, workunit template files should be marked as RBoinc-enabled at the top.
 …
 == Annotating the result template files ==
+=== Annotating the result template files ===
 Result template files are annotated with RBoinc-specific tags
 …
 retrieve directory.
-For details on the chaining mechanism, please see the paper T. Giorgino, M. J. Harvey and G. De Fabritiis, ''Distributed computing as a virtual supercomputer: Tools to run and manage large-scale BOINC simulations'', Comp. Phys. Commun. 181, 1402 (2010).  [[http://boinc.berkeley.edu/rboinc.pdf pdf]]