Context Navigation

Changes between Version 1 and Version 2 of JobSizeMatching

-                      v1
+                      v2
   * If we satisfy the request for a particular resource and the best app version
     uses that resource, we clear the entry.
+New approach:
+Do it one resource at a time (GPUs first).
+For each resource:
+ * For each app, find the best app version and the best reliable app version
+ * For each of these app versions, find the expected speed
+   (taking on-fraction etc. into account).
+   Based on this, and the statistics of the host population,
+   decide what size job to send for this resource.
+ * Scan the job array, starting at a random point.
+   Make a list of jobs for which an app version is available,
+   and that are of the right size.
+ * Sort this list by a "score" that combines the above criteria
+   (reliable, beta, previously infeasibly, locality scheduling lite).
+ * Scan the list; for each job
+  * Make sure it's still in the array
+  * Do quick checks
+  * Lock entry and do slow checks
+  * Send job
+  * Leave loop if resource request is satisfied or we're out of disk space