| 95 | | {{{ |
| | 95 | |
| | 96 | {{{ |
| | 97 | <gpu_multiplier> GM </gpu_multiplier> |
| | 98 | }}} |
| | 99 | If your project uses GPUs, set this to roughly the ratio |
| | 100 | of GPU speed to CPU speed. |
| | 101 | Used in the calculation of job limits (see next 2 items). |
| | 102 | {{{ |
| | 103 | <max_wus_to_send> N </max_wus_to_send> |
| | 104 | }}} |
| | 105 | Maximum jobs returned per scheduler RPC is '''N*(NCPUS + GM*NGPUS)'''. |
| | 106 | You can use this to limit the impact of faulty hosts. |
| | 107 | Default is 10. |
| | 108 | {{{ |
| | 109 | <max_ncpus>N</max_ncpus> |
| | 110 | }}} |
| | 111 | An upper bound on NCPUS (default: 16) |
| | 112 | |
| | 113 | {{{ |
| | 114 | <daily_result_quota> N </daily_result_quota> |
| | 115 | }}} |
| | 116 | Each host has a field MRD in the interval [1 .. daily_result_quota]; |
| | 117 | it's initially daily_result_quota, |
| | 118 | and is adjusted as the host sends good or bad results. |
| | 119 | The maximum number of jobs sent to a given host in a 24-hour period is |
| | 120 | '''MRD*(NCPUS + GM*NGPUS)'''. |
| | 121 | You can use this to limit the impact of faulty hosts. |
| | 122 | |
| | 123 | == Job limits (advanced) === |
| | 124 | The following is a |
| | 125 | more adaptable way of expressing limits on the number of jobs in progress on a host. |
| | 126 | You can specify limits for specific apps, and for your projects as a whole. |
| | 127 | Within each of these, you can specify limits for CPU jobs, GPU jobs, or total. |
| | 128 | In the case of CPU and GPU jobs, you can specify whether the limit should be |
| | 129 | scaled by the number of devices present on the host. |
| | 130 | |
| | 131 | This uses a separate config file, '''config_aux.xml'''. |
| | 132 | The syntax is: |
| | 133 | {{{ |
| | 134 | <?xml version="1.0" ?> |
| | 135 | <config> |
| 116 | | }}} |
| 117 | | A more adaptable way of expressing limits on the number of jobs in progress on a host. |
| 118 | | You can specify limits for specific apps, and for your projects as a whole. |
| 119 | | Within each of these, you can specify limits for CPU jobs, GPU jobs, or total. |
| 120 | | In the case of CPU and GPU jobs, you can specify whether the limit should be |
| 121 | | scaled by the number of devices present on the host. |
| 122 | | |
| 123 | | {{{ |
| 124 | | <gpu_multiplier> GM </gpu_multiplier> |
| 125 | | }}} |
| 126 | | If your project uses GPUs, set this to roughly the ratio |
| 127 | | of GPU speed to CPU speed. |
| 128 | | Used in the calculation of job limits (see next 2 items). |
| 129 | | {{{ |
| 130 | | <max_wus_to_send> N </max_wus_to_send> |
| 131 | | }}} |
| 132 | | Maximum jobs returned per scheduler RPC is '''N*(NCPUS + GM*NGPUS)'''. |
| 133 | | You can use this to limit the impact of faulty hosts. |
| 134 | | Default is 10. |
| 135 | | {{{ |
| 136 | | <max_ncpus>N</max_ncpus> |
| 137 | | }}} |
| 138 | | An upper bound on NCPUS (default: 16) |
| 139 | | |
| 140 | | {{{ |
| 141 | | <daily_result_quota> N </daily_result_quota> |
| 142 | | }}} |
| 143 | | Each host has a field MRD in the interval [1 .. daily_result_quota]; |
| 144 | | it's initially daily_result_quota, |
| 145 | | and is adjusted as the host sends good or bad results. |
| 146 | | The maximum number of jobs sent to a given host in a 24-hour period is |
| 147 | | '''MRD*(NCPUS + GM*NGPUS)'''. |
| 148 | | You can use this to limit the impact of faulty hosts. |
| | 156 | </config> |
| | 157 | }}} |