![]() |
![]() |
||||
|
|
|
|
|
|
|
|
|
|||||
Using LSF to access the MINOS FNAL Batch Nodes |
The lsf batch system from Platform Computing is used to submit and monitor batch jobs. All the batch nodes have access to AFS and the full range of ups products that are installed on FNALU. You can access the MINOS Offline software that is installed on FNALU. Instructions on setting up the software are available. LSF carries a copy of your current environment to the submitted job. This can be problematic if you are relying on UPS products. In particular, it is recommended that your script contain the lines:
Use .csh if you are using tcsh as your shell. Replace
"snapshot" with a particular release name if
you want something other than the latest snapshot.
Note that the development release at FNAL rebuilds EVERY night at about 10pm
so if your batch job is running at that time it will probably crash.
To avoid this for long running jobs you should use a frozen or snapshot
release.
Some of the MINOS cluster nodes (minos14-minos25) have been configured
to allow running LSF batch jobs; these are accessible only from the
minos (1 day CPU) queue [2008-02: this is currently disabled].
In addition there are 30 Linux batch nodes available in the general
FNALU cluster (flxb01-flxb30). Nodes flxi06, flxb11, flxb35 are running
64-bit installations without compatibility libraries and should be avoided;
see the instructions below on how to do so.
The Linux nodes flxi02 and flxi03 are
for interactive use only and long running jobs will be killed on these
machines, but are acceptable machines for submitting LSF jobs.
QUEUE_NAME PRIO STATUS MAX JL/U JL/P JL/H NJOBS PEND RUN SUSP
30min 10 Open:Active - - 1 - 0 0 0 0
4hr 8 Open:Active - - 1 - 0 0 0 0
12hr 6 Open:Active - - 1 - 68 53 15 0
1day 4 Open:Active - - 1 - 73 39 34 0
minos 4 Open:Active - - 1 - 20 0 20 0
1day_ex 4 Open:Active - 4 1 - 0 0 0 0
4day 2 Open:Active - 5 1 - 0 0 0 0
8day 1 Open:Active - 2 1 - 0 0 0 0
To submit a batch job to one of the Linux nodes in the 4 hour queue
(excluding some nodes):
|
|
Send suggestions or comments to - The Pagemaster |