New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
126 CHAPTER 5. COMPUTER SCIENCE GRID STRATEGIES<br />
To distribute data <strong>the</strong> system creates tasks (see section 5.3.1) that are<br />
inserted into <strong>the</strong> system’s job queue with an assigned priority that is lower<br />
than analysis tasks. There<strong>for</strong>e, if a new analysis is queued it will be handled<br />
be<strong>for</strong>e <strong>the</strong> next copy task will be executed.<br />
Remarks<br />
To disable this feature set worker storage size to zero. This might be useful<br />
in local area networks where data is available through <strong>the</strong> network file system<br />
anyway and does not need to be copied.<br />
5.3.3 Job Execution<br />
In this section we describe how jobs are actually created and executed. This<br />
comprises starting <strong>of</strong> workers at target machines (that will do <strong>the</strong> actual computational<br />
work), creation and scheduling <strong>of</strong> tasks and <strong>the</strong> actual execution <strong>of</strong><br />
tasks, that is matching <strong>of</strong> tasks and workers. Fur<strong>the</strong>r, we give details about<br />
how workers get job specific data from <strong>the</strong> Grid.<br />
Worker Injections<br />
One <strong>of</strong> <strong>the</strong> most striking features <strong>of</strong> <strong>the</strong> QAD Grid approach is that <strong>the</strong>re is<br />
no need to (pre-)install client s<strong>of</strong>tware on <strong>the</strong> working machines. The only<br />
requirement is that <strong>the</strong> target machine provides SSH access. When a machine<br />
is added to <strong>the</strong> Grid <strong>the</strong> plat<strong>for</strong>m server logs into that machine, copies <strong>the</strong><br />
client s<strong>of</strong>tware and needed libraries to this target and executes it. The worker<br />
<strong>the</strong>n connects itself to <strong>the</strong> Grid and starts working (see section 5.4). We have<br />
tested this with different types <strong>of</strong> Linux, MacOS and Windows systems running<br />
a variety <strong>of</strong> SSH servers. If a target machine cannot provide SSH access<br />
<strong>the</strong> worker can <strong>of</strong> course be started manually on that machine. Ano<strong>the</strong>r possibility<br />
on Windows-based systems is <strong>the</strong> usage <strong>of</strong> W<strong>MI</strong> (Windows Management<br />
Instrumentation) which is also supported. This allows <strong>the</strong> QAD Grid Server<br />
to log into an windows machine and - as in <strong>the</strong> SSH case - remotely execute a<br />
program.<br />
The main advantage <strong>of</strong> this approach is that only <strong>the</strong> network (IP) address<br />
<strong>of</strong> a new client needs to be added to <strong>the</strong> QAD Grids client database and<br />
<strong>the</strong>re must exist a user account on that machine <strong>for</strong> <strong>the</strong> QAD Grid server to<br />
login. Then, <strong>the</strong> server can inject <strong>the</strong> client s<strong>of</strong>tware and start a worker on<br />
that machine without fur<strong>the</strong>r user interaction. These workers are <strong>the</strong>n fully<br />
controlled by <strong>the</strong> QAD plat<strong>for</strong>m server.<br />
Details <strong>of</strong> this injection process are as follows:<br />
1. The QAD Grid server establishes a SSH (Linux) or WMD (Windows)<br />
connection to <strong>the</strong> target machine.<br />
2. If no sub-directory “qad grid” exists within <strong>the</strong> clients temporary directory,<br />
it is created.<br />
3. The “qad grid/3rd party libraries” sub-directory is checked whe<strong>the</strong>r all<br />
libraries needed by this worker are available. If not, missing libraries are<br />
transferred via SCP.<br />
4. The “qad grid/workers” sub-directory is checked <strong>for</strong> <strong>the</strong> existence <strong>of</strong> <strong>the</strong><br />
worker to start. If it does not exists or <strong>the</strong> available version is outdated