08.02.2013 Views

New Statistical Algorithms for the Analysis of Mass - FU Berlin, FB MI ...


CHAPTER 5. COMPUTER SCIENCE GRID STRATEGIES

To distribute data, the system creates tasks (see section 5.3.1) that are inserted into the system's job queue with a priority lower than that of analysis tasks. Therefore, if a new analysis is queued, it is handled before the next copy task is executed.
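The effect of giving copy tasks a lower priority can be illustrated with a small priority queue. This is only a sketch: the names `JobQueue`, `Task`, `ANALYSIS_PRIORITY` and `COPY_PRIORITY`, the numeric priority values, and the task labels are assumptions made for illustration and are not taken from the QAD Grid implementation.

```python
import heapq
import itertools
from dataclasses import dataclass, field

# Hypothetical priority levels: a smaller number is served first.
ANALYSIS_PRIORITY = 0
COPY_PRIORITY = 10


@dataclass(order=True)
class Task:
    priority: int
    seq: int                      # insertion order, breaks ties (FIFO)
    name: str = field(compare=False)


class JobQueue:
    """Minimal priority queue: lowest priority number is popped first."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()

    def put(self, name, priority):
        heapq.heappush(self._heap, Task(priority, next(self._counter), name))

    def get(self):
        return heapq.heappop(self._heap).name

    def __len__(self):
        return len(self._heap)


queue = JobQueue()
queue.put("copy dataset A to worker 1", COPY_PRIORITY)
queue.put("copy dataset A to worker 2", COPY_PRIORITY)
# An analysis arrives after the copy tasks were queued ...
queue.put("analyse dataset B", ANALYSIS_PRIORITY)

# ... but is still handed out before any pending copy task.
order = [queue.get() for _ in range(len(queue))]
print(order)
```

In this sketch the sequence counter keeps tasks of equal priority in the order they were queued, so copy tasks are not reordered among themselves.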

Remarks

To disable this feature, set the worker storage size to zero. This might be useful in local area networks where data is available through the network file system anyway and does not need to be copied.

5.3.3 Job Execution

In this section we describe how jobs are actually created and executed. This comprises starting workers on the target machines (which do the actual computational work), creating and scheduling tasks, and executing tasks, that is, matching tasks to workers. Further, we give details about how workers get job-specific data from the Grid.

Worker Injections

One of the most striking features of the QAD Grid approach is that there is no need to (pre-)install client software on the working machines. The only requirement is that the target machine provides SSH access. When a machine is added to the Grid, the platform server logs into that machine, copies the client software and the needed libraries to this target, and executes it. The worker then connects itself to the Grid and starts working (see section 5.4). We have tested this with different types of Linux, MacOS and Windows systems running a variety of SSH servers. If a target machine cannot provide SSH access, the worker can of course be started manually on that machine. Another possibility on Windows-based systems is the use of WMI (Windows Management Instrumentation), which is also supported. This allows the QAD Grid server to log into a Windows machine and, as in the SSH case, remotely execute a program.

The main advantage of this approach is that only the network (IP) address of a new client needs to be added to the QAD Grid's client database, and a user account must exist on that machine for the QAD Grid server to log in. Then the server can inject the client software and start a worker on that machine without further user interaction. These workers are then fully controlled by the QAD platform server.
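For the SSH case, the injection described above (log in, copy the client software, execute it) can be sketched as a sequence of standard `ssh` and `scp` invocations. The function name `build_injection_commands`, the user name, the address, and all paths below are hypothetical and chosen only for illustration; the sketch builds the command lines without running them, and it does not cover the WMI route or library and version checks.

```python
import posixpath
import shlex


def build_injection_commands(host, user, local_worker, remote_dir):
    """Return the argv lists a server could run to inject one worker.

    Assumption: the target offers SSH access and a POSIX shell. The
    commands are only constructed here; a caller could pass each one
    to subprocess.run(cmd, check=True) to actually execute it.
    """
    target = f"{user}@{host}"
    remote_worker = posixpath.join(remote_dir, posixpath.basename(local_worker))
    return [
        # 1. Log in and make sure the working directory exists.
        ["ssh", target, f"mkdir -p {shlex.quote(remote_dir)}"],
        # 2. Copy the worker program to the target machine.
        ["scp", local_worker, f"{target}:{remote_dir}/"],
        # 3. Start the worker remotely.
        ["ssh", target, shlex.quote(remote_worker)],
    ]


# Hypothetical example values (192.0.2.10 is a documentation address).
commands = build_injection_commands(
    host="192.0.2.10",
    user="qad",
    local_worker="build/worker.bin",
    remote_dir="/tmp/demo",
)
for cmd in commands:
    print(" ".join(cmd))
```

Keeping the command construction separate from execution makes the sequence easy to inspect or log before anything is run on a remote machine.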

Details of this injection process are as follows:

1. The QAD Grid server establishes an SSH (Linux) or WMI (Windows) connection to the target machine.

2. If no sub-directory “qad grid” exists within the client's temporary directory, it is created.

3. The “qad grid/3rd party libraries” sub-directory is checked to determine whether all libraries needed by this worker are available. If not, the missing libraries are transferred via SCP.

4. The “qad grid/workers” sub-directory is checked for the existence of the worker to start. If it does not exist or the available version is outdated
