05.08.2014 Views

An Investigation into Transport Protocols and Data Transport ...



the data in order to increase collider efficiency.<br />

2.2 Analysis Methods<br />

The search for new exotic particles such as the Higgs boson becomes a statistical<br />

study of the probabilities that a particular event occurs. As such,<br />

large volumes of data must be generated in order to determine with high<br />

confidence that these rare events do actually occur.<br />

Also, Monte Carlo simulations must be produced in order to reduce statistical<br />

errors to an acceptable level. Generally, the size of such datasets is

equivalent to that of the filtered data from the detector.<br />
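The link between data volume and statistical confidence can be illustrated with a minimal sketch. It assumes simple Poisson counting statistics, where the relative uncertainty on N observed events is 1/sqrt(N); this assumption and the function name `relative_uncertainty` are illustrative and are not taken from the text.

```python
import math


def relative_uncertainty(n_events: float) -> float:
    """Relative statistical uncertainty on a count of n_events.

    Assumes Poisson counting: the standard deviation of the count is
    sqrt(N), so the relative uncertainty is sqrt(N) / N = 1 / sqrt(N).
    """
    if n_events <= 0:
        raise ValueError("n_events must be positive")
    return 1.0 / math.sqrt(n_events)


if __name__ == "__main__":
    # Each factor of 100 in sample size gains one factor of 10 in precision.
    for n in (100, 10_000, 1_000_000):
        print(f"N = {n:>9}: relative uncertainty = {relative_uncertainty(n):.4f}")
```

Under this assumption, halving the statistical error requires four times as many events, which is one reason why both the recorded and the simulated datasets grow so large.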

At present, a typical experiment will store all raw detector data centrally<br />

in tape archives, from which the data will be staged to disk when required.<br />

When an institute wishes to run analysis on a particular dataset, the data

is typically copied (replicated) from the central facility to the local

farm at the institute. Collaborating institutes on the experiment operate

more or less independently: should another institute also want

to analyse the same dataset, it will typically also replicate it to its local

facilities, unaware that an exact replica may be physically closer than that

at the detector.<br />
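The missed opportunity described above can be sketched as a lookup against a shared replica catalogue. This is a hypothetical illustration only: the `Replica` type, the catalogue layout, the site names and the use of round-trip time as the measure of "closeness" are all assumptions and do not describe any system named in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Replica:
    site: str       # hypothetical site name holding a copy of the dataset
    rtt_ms: float   # assumed round-trip time from the requesting institute


def closest_replica(catalogue: dict[str, list[Replica]], dataset: str) -> Replica:
    """Return the known replica of `dataset` with the lowest round-trip time."""
    replicas = catalogue.get(dataset, [])
    if not replicas:
        raise KeyError(f"no replica registered for dataset {dataset!r}")
    return min(replicas, key=lambda r: r.rtt_ms)


if __name__ == "__main__":
    # Hypothetical catalogue: the central archive plus a copy that a
    # collaborating institute has already replicated to its local farm.
    catalogue = {
        "run-0001": [
            Replica(site="central-archive", rtt_ms=150.0),
            Replica(site="neighbouring-institute", rtt_ms=12.0),
        ]
    }
    print(closest_replica(catalogue, "run-0001").site)
```

Without such a shared catalogue, each institute sees only the central copy and therefore always fetches from it, regardless of how many nearer replicas exist.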

Access patterns for datasets vary. Experimental data files typically have a

single creator; the initial production period lasts several weeks,

during which the files are modified as new information is added. Metadata is also

created, which describes the experiment. This may

be created by multiple individuals and may be modified and/or augmented

over time, even after the creation of the experimental data. The size of the

metadata is typically smaller than that of the experimental data.
