28.07.2013 Views

Performance Analysis and Optimization of the Hurricane File System ...

Performance Analysis and Optimization of the Hurricane File System ...

Performance Analysis and Optimization of the Hurricane File System ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 7<br />

Macrobenchmark<br />

7.1 Purpose<br />

The purpose <strong>of</strong> <strong>the</strong> macrobenchmark experiment was to determine <strong>the</strong> impact <strong>of</strong> a realistic workload on<br />

<strong>the</strong> file system <strong>and</strong> to verify that <strong>the</strong> optimizations implemented during <strong>the</strong> microbenchmark experiments<br />

were beneficial to more realistic applications. A Web server workload was chosen but an actual Web server<br />

application, such as Apache Web Server or Micros<strong>of</strong>t Internet Information Server, was not used because <strong>the</strong><br />

K42 operating system lacked infrastructure to support more complex <strong>of</strong>f-<strong>the</strong>-shelf applications. Instead, a<br />

simple simulated Web server was executed to generate <strong>the</strong> file system workload, using traces from a real<br />

workload.<br />

7.2 Experimental Setup<br />

7.2.1 Workload<br />

The World Cup 1998 Web trace [5] was chosen as <strong>the</strong> workload. This Web trace has been studied by<br />

Arlitt <strong>and</strong> Jin [6], resulting in detailed quantitative analysis <strong>of</strong> <strong>the</strong> trace. The Web trace is a time-sorted<br />

aggregation <strong>of</strong> requests to all World Cup Web servers during <strong>the</strong> 1998 tournament. It includes both static<br />

<strong>and</strong> dynamic content requests. The complete Web trace consisted <strong>of</strong> 90 days <strong>of</strong> log files, however only a<br />

small portion <strong>of</strong> <strong>the</strong> Web trace was used for our experiments, namely from June 8, 1998, 10:00:01pm GMT<br />

to 10:21:02pm GMT. This date corresponds to day 45 <strong>of</strong> <strong>the</strong> Web trace <strong>and</strong> was chosen because it was <strong>the</strong><br />

middle point <strong>of</strong> <strong>the</strong> tournament. The more intense final games during <strong>the</strong> final days <strong>of</strong> <strong>the</strong> tournament were<br />

not used since <strong>the</strong>y targeted a smaller set <strong>of</strong> Web pages <strong>and</strong> files. They would not have <strong>of</strong>fered as much<br />

breadth as traces from <strong>the</strong> middle.<br />

The chosen Web log segment was reverse engineered to recreate <strong>the</strong> contents on disk. <strong>File</strong>s <strong>of</strong> correct<br />

66

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!