12.07.2015 Views

Benchmarking in the Cloud - Parallel and Distributed Systems

Benchmarking in the Cloud - Parallel and Distributed Systems

Benchmarking in the Cloud - Parallel and Distributed Systems

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6. Devise a maximal scal<strong>in</strong>g factor maxAnScaleYYYY. The benchmark shouldnot scale <strong>the</strong> load fur<strong>the</strong>r than maxAnScaleYYYY.7. Also report <strong>the</strong> price of <strong>the</strong> services used so that services can be comparedif <strong>the</strong>y reach <strong>the</strong> maximal anScale.8. Periodically (for example once per year) revise <strong>and</strong> agree on a new (larger)maxAnScaleYYYY.9. Similarly to <strong>the</strong> Simple IaaS use case report elasticity by runn<strong>in</strong>g with avary<strong>in</strong>g load.In <strong>the</strong> next section we discusses <strong>the</strong> challenges for <strong>the</strong> hypo<strong>the</strong>tical benchmarkspresented above <strong>in</strong> more detail.6 The Art of Build<strong>in</strong>g <strong>and</strong> Runn<strong>in</strong>g a Good Benchmark<strong>in</strong> <strong>the</strong> <strong>Cloud</strong>We believe, that <strong>the</strong>re are four ma<strong>in</strong> steps for build<strong>in</strong>g <strong>and</strong> runn<strong>in</strong>g a goodbenchmark <strong>in</strong> <strong>the</strong> <strong>Cloud</strong>. These are Mean<strong>in</strong>gful Metric, Workload Design, WorkloadImplementation <strong>and</strong> Creat<strong>in</strong>g Trust. In this section we discuss <strong>the</strong> <strong>in</strong>herentchallenges <strong>in</strong> <strong>the</strong>se steps.6.1 Mean<strong>in</strong>gful MetricA metric is a function that transforms measured results <strong>in</strong>to a form that is easilyunderstood by <strong>the</strong> system analyst. The most simple metric is <strong>the</strong> runtime metric,which reports ei<strong>the</strong>r median, average, maximum or even m<strong>in</strong>imum runtimeamong transactions run by <strong>the</strong> SUT. When systems have to deal with concurrenttransactions, it makes more sense to consider a throughput metric report<strong>in</strong>g <strong>the</strong>maximal number of transactions adher<strong>in</strong>g to some runtime SLA. Some benchmarks(ma<strong>in</strong>ly those designed to support bus<strong>in</strong>ess decisions) also report <strong>the</strong> costof <strong>the</strong> underly<strong>in</strong>g system or <strong>the</strong> cost of runn<strong>in</strong>g a s<strong>in</strong>gle transaction at maximalload. There is an ongo<strong>in</strong>g debate if report<strong>in</strong>g cost makes sense [31]. Kossmannet al. have devoted a sequence of papers to this topic <strong>in</strong> its <strong>Cloud</strong> context [5, 2,32], argu<strong>in</strong>g that <strong>in</strong> this case cost should be <strong>the</strong> primary metric. The argumentis motivated by <strong>the</strong> fact that <strong>in</strong> <strong>the</strong>ory <strong>Cloud</strong> technology should provide <strong>in</strong>f<strong>in</strong>itescalability, which makes <strong>the</strong> classic throughput metric obsolete. However, <strong>in</strong> [32]<strong>the</strong> authors observe that (to no surprise) <strong>in</strong>f<strong>in</strong>ite scalability is an illusion <strong>and</strong>for most providers breaks down sooner or later due to bottlenecks <strong>in</strong> <strong>the</strong> SUTarchitecture. In that case it makes perfect sense to report how far <strong>the</strong> load canbe scaled up.Challenge 1: Price vs. Performance Metric. For <strong>the</strong> parallel load of <strong>the</strong>Simple IaaS use case we had decided to report <strong>the</strong> number of nodes required torun <strong>the</strong> 250 TPC-C <strong>in</strong>stances. As nodes of different vendors will most probablyhave different characteristics, <strong>the</strong> number of nodes is <strong>in</strong> fact not a suitable metric.Money is <strong>the</strong> only possible yardstick <strong>and</strong> <strong>the</strong>refore we have to report <strong>the</strong> price

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!