4th International Conference on Principles and Practices ... - MADOC

1. Identifying the primary factors that cause the poor throughput degradation behavior.
2. Investigating the effects of these factors on throughput.
3. Observing how changes to the algorithms and policies governing these factors affect the throughput degradation.

The remainder of this paper is organized as follows. Section 2 details our experiments to identify opportunities for improvement. Section 3 details our experimentation plan. Sections 4 and 5 report the results and discuss possible improvements. Section 6 discusses background information and highlights related work. Section 7 concludes the paper.

2. MOTIVATION

In 2001, Welsh et al. [23] reported three important trends that magnify the challenges facing Web-based applications. First, services are becoming more complex, with widespread adoption of dynamic content in place of static content. Second, the service logics “tend to change rapidly”. Thus, the complexity of development and deployment increases. Third, these services are deployed on general-purpose systems and not “carefully engineered systems for a particular service.” These trends are now common practice. Complex services, including entire suites of business applications, are now deployed using Web application servers running on commodity processors and open-source software. With this in mind, we conducted an experiment to observe the degradation behavior of Java application servers on an experimental platform similar to current common practice (i.e., Linux on an x86 system with a MySQL database and the JBoss application server). For detailed information about the experimental setup, refer to Section 3.2.

Initially, our experiments were conducted using the smallest workload allowed by SPECjAppServer2004, a standardized benchmark for measuring the performance of Java application servers. We set the maximum heap size to be twice as large as the physical memory (a 4 GB heap with 2 GB of physical memory in this case). We monitored the throughput delivered by the system.
We then gradually increased the workload until the system refused to service any requests. For comparison, we also conducted another experiment to observe the degradation behavior of the Apache Web server (we used the same computer system and SPECweb2005 to generate the request traffic). Since the two benchmarks report different throughput metrics (jobs per second for jAppServer2004 vs. connections per second for Web2005), we normalized the throughput and the workload to percentages. That is, we considered the maximum throughput delivered by a system during an execution as 100% (referred to as $t$) and the maximum workload, i.e., the workload at which the system completely refuses connections, as 100% (referred to as $w$). The degradation rate (referred to as $d$) is $d = \Delta t / \Delta w$.

The result of our comparison is shown in Figure 1. The result shows that JBoss is able to deliver high throughput for about 60% of the given workload. However, when the workload surpasses 60%, the throughput drops drastically. The system begins to refuse connections at 80% of the maximum workload. A drastic degradation in throughput (nearly 75%) occurs when the workload increases by only 20%. Thus, the degradation rate, $d$, is reported as 3.40 (the rounded figures quoted here, $0.75 / 0.20$, would give 3.75). Also notice that the value of $d$ for Apache is 1.69 (see Figure 1). A smaller value of $d$ means that the application is more failure-resistant to increasing workload. We also investigated the effect of larger memory on the throughput. Again, larger memory improves the maximum throughput (see Figure 2) but has very little effect on the degradation behavior.

According to [12], the degradation behavior experienced in our experiments is considered ungraceful because such behavior can lead to non-robust systems. Moreover, it gives very little time to administer recovery procedures. The authors of that work investigated the factors that affect the throughput degradation behavior of Java Servlets by examining operating system behavior.
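As a minimal sketch of the normalization and of the degradation rate $d = \Delta t / \Delta w$ described in this section, the following Java class computes $d$ from two operating points. The class and method names are hypothetical, and the inputs in `main` are the rounded percentages quoted in the text (used purely as illustration), not the measured data behind Figure 1.

```java
public final class DegradationRate {

    /** Normalizes a raw measurement to a fraction of the observed maximum (1.0 = 100%). */
    public static double normalize(double value, double max) {
        if (max <= 0.0) {
            throw new IllegalArgumentException("max must be positive");
        }
        return value / max;
    }

    /**
     * Degradation rate d = (drop in normalized throughput) / (rise in normalized workload).
     * All four arguments are fractions in [0, 1]. The throughput drop is taken as a
     * positive magnitude, so a larger d means a steeper decline.
     */
    public static double rate(double tBefore, double tAfter, double wBefore, double wAfter) {
        double deltaW = wAfter - wBefore;
        if (deltaW <= 0.0) {
            throw new IllegalArgumentException("workload must increase between the two points");
        }
        return (tBefore - tAfter) / deltaW;
    }

    public static void main(String[] args) {
        // ASSUMPTION: illustrative inputs only. Throughput is taken to fall from
        // 100% to 25% (a 75% drop) while workload rises from 60% to 80%.
        double d = rate(1.00, 0.25, 0.60, 0.80);
        System.out.println("d = " + d);
    }
}
```

Because both axes are normalized to the same percentage scale, values of $d$ computed this way can be compared across benchmarks that report different raw units.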
The authors of [12] found that thread synchronization is the most prominent factor at the OS level. We would like to point out that their work did not study the factors within the Java virtual machine. In contrast, our investigation concentrates specifically on the Java Virtual Machine level. Since Java Virtual Machines (JVMs) provide the execution environment for these application servers, we conjectured that the major factors that cause the throughput to degrade ungracefully reside in the virtual machines.

3. EXPERIMENTS

In this study, our main objectives are as follows:

Research Objective 1 (RO1): Identifying the major factors responsible for the rapidly declining throughput of Java application servers triggered by a small workload increase.

Research Objective 2 (RO2): Investigating how these factors affect the throughput of Java server applications.

Research Objective 3 (RO3): Observing how changes in the algorithms and policies controlling these factors affect the throughput of Java application servers. To achieve this objective, we manipulate the algorithms and policies governing the behaviors of these factors.

3.1 Benchmarks

There are two major components in our experimental objects: the application servers and the workload drivers. The selected application servers must meet the following criteria. First, they must be representative of real-world, widely used application servers. Second, we must have access to the source code to control and manipulate their execution context. Our effort began with the identification of server applications that fit the two criteria. We investigated several possibilities and selected the two open-source applications described below.

JBoss [13] is by far the most popular open-source Java application server (34% of market share and over five million downloads to date)⁶. It fully supports J2EE 1.4 with advanced optimizations, including an object cache to reduce the overhead of object creation.
Java Open Application Server (JOnAS)⁷ is another open-source application server. It is built as part of the ObjectWeb initiative. Its collaborators include France Telecom, INRIA, and Bull (a software development company).

In addition to identifying the applications, we also need to identify workload drivers that create a realistic client/server environment. We chose an application server benchmark from SPEC, jAppServer2004 [19], which is the standard benchmark for testing the performance of Java application servers. It emulates an automobile manufacturing company and its associated dealerships. Dealers interact with the system using web browsers (simulated by a driver program), while the actual manufacturing process is accomplished via RMI (also driven by the driver). This workload stresses the ability of Web and EJB containers to handle the complexities of memory management, connection pooling, passivation/activation, caching, etc.

⁶ From http://www.gridtoday.com/04/0927/103890.html
⁷ JOnAS: Java Open Application Server, available from http://jonas.objectweb.org
Figure 1: Throughput degradation behaviors of JBoss and Apache.

Figure 2: Throughput comparison with respect to heap sizes.

Workload of the benchmark is measured by transaction rate, which specifies the number of Dealer and Manufacturing threads. Throughput of the benchmark is measured in JOPS (job operations per second). The SPECjAppServer2004 Design Document [19] includes a complete description of the workload and the application environment in which it is executed.

3.2 Experimental Platforms

To deploy SPECjAppServer2004, we used four machines to construct the three-tier architecture. Since our experiments utilized both a uniprocessor system and a multiprocessor system, our configurations can be described as follows.

Uniprocessor application server (System A): The client machine is a dual-processor Apple PowerMac with 2x2 GHz PowerPC G5 processors and 2 GB of memory. The server is a single-processor 1.6 GHz Athlon with 1 GB of memory. The MySQL⁸ database server is a Sun Blade with dual 2 GHz AMD Opteron processors, as in the client machine, running Fedora Core 2 with 2 GB of memory.

Multiprocessor application server (System B): The client machine is the same as in the system above. However, we swapped the application server machine and the database server machine. Thus, the dual-processor Sun Blade is used as the application server, and the single-processor Athlon is used as the database server.

In all experiments, we used Sun's J2SE 1.5.0 on the server side, and the young generation area is set to the default value, which is 1/9 of the entire heap and has been shown to minimize the number of expensive mature collections. We ran all experiments in standalone mode with all non-essential daemons and services shut down.

The virtual machine is instrumented to generate trace information pertaining to the runtime behavior, such as object allocation information, reference assignment, execution thread information, and garbage collection (GC) information.
It is not uncommon for such trace files to be several gigabytes in size. These trace files are then used as inputs to our analysis tool, which performs lifetime analysis similar to the Merlin algorithm proposed by Hertz et al. [10]. The major difference between our approach and theirs is that ours uses off-line analysis and theirs uses on-line analysis. To obtain micro-architecture information, we utilize model-specific performance monitoring registers.

⁸ MySQL is available from http://www.mysql.com

3.3 Variables and Measures

We utilized several workload configurations to vary the level of stress on the selected applications. In all experiments, we increased the workload from the minimum value available to the maximum value that still allows the application to operate. For example, we began our experiment by setting the workload value of SPECjAppServer2004 to 1. In each subsequent experiment, we increased the workload value until JBoss encountered failure. The failure point is considered to be the maximum workload that the system (the combination of application server, JVM, OS, etc.) can handle.

As shown in Section 2, the throughput degrades dramatically as the workload increases. This degradation is likely caused by runtime overhead. To address RO1, we monitor the overall execution time ($T$), which is defined as:

$T = T_{app} + T_{gc} + T_{jit} + T_{sync}$

Here, $T_{app}$ is the time spent executing the application itself. $T_{gc}$ is the time spent on garbage collection. $T_{jit}$ is the time spent on runtime compilation. Many modern virtual machines use Just-In-Time (JIT) compilers to translate bytecode into native instructions when a method is first executed. This time does not include the execution of compiled methods; instead, it is the time spent on the actual method compilation and code cache management. Finally, $T_{sync}$ is the time spent on synchronization.
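The decomposition $T = T_{app} + T_{gc} + T_{jit} + T_{sync}$ can be sketched in Java as follows. This is an illustrative sketch rather than the instrumentation used in the study: the class, enum, and method names are hypothetical, the timings in `main` are made-up values, and "most sensitive" is interpreted here (as an assumption) as the component whose share of $T$ grows the most between a lighter and a heavier workload.

```java
import java.util.EnumMap;
import java.util.Map;

public final class ExecutionTimeBreakdown {

    /** The four components of T = T_app + T_gc + T_jit + T_sync. */
    public enum Component { APP, GC, JIT, SYNC }

    /** Builds a breakdown from per-component times (any consistent unit, e.g. seconds). */
    public static Map<Component, Double> of(double app, double gc, double jit, double sync) {
        Map<Component, Double> times = new EnumMap<>(Component.class);
        times.put(Component.APP, app);
        times.put(Component.GC, gc);
        times.put(Component.JIT, jit);
        times.put(Component.SYNC, sync);
        return times;
    }

    /** Overall execution time T: the sum of all components. */
    public static double total(Map<Component, Double> times) {
        double sum = 0.0;
        for (double t : times.values()) {
            sum += t;
        }
        return sum;
    }

    /** Fraction of T spent in one component. */
    public static double share(Map<Component, Double> times, Component c) {
        return times.get(c) / total(times);
    }

    /**
     * ASSUMPTION: sensitivity is measured as the growth in a component's share of T
     * between a lighter and a heavier workload; the paper does not prescribe this metric.
     */
    public static Component mostSensitive(Map<Component, Double> light,
                                          Map<Component, Double> heavy) {
        Component best = Component.APP;
        double bestGrowth = Double.NEGATIVE_INFINITY;
        for (Component c : Component.values()) {
            double growth = share(heavy, c) - share(light, c);
            if (growth > bestGrowth) {
                bestGrowth = growth;
                best = c;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Hypothetical timings for illustration; not measurements from the paper.
        Map<Component, Double> light = of(80, 10, 5, 5);
        Map<Component, Double> heavy = of(90, 90, 5, 15);
        System.out.println("T(light) = " + total(light));
        System.out.println("T(heavy) = " + total(heavy));
        System.out.println("Most sensitive component: " + mostSensitive(light, heavy));
    }
}
```

Comparing shares rather than absolute times keeps the comparison meaningful when the total execution time itself changes with the workload.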
We monitored synchronization operations such as lock/unlock and notify/wait, as well as the number of thread yields due to lock contention. We chose these time components because they have historically been used to measure the performance of Java Virtual Machines [2]. By measuring the execution time of each runtime function, we can identify the function that is most sensitive to the increasing workload. The result of this research objective is used as the focal point in RO2.

To address RO2, we further investigated the runtime behaviors of these factors. Once again, we varied the workload, but this time we also measured other performance parameters, such as the number of page faults, in addition to the throughput. These parameters give us more insight into the effect of these factors on the throughput. Specifically, we closely examined the governing policies of these runtime factors (causes) to gain more understanding of the effects they have on the throughput.

To address RO3, we conducted experiments that adjust both the fundamental algorithms and the policies used by the runtime factors and observed their ef-
ISBN: 3-939352-05-5 ISBN: 978-3-939