4th International Conference on Principles and Practices ... - MADOC
instrumented program and obtained the dynamic execution trace which includes concept information, we are now in a position to perform some dynamic analysis.

3.2 Dynamic Analysis for Concept Proportions

The first analysis simply processes the dynamic concept trace and calculates the overall amount of time spent in each concept. (At this stage we do not permit nesting of concepts, so code can only belong to a single concept at any point in execution time.) This analysis is similar to standard function profiling, except that it is now based on specification-level features of programs, rather than low-level syntactic features such as function calls.
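As a sketch, the per-concept timing calculation described above might look like the following. The trace format used here — one "&lt;timestamp&gt; &lt;concept&gt;" event per line, meaning execution enters that concept until the next event — is an assumption for illustration, not necessarily the tool's actual format:

```python
from collections import defaultdict

def concept_proportions(trace_lines):
    """Sum the time spent in each concept from a flat (non-nested) trace.

    Assumed trace format (hypothetical): one event per line,
    "<timestamp> <concept>", where execution belongs to <concept>
    from that timestamp until the next event's timestamp.
    """
    totals = defaultdict(float)
    prev_time, prev_concept = None, None
    for line in trace_lines:
        ts_str, concept = line.split()
        ts = float(ts_str)
        if prev_concept is not None:
            # Attribute the elapsed interval to the concept we were in.
            totals[prev_concept] += ts - prev_time
        prev_time, prev_concept = ts, concept
    return dict(totals)
```

Because concepts cannot nest at this stage, a single "current concept" variable suffices; nested concepts would require a stack instead.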
The tool outputs its data in a format suitable for use with the KCachegrind profiling and visualization toolkit [1]. Figure 3 shows a screenshot of the KCachegrind system, with data from the BasicPredictors program. It is clear that most of the time is spent in the system concept. It is also interesting to note that predictor context is far more expensive than predictor compute. This is a well-known fact in the value prediction literature [19].
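A minimal emitter for the callgrind profile format that KCachegrind loads might look as follows. This is a sketch, not the paper's actual tool: the event name "Ticks" and the use of concept names in place of function names are our illustrative choices, and the line position is left as 0 since concepts are not tied to single source lines:

```python
def callgrind_dump(totals):
    """Serialize per-concept time totals as a minimal callgrind
    profile: an 'events:' header, then for each concept an 'fn='
    name line followed by a '<position> <cost>' line."""
    lines = ["events: Ticks"]
    for concept, ticks in sorted(totals.items()):
        lines.append("fn=%s" % concept)
        lines.append("0 %d" % ticks)  # position 0: no real source line
    return "\n".join(lines) + "\n"
```

Writing the returned string to a file and opening it in KCachegrind would give a per-concept cost breakdown analogous to Figure 3.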
3.3 Dynamic Analysis for Concept Phases

While the analysis above is useful for determining the overall time spent in each concept, it gives no indication of the temporal relationship between concepts. It is commonly acknowledged that programs go through different phases of execution, which may be visible at the microarchitectural [7] and method [9, 15] levels of detail. It should be possible to visualize phases at the higher level of concepts also.
The visualization in Figure 4 therefore plots concepts against execution time. The different concepts are highlighted in different colours, with time running horizontally from left to right. Again, this information is extracted from the dynamic concept trace using a simple perl script, this time visualized as HTML within any standard web browser. There are many algorithms for phase detection, but even by simple observation it is possible to see three phases in this program. The startup phase has long periods of system (opening and reading files) and predictor context (setting up the initial table) concept execution. This is followed by a periodic phase of prediction concepts, alternating between predictor context and predictor compute. Finally there is a result report and shutdown phase.
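The kind of HTML timeline bar described above can be sketched as follows. The paper's tool is a perl script; this Python stand-in only illustrates the idea, and the exact markup (inline-block spans with percentage widths) is our assumption, not the script's actual output:

```python
def phase_bar_html(intervals, colours):
    """Render a concept timeline as a single horizontal HTML bar.

    intervals: list of (concept, duration) in execution order.
    colours:   mapping from concept name to a CSS colour.
    Each interval becomes a coloured span whose width is its share
    of total execution time, so phases appear as colour bands.
    """
    total = float(sum(d for _, d in intervals))
    cells = []
    for concept, dur in intervals:
        width = 100.0 * dur / total
        cells.append(
            '<span title="%s" style="display:inline-block;'
            'width:%.2f%%;background:%s">&nbsp;</span>'
            % (concept, width, colours.get(concept, "grey")))
    return '<div style="font-size:0">%s</div>' % "".join(cells)
```

Opening the returned HTML in any browser gives a left-to-right colour strip in which startup, periodic, and shutdown phases are visible as distinct banding patterns.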
3.4 Applying this Information

How can these visualizations be used? They are ideal for visualization and program comprehension. They may also be useful tools for debugging (since concept anomalies often indicate bugs [22]) and profiling (since they show where most of the execution time is spent).
Extensions are possible. At the moment we only draw a single bar. It would be necessary to move to something resembling a Gantt chart if we allow nested concepts (so a source code entity can belong to more than one concept at once) or if we have multiple threads of execution (so more than one concept is being executed at once).
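One way such a Gantt-chart extension could represent nested concepts is sketched below. The enter/exit event format is hypothetical, and the sketch assumes concepts are properly nested and not re-entered while still open:

```python
def gantt_rows(trace):
    """Turn a nested concept trace into rows for a Gantt-style chart.

    trace: list of (time, event, concept), event is "enter" or "exit"
           (an assumed format for illustration).
    Each concept gets its own row of (start, end) intervals, so code
    running inside a nested concept appears on several rows at once.
    """
    rows, open_at = {}, {}
    for time, event, concept in trace:
        if event == "enter":
            open_at[concept] = time
        else:
            # Close the interval on this concept's own row.
            rows.setdefault(concept, []).append((open_at.pop(concept), time))
    return rows
```

A nested garbage collection concept inside the system concept would then occupy its own row while the enclosing system row remains active, which is exactly what a single bar cannot express.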
4. DYNAMIC ANALYSIS FOR A LARGE JAVA PROGRAM
The second case study uses Jikes RVM [2], which is a reasonably large Java system. It is a production-quality adaptive JVM written in Java. It has become a significant vehicle for JVM research, particularly into adaptive compilation mechanisms and garbage collection. All the tests reported in this section use Jikes RVM version 2.4.4 on IA32 Linux.

A common complaint from new users of Jikes RVM is that it is hard to understand how the different adaptive runtime mechanisms operate and interact. This case study therefore selects some high-level concepts from the adaptive infrastructure, thus enabling visualization of runtime behaviour. After some navigation of the Jikes RVM source code, we inserted concept tags around a few key points that encapsulate adaptive mechanisms like garbage collection and method compilation. Note that all code not in such a concept (both Jikes RVM code and user application code) is in the default system concept.
4.1 Garbage Collection

Figure 5 shows concept visualizations of two runs of the _201_compress benchmark from SPEC JVM98. The top run has an initial and maximum heap size of 20MB (-Xms20M -Xmx20M), whereas the bottom run has an initial and maximum heap size of 200MB. It is clear that garbage collection occurs far more frequently in the smaller heap, as might be expected. In the top run, the garbage collector executes frequently and periodically, doing a non-trivial amount of work as it compacts the heap. In the bottom run, there is a single trivial call to System.gc() as part of the initial benchmark harness code. After this, garbage collection is never required, so we infer that the heap size is larger than the memory footprint of the benchmark.

Many other garbage collection investigations are possible. So far we have only considered varying the heap configuration. It is also possible to change the garbage collection algorithms in Jikes RVM, and determine from concept visualizations what effect this has on runtime performance.
4.2 Runtime Compilation

In the investigation above, the compilation concept only captures optimizing compiler behaviour. However, since Jikes RVM is an adaptive compilation system, it has several levels of compilation. The cheapest compiler to run (but the one that generates the least efficient code) is known as the baseline compiler. This is a simple macro-expansion routine from Java bytecode instructions to IA32 assembler. Higher levels of code efficiency (and corresponding compilation expense!) are provided by the sophisticated optimizing compiler, which can operate at different levels since it has many flags to enable or disable various optimization strategies. In general, Jikes RVM initially compiles application methods using the baseline compiler. The adaptive monitoring system identifies 'hot' methods that are frequently executed, and these become candidates for optimizing compilation. The hottest methods should be the most heavily optimized.
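The baseline-then-optimize policy just described can be sketched as a toy promotion decision. The sample-based hotness measure and the threshold here are purely illustrative, not Jikes RVM's actual cost/benefit model:

```python
def plan_recompilation(sample_counts, total_samples, hot_fraction=0.01):
    """Toy adaptive-compilation decision: methods whose share of
    timer samples exceeds hot_fraction are promoted from the
    baseline compiler to the optimizing compiler.

    sample_counts: method name -> number of samples landing in it.
    Returns the sorted list of methods to recompile.
    """
    return sorted(
        m for m, n in sample_counts.items()
        if n / float(total_samples) > hot_fraction)
```

In the real system the decision also weighs estimated compilation cost against expected future execution time, and hotter methods may be promoted through successively higher optimization levels rather than in a single step.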