4th International Conference on Principles and Practices ... - MADOC

More documents

Recommendations

Info

Experiences with the Development of a Reverse Engineering Tool for UML Sequence Diagrams: A Case Study in Modern Java Development Matthias Merdes EML Research gGmbH Villa Bosch Schloss-Wolfsbrunnenweg 33 D-69118 Heidelberg, Germany Dirk Dorsch EML Research gGmbH Villa Bosch Schloss-Wolfsbrunnenweg 33 D-69118 Heidelberg, Germany @emlr.villa-bosch.de @emlr.villa-bosch.de ABSTRACT The development of a tool for reconstructing UML sequence diagrams from executing Java programs is a challenging task. We implemented such a tool designed to analyze any kind of Java program. Its implementation relies heavily on several advanced features of the Java platform. Although there are a number of research projects in this area usually little information on implementation-related questions or the rationale behind implementation decisions is provided. In this paper we present a thorough study of technological options for the relevant concerns in such a system. The various options are explained and the tradeoffs involved are analyzed. We focus on practical aspects of data collection, data representation and meta-model, visualization, editing, and export concerns. Apart from analyzing the available options, we report our own experience in developing a prototype of such a tool in this study. It is of special interest to investigate systematically in what ways the Java platform facilitates (or hinders) the construction of the described reverse engineering tool. Categories and Subject Descriptors D.2.2 [Software Engineering]: Design Tools and Techniques – object-oriented design methods, D.2.7 [Software Engineering]: Distribution, Maintenance, and Enhancement – reverse engineering, documentation. General Terms Algorithms, Documentation, Design, Experimentation Keywords UML models, sequence diagrams, reverse engineering, Java technology Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. PPPJ 2006, August 30 – September 1, 2006, Mannheim, Germany. Copyright 2006 ACM …$5.00. 1. INTRODUCTION Due to the increasing size and complexity of software applications the understanding of their structure and behavior has become more and more important. Proper specification and design activities are known to be important in producing understandable software. If such specification and design artifacts are unavailable or of poor quality reverse engineering technologies can significantly improve understanding of the design of an existing deployed software system and in general support debugging and maintenance. While modern CASE tools usually support the reconstruction of static structures, the reverse engineering of dynamic behavior is still a topic of on-going research [20], [25]. The development of a tool supporting the reconstruction of the behavior of a running software system must address the major areas of data collection from a (running) system, representation of this data in a suitable meta-model, export of the meta-model’s information or its graphical representation as well as postprocessing and visualization aspects. These core areas and their mutual dependencies are shown in Figure 1. Clearly, all conceptual components depend on the meta-model. In addition, a visualization mechanism can be based on a suitable export format as discussed in sections 4 and 5. While this figure illustrates the main conceptual components of our sequence diagram reengineering tool a symbolic view of its primary use can be seen in Figure 2: The main purpose of such a tool is to provide a mapping from a Java program to a UML sequence diagram. The various relevant options will be discussed in detail in the following sections. Recurrent technical topics include meta-model engineering, aspect-oriented technologies, XML technologies – especially in the areas of serialization and transformation – and vector graphics. Data Collection Meta-Model Visualization Export Figure 1. Conceptual components with dependencies 125
UML sequence diagrams are among the most widely used diagrams of the Unified Model Language (UML) [32]. The UML is now considered the lingua franca of software modeling supporting both structural (static) and behavioral (dynamic) models and their representation as diagrams. Behavioral diagrams include activity, communication, and sequence diagrams. Such sequence diagrams are a popular form to illustrate participants of an interaction and the messages between these participants. They are widely used in specification documents and testing activities [24] as well as in the scientific and technical literature on software engineering. Sequence diagrams [32] are composed of a few basic and a number of more advanced elements. The basic ingredients of a sequence diagram are illustrated in a very simple example in the right part of Figure 2 along with their respective counterparts in the Java source code on the left-hand side. In such a diagram participants are shown along the horizontal dimension of the diagram as so-called ‘life-lines’. In the example, the two participants are ‘Editor’ and ‘Diagram’. These life-lines are connected by arrows symbolizing the messages exchanged between participants. The messages are ordered chronologically along the vertical dimension. In the example, two messages from Editor to Diagram are depicted, namely the constructor message ‘new Diagram()’ and the ‘open()’ message. More advanced concepts (not shown in the figure) such as modeling alternatives, loops, and concurrent behavior, can be factored out into so-called ‘fragments’ for modularization and better readability. Figure 2. Behavior as Java source code and sequence diagram The reconstruction of the behavior of a software system has been studied extensively both in the static case (from source or byte code) [36], [37], [38] and in the dynamic case (from tracing running systems) [6], [33], [34]. [42] and [7] focus more on interaction with and understanding of sequence diagrams, respectively. An overview of approaches is provided by [25] and [20]. Despite this considerable amount of work there is often little information on implementation-centric questions or the rationale behind implementation decisions. Our study is intended to remedy this lack of such a systematic investigation and is motivated by our experiences in implementing our own sequence diagram reengineering tool. This paper has two main purposes. Firstly, we describe and analyze the possible technological options for the required areas. We also report the lessons learned by our implementation. In this way, the more abstract analysis based on theoretical considerations and the technical and scientific literature is verified and complemented by our own practical experience. The remainder of this paper is organized as follows. Section 2 explores methods to collect relevant data and section 3 describes the choices for representation of this data using a suitable metamodel. We describe options for visualization and model or graphics export in section 4 and 5, respectively. 2. Data Collection In this section we will discuss technologies for retrieving information from Java software systems with the purpose of generating instances of a meta-model for UML sequence diagrams. We focus on dynamic (or execution-time) methods but cover static (or development-time) methods as well for the sake of completeness. Static methods gather information from a nonrunning, (source or byte) code-represented system. Dynamic methods on the other hand record the interaction by observing a system in execution. Data collection requires a mechanism for filtering relevant execution-time events which supports a finegrained selection of method invocations. 2.1 Development-time Methods 2.1.1 Source Code Based Using the source code for collecting information about the interaction within an application will have at least one disadvantage: one must have access to the source code. Nevertheless source code analysis is a common practice in the reverse engineering of software systems and supported by most of the available modeling tools. It should be mentioned that the analysis of source code will provide satisfactory results for static diagrams (e.g., class diagrams), but the suitability for the dynamic behavior of an application is limited. If one is interested in a sequence diagram in the form of a common forward engineered diagram (i.e., a visualization of all possible branches of the control flow in the so-called CombinedFragment [32] of the UML), source code analysis will fulfill this requirement. In [37] Rountev, Volgin, and Reddoch introduce an algorithm which maps the control flow to these CombinedFragments. If the intention of the reverse engineering is to visualize the actual interaction any approach of static code analysis is doomed to fail, since it is inherently not possible to completely deduce the state of a system in execution by examining the source code only without actually running the system. Obvious problems include conditional behavior, late binding, and sensor or interactive user input. 126
Page 1 and 2:
Edited by: Ralf Gitzel, Markus Alek
Page 3 and 4:
Message from the Chairs Dear confer
Page 5 and 6:
Sam Midkiff, Purdue University (USA
Page 7 and 8:
Session F: Novel Uses of Java______
Page 10 and 11:
The Project Maxwell Assembler Syste
Page 12 and 13:
tomated assembler and disassembler
Page 14 and 15:
memory address offset mnemonic argu
Page 16 and 17:
5.2 Instruction Templates The assem
Page 18 and 19:
ange for a very short restart/debug
Page 20 and 21:
Tatoo: an innovative parser generat
Page 22 and 23:
In the grammar file an error termin
Page 24 and 25:
Figure 3: Push parsers and lexers L
Page 26 and 27:
eturn visit((Expr)p1, param); } R v
Page 28:
Session B Program and Performance A
Page 31 and 32:
Table 1: Use of interfaces and deco
Page 33 and 34:
Table 3: Comparison of the situatio
Page 35 and 36:
will trigger the creation of many n
Page 37 and 38:
Appendix A 10 9 8 7 6 5 4 3 2 1 0 1
Page 39 and 40:
Throughout this paper, we make no a
Page 41 and 42:
instrumented program and obtained t
Page 43 and 44:
Figure 5: Investigation of garbage
Page 45 and 46:
to the same category—again, this
Page 47 and 48:
Investigating Throughput Degradatio
Page 49 and 50:
Figure 1: Apache. Throughput degrad
Page 51 and 52:
Minor GC Full GC Workload # of Avg.
Page 53 and 54:
Figure 6: Comparing memory and pagi
Page 55 and 56:
GenRC on the other hand, allows the
Page 58:
Session C Mobile and Distributed Sy
Page 61 and 62:
ing a continuous buffer for raw dat
Page 63 and 64:
the data arrival speed is more impo
Page 65 and 66:
public class StreamingClient { publ
Page 67 and 68:
importance of β in our aggregation
Page 69 and 70:
Enabling Java Mobile Computing on t
Page 71 and 72:
A strongly mobile thread has the ab
Page 73 and 74:
a context switch occur. The invoked
Page 75 and 76:
above phases. Now, the new stack ha
Page 77 and 78:
5 frames 15 frames 25 frames Pure d
Page 79 and 80:
Juxta-Cat: A JXTA-based platform fo
Page 81 and 82: Figure 1: JXTA Architecture. 3. JUX
Page 83 and 84: • Send a discovery query to the p
Page 85 and 86: The best candidate is then the one
Page 87 and 88: Local Hosts=2 Hosts=4 Hosts=8 Hosts
Page 90: Session D Resource and Object Manag
Page 93 and 94: Enterprise Edition (J2EE). It enabl
Page 95 and 96: As pictured in Figure 4, JDOSecure
Page 97 and 98: Methods of a PersistenceManager, th
Page 99 and 100: Datenmodell abstrahiert user role,
Page 101 and 102: An Extensible Mechanism for Long-Te
Page 103 and 104: missing object references in the de
Page 105 and 106: 4.2 Mapping the name spaces of the
Page 107 and 108: (a) (b) ColorSliderPanel0 color
Page 109 and 110: 8. REFERENCES [1] Abu-Ghazaleh, N.,
Page 111 and 112: permission by process. In other wor
Page 113 and 114: The processor then refers to the en
Page 115 and 116: 1: // Protection domain 2: struct p
Page 117 and 118: Table 3: Frequency of JNI calls tha
Page 119 and 120: [13] Intel Corporation. IA-32 Intel
Page 121 and 122: 01 try { 02 ... 03 } finally { 04 t
Page 123 and 124: 2. FRAMEWORK The Framework for Unif
Page 125 and 126: ManagedInputStream ResourceGroup Ma
Page 127 and 128: 18. throw ‡ 1. release 17. cleanu
Page 129 and 130: 3. FUTURE WORK The ability to split
Page 131: 124
Page 135 and 136: diagrams, and behavioral (dynamic)
Page 137 and 138: the official scripting language for
Page 139 and 140: Apache's XML Graphics Project. A cu
Page 141 and 142: Java Debug Interface (JDI). In Revi
Page 143 and 144: 1.3 JML 1.3.1 Overview JML, the Jav
Page 145 and 146: The Usage of CANAPA is fairly strai
Page 147 and 148: *@non_null@*/ String str = attribut
Page 149 and 150: 142
Page 151 and 152: parallel and distributed versions o
Page 153 and 154: RingElem CextendsRingElem GenPolyno
Page 155 and 156: RingElem +isZERO():bolean CextendsR
Page 157 and 158: class constructors with the actual
Page 159 and 160: plemented via a distributed hash ta
Page 161 and 162: which are common nowadays. The reus
Page 163 and 164: Derivative Contract Component Repos
Page 165 and 166: stepwise execution create Derivativ
Page 167 and 168: 5.4.1 Structural Constraints Struct
Page 169 and 170: into the conceptual foundation of n
Page 171 and 172: different implementation and is mor
Page 173 and 174: the service from the root. It allow
Page 175 and 176: model: 1. Servlet [6] adapter compo
Page 177 and 178: y possibly different service, where
Page 179 and 180: or the fact that widgets extend ser
Page 181 and 182: 174
Page 183 and 184:
The idea of Java 5.0 type inference
Page 185 and 186:
Source := (class | interface)∗ cl
Page 187 and 188:
5. IMPLEMENTATION In order to prese
Page 189 and 190:
Infinite Streams in Java Dominik Gr
Page 191 and 192:
} private static LinkedStream fibon
Page 193 and 194:
of the f stream in order to compute
Page 195 and 196:
Interaction among Objects via Roles
Page 197 and 198:
(a) (b) (c) (d) (e) Figure 1: The p
Page 199 and 200:
class Printer { private int totalPr
Page 201 and 202:
Experiences of using the Dagstuhl M
Page 203 and 204:
Metric Definition Java .class files
Page 205 and 206:
scription of the metric values. Non
Page 207 and 208:
Figure 1: Results from the J-vestig
Page 209 and 210:
an error is subsequently generated
Page 211 and 212:
3. BASIC TERMINOLOGY As object-orie
Page 213 and 214:
• subtyping • (implementation)
Page 215 and 216:
Improving the Quality of Programmin
Page 217 and 218:
Results of online checks (shortened
Page 219 and 220:
212
Page 221 and 222:
214
Page 223 and 224:
Experiences With Hierarchy-Based Co
Page 225 and 226:
Sub View Child DataField 0..n Ke
Page 227 and 228:
Figure 8 - Actual State Charts elem
Page 229 and 230:
associations, which may result in r
Page 231 and 232:
M3PS: A Multi-Platform P2P System B
Page 233 and 234:
In following, we will explain in de
Page 235 and 236:
Figure 10. JDP in Windows XP. Figur
Page 237 and 238:
Mapping Clouds of SOA- and Business
Page 239 and 240:
the technical SOA events (from the
Page 241 and 242:
component and the business process
Page 243 and 244:
Figure 13. “Business Value of Dat
Page 245 and 246:
Solaris of SUN. This platform provi
Page 247 and 248:
status change of an application is
Page 249 and 250:
coming up, is presently discussing
Page 251:
ISBN: 3-939352-05-5 ISBN: 978-3-939
show all

4th International Conference on Principles and Practices ... - MADOC

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?