Automated Formal Static Analysis and Retrieval of Source Code - JKU

More documents

Recommendations

Info

3.2. INTEGRATION OF MINDBREEZE CODE SEARCH INTO MINDBREEZE ENTERPRISE SEARCH37 3.2.1.2 Information Structuring The tagger and the parser work together in order to retrieve as much information (static and dynamic) as possible for a hit-type. Because the information retrieved by the parser and the tagger does not have a uniform representation, we have to unify the representation of their output data. This issues can be solved by structuring the information in a file with XML structure. The benefits gained are: • we keep the structure of the document apart from the content; • we obtain a lot of structured information. Therefore a XML structure is given to the CTAGS output file, a XML element for each CTAGS output file entry. The information stored for each element (corresponding to a hit-type) is enriched by the parser, the only hard point is to add information to the right hit-type. We came up with the following XML structure for the indexing needs: Example 3.2. ... int x = 6; ... The Document element attributes have the following meaning: • category – the name of the data source; • categoryclass – the type of the hit-type analyzed;
38CHAPTER 3. CODE SEARCH INTEGRATION FACILITY INTO MINDBREEZE ENTERPRISE SEARCH • catinst – the directory which is crawled and indexed; • key – the unique identifier of the hit-type; • security token – unique identifier used for authorization purposes; • title – the title of the hit-type. Each Document element has a Metadata and a Content nodes and each Metadata node contains a set specific metadata with key, value attributes. 3.2.1.3 Indexing In the MCS system, indexing means storing in an index (dictionary) data obtained from the Filter Service such that the Query Service is able to perform search within it. Indexing is done by issuing a FilterService object and an indexing method. Example 3.3. try{ FilterService filter = null; filter = InitializeMindbreeze(); filter.indexRawData(content.getBytes(), metaData, indexURL, "CodeSearch", catInst, key, categoryClass, "txt", title, getCalendarFromDate(new Date()), getCalendarFromDate(new Date()), null); catch (ServiceException ex) {} catch (MalformedURLException e) {}} ... The method indexRawData is called once per translation unit (file). Its parameters are filled in by parsing the XML files previously created (see Section 3.2.1.2). 3.2.1.4 Creating and Referring Hit-Types Unique Keys The hit-types metadata like category, category instance, language, return type, type, etc. do not uniquely identify a hit-type. Therefore, a unique identification of the hit-type has to exist for the scenarios when the user or system requests it for operations: displaying it in the client together with specific metadata (client requests) or insertion, update, deletion (system requests, more precisely IndexService requests). These scenarios involve working with an inverted index (index), therefore there must exist hit-type unique identification. Moreover, the uniqueness property of the hit-type key has to be related to the category and categoryinstance metadata of the object.
Page 1: J O H A N N E S K E P L E R U N I V
Page 5: Abstract In this thesis two approac
Page 8 and 9: viii
Page 10 and 11: Chapter 1 Introduction From the han
Page 12 and 13: 3 Contributions of the Thesis The s
Page 14 and 15: Next section starts by describing t
Page 16 and 17: Chapter 2 Program Verification by S
Page 18 and 19: 2.1. BACKGROUND 9 From the practica
Page 20 and 21: 2.1. BACKGROUND 11 A program is see
Page 22 and 23: 2.2. FORWARD SYMBOLIC EXECUTION IN
Page 28 and 29: 2.3. THE SIMPLIFICATION OF THE VERI
Page 30 and 31: 2.4. IMPLEMENTATION AND EXAMPLES 21
Page 36 and 37: Chapter 3 Code Search Integration F
Page 38 and 39: 3.1. BACKGROUND 29 Figure 3.1: Sour
Page 40 and 41: 3.1. BACKGROUND 31 The programming
Page 42 and 43: 3.2. INTEGRATION OF MINDBREEZE CODE
Page 62 and 63: Chapter 4 Conclusions In this thesi
Page 64 and 65: 55 How the database is used. For ea
Page 66 and 67: Bibliography [***] ***. Mindbreeze
Page 68 and 69: BIBLIOGRAPHY 59 [Hoa69] [How73] [KF
Page 70 and 71: BIBLIOGRAPHY 61 [VHBP00] [Wil94] W.
Page 72: Eidesstattliche Erklärung Ich erkl

Automated Formal Static Analysis and Retrieval of Source Code - JKU

Create successful ePaper yourself

Delete template?

Save as template?