
WWW/Internet - Portal do Software Público Brasileiro



ISBN: 978-972-8939-25-0 © 2010 IADIS

     N    Fn   D    #
1    A    -    0    1
2    B    A    1    1
3    C    A    1    1
4    A    B    2    2
5    A    C    2    3
6    D    C    2    1
7    A    D    3    4
8    B    D    3    3

Figure 4. A dummy web site on the left and the link-structure information that our crawler extracts for it. (N: Page-node, Fn: Father page-node, D: Depth, #: Counter and order of node’s appearance)

5. THE WEB MASTER TOOL ‘HOTLINK VISUALIZER’

In this section we describe the proposed web administration tool, ‘Hotlink Visualizer’. In short, it is a web-based tool that embeds the crawler described previously and stores all of the web site’s connectivity information in a database. It provides a visual representation of the site’s structure in a tree-like (expand-collapse) formation. After storing this map, it lets the user add hotlinks to the site’s map with an automated procedure, visualize the outcome, and finally make the changes to the link structure of the site permanent. The option of maintaining different versions of the web site’s map is also available. Our proposed tool guides the user through a step-by-step process to optimize the site as he or she sees fit.

The tool is programmed in Java/JSP, which ensures better interoperability with our Java crawler, as well as flexibility and greater functionality in the web context of our study. It is supported by an Oracle 10g Express Edition database for data storage, which provides adequate database capacity and scalability when working with large amounts of data. The web server used is Apache Tomcat 6.0.

Figure 5. The ‘Hotlink Visualizer’ tool. The web crawler’s page for initiating new crawling processes.

The home page of the tool welcomes the user and asks whether he or she would like to start a new crawl or work on an already stored instance of the web site of interest. Beforehand, the user has specified some information about the file system: the paths of the web server and of the web application’s source files are required in order to enable the automated hotlink addition. Additional editing paths for the web application’s source code can be specified. If the user opts for a new crawl, he or she fills in some information and
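
To make the link-structure table of Figure 4 more concrete, the following is a minimal Java sketch of how rows of the form (N, Fn, D) could be produced by a breadth-first traversal. It is not the authors' crawler: the class `LinkStructureSketch`, the record type `LinkRecord`, and the methods `extract` and `dummySite` are hypothetical names, and the adjacency list of the dummy site is inferred from the Fn/N pairs in the table rather than taken from the figure itself. The sketch assumes that a page already seen is recorded again under its new father but is not expanded a second time. The `#` column is left out because its exact semantics cannot be recovered from this excerpt.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;

public class LinkStructureSketch {

    /** One table row: page-node N, father page-node Fn, depth D (hypothetical type). */
    public static final class LinkRecord {
        public final String node;
        public final String father; // null for the root page
        public final int depth;

        public LinkRecord(String node, String father, int depth) {
            this.node = node;
            this.father = father;
            this.depth = depth;
        }

        @Override
        public String toString() {
            return node + " " + (father == null ? "-" : father) + " " + depth;
        }
    }

    /** Breadth-first walk: every link yields a row, but each page is expanded only once. */
    public static List<LinkRecord> extract(Map<String, List<String>> links, String root) {
        List<LinkRecord> records = new ArrayList<>();
        Map<String, Integer> depthOf = new HashMap<>(); // depth at first discovery
        Queue<String> queue = new ArrayDeque<>();

        records.add(new LinkRecord(root, null, 0));
        depthOf.put(root, 0);
        queue.add(root);

        while (!queue.isEmpty()) {
            String father = queue.remove();
            int childDepth = depthOf.get(father) + 1;
            List<String> children = links.getOrDefault(father, Collections.<String>emptyList());
            for (String child : children) {
                records.add(new LinkRecord(child, father, childDepth));
                if (!depthOf.containsKey(child)) { // first appearance: schedule for expansion
                    depthOf.put(child, childDepth);
                    queue.add(child);
                }
            }
        }
        return records;
    }

    /** Adjacency list inferred from the Fn -> N pairs of Figure 4 (an assumption). */
    public static Map<String, List<String>> dummySite() {
        Map<String, List<String>> links = new HashMap<>();
        links.put("A", Arrays.asList("B", "C"));
        links.put("B", Arrays.asList("A"));
        links.put("C", Arrays.asList("A", "D"));
        links.put("D", Arrays.asList("A", "B"));
        return links;
    }

    public static void main(String[] args) {
        for (LinkRecord row : extract(dummySite(), "A")) {
            System.out.println(row);
        }
    }
}
```

Under these assumptions, running `main` lists eight rows in the same order as the N, Fn, and D columns of the table. A real crawler would obtain the adjacency information by fetching and parsing pages instead of reading it from a fixed map.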
