
SEKE 2012 Proceedings - Knowledge Systems Institute



Institution (cod_institution, city, state, name, department);

Paper (cod_paper, title, cod_institution), cod_institution references Institution;

Writer (cod_writer, name, code, e_mail, age);

Conference (cod_conference, day, title, city, state);

PaperWriter (cod_paper, cod_writer), cod_paper references Paper and cod_writer references Writer;

ConferencePaper (cod_conference, cod_paper), cod_conference references Conference and cod_paper references Paper;
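As a rough illustration, the relational schema listed above could be written as DDL and loaded into an in-memory SQLite database. This is only a sketch: the column types, the choice of primary keys and the use of SQLite are assumptions, since the listing gives only relation names, attributes and references. The helper names `create_schema` and `table_names` are hypothetical.

```python
import sqlite3

# Column types and primary keys below are assumptions; the schema listing
# only provides relation names, attribute names and references.
DDL = """
CREATE TABLE Institution (
    cod_institution INTEGER PRIMARY KEY,
    city TEXT, state TEXT, name TEXT, department TEXT
);
CREATE TABLE Paper (
    cod_paper INTEGER PRIMARY KEY,
    title TEXT,
    cod_institution INTEGER REFERENCES Institution (cod_institution)
);
CREATE TABLE Writer (
    cod_writer INTEGER PRIMARY KEY,
    name TEXT, code TEXT, e_mail TEXT, age INTEGER
);
CREATE TABLE Conference (
    cod_conference INTEGER PRIMARY KEY,
    day TEXT, title TEXT, city TEXT, state TEXT
);
-- N:N relationships become tables of their own.
CREATE TABLE PaperWriter (
    cod_paper INTEGER REFERENCES Paper (cod_paper),
    cod_writer INTEGER REFERENCES Writer (cod_writer),
    PRIMARY KEY (cod_paper, cod_writer)
);
CREATE TABLE ConferencePaper (
    cod_conference INTEGER REFERENCES Conference (cod_conference),
    cod_paper INTEGER REFERENCES Paper (cod_paper),
    PRIMARY KEY (cod_conference, cod_paper)
);
"""


def create_schema(conn):
    """Create all six tables and enable foreign-key checks for this connection."""
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(DDL)


def table_names(conn):
    """Return the names of the user tables, sorted alphabetically."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
create_schema(conn)
print(table_names(conn))
```

With foreign keys enabled, inserting a Paper row that points to a missing Institution is rejected, which mirrors the "references" clauses of the listing.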

Figure 5. Global OWL Ontology

Basically, the non-lexical concepts are mapped to tables, as are the N:N relationships. The lexical concepts are mapped to columns in the corresponding tables. The columns whose cardinality is 1 become NOT NULL in the DB. Details of this mapping are described in [6].

4.2 Mapping Document

In order to describe and track the equivalences between concepts from the XML files, the ontology and the relational schema, we have defined a structured artifact named mapping document, as shown in Figure 6.

Figure 6. Mapping file

To explain the transformations involved in storing the input XML data into the relational schema, the mapping process represents the ontology concepts with the element (line 3). According to the existing concept in the global ontology, the XML data sources are identified by the id attribute of the element (line 4). The XML content is pointed to by the XPath expression in the element (line 5). If an XML source content does not refer to a certain ontology concept, the element is empty.

The element (line 13) indicates where the data will be stored. For non-lexical concepts, the storage is done into a table, as shown in the element (line 14); the table name is the element (line 15). For lexical concepts, the element has two internal elements, indicating that the information is stored into a column. From line 28 on, it is described that the concept title specified on line 18 is stored as a column (line 29), whose name is title (line 30) in the conference table (line 31). In this case, it is a lexical concept represented by a string (line 32).

We used the mapping file presented in Figure 6 for generating the insert statements. The mapping document is an essential mechanism that allows specifying the equivalences between the XML data, the ontology and the relational schema. For better understanding, only the conference and title concepts were demonstrated. The mapping document generation is implemented by a semi-automatic tool named CMap (Concept Mapper).
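To make the idea of generating insert statements from a mapping more concrete, the sketch below walks an XML source with XPath expressions and emits one parameterized INSERT per matched node. It is not the CMap implementation: the dictionary `MAPPING` is a hypothetical stand-in for one entry of the mapping document, and the XML layout, the XPath expressions, the sample titles and the function name `build_inserts` are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for one mapping entry: a non-lexical concept stored
# as a table, with its lexical concepts stored as columns. The XPath
# expressions and the XML layout are assumptions, not taken from Figure 6.
MAPPING = {
    "table": "conference",
    "row_xpath": "./conference",
    "columns": {"title": "./title"},
}

# Made-up sample input, used only to exercise the sketch.
SOURCE_XML = """
<conferences>
  <conference><title>SEKE 2012</title></conference>
  <conference><title>Example Conference</title></conference>
</conferences>
"""


def build_inserts(xml_text, mapping):
    """Return (sql, params) pairs, one per XML node matched by row_xpath."""
    root = ET.fromstring(xml_text)
    names = list(mapping["columns"])
    placeholders = ", ".join("?" for _ in names)
    sql = (
        f"INSERT INTO {mapping['table']} ({', '.join(names)}) "
        f"VALUES ({placeholders})"
    )
    statements = []
    for node in root.findall(mapping["row_xpath"]):
        # findtext returns None when the source has no content for a concept.
        params = tuple(node.findtext(mapping["columns"][n]) for n in names)
        statements.append((sql, params))
    return statements


for sql, params in build_inserts(SOURCE_XML, MAPPING):
    print(sql, params)
```

Keeping the values as parameters rather than splicing them into the SQL text avoids quoting problems when the XML content contains apostrophes.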

5 Conflict Classification and Resolution<br />

To store the original XML data into the RDB, several conflicts have to be detected and solved. Because of space constraints, we describe in detail only the naming conflicts. Other conflicts are just mentioned at the end of this section. The original work (master's thesis [5]) describes the full set of conflicts, as well as all the proposed algorithms, the implementation and the tests that deal with those conflicts.

The naming conflicts refer to naming inconsistencies in XML documents. There are two types of naming conflicts:
