14.06.2013 Views

Databases and Systems



the information-retrieval metric of Wilbur (16), which measures the similarity of two bibliographic citations based on textual content.¹

Within a single axis, entities may be inter-related through recursive relationships of the parent-child type. This complicates the query process because of the need to "explode" a query object instance, retrieving all its children prior to scanning the association data. For instance, in the example above, the pars compacta is part of the substantia nigra, which is part of the mid-brain. To process a query that asks for the anatomical locations of various receptors in the mid-brain, one would first have to retrieve all "child" anatomical sites within the mid-brain and then search the association data against this set of child sites.
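The "explode, then scan" procedure described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the `CHILDREN` map, the `ASSOCIATIONS` rows, the receptor names, and the function names `explode` and `receptors_in` are all hypothetical placeholders. The sketch also includes the queried site itself in the exploded set, which is a design choice the text does not specify.

```python
from collections import deque

# Hypothetical parent -> children map for one axis (anatomical sites).
CHILDREN = {
    "mid-brain": ["substantia nigra"],
    "substantia nigra": ["pars compacta"],
}

# Hypothetical association rows: (receptor, anatomical site).
# The receptor names are placeholders, not real data.
ASSOCIATIONS = [
    ("receptor_A", "pars compacta"),
    ("receptor_B", "substantia nigra"),
    ("receptor_C", "cerebellum"),
]


def explode(site, children=CHILDREN):
    """Return the site together with all of its descendants."""
    seen = {site}
    queue = deque([site])
    while queue:
        current = queue.popleft()
        for child in children.get(current, []):
            if child not in seen:  # guard against cycles in the data
                seen.add(child)
                queue.append(child)
    return seen


def receptors_in(site, associations=ASSOCIATIONS):
    """Scan the association rows against the exploded set of sites."""
    sites = explode(site)
    return sorted(receptor for receptor, s in associations if s in sites)
```

Calling `receptors_in("mid-brain")` first expands the mid-brain into its child sites and only then filters the association rows, mirroring the two-step process in the text. In a relational database the same expansion would typically be done by a recursive query or by repeated self-joins on the parent-child table.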

A general representation of N-ary Associations<br />

The sub-schema we propose to handle the case of N-ary associations (where N, the number of axes, varies) is shown in Fig. 1. (In this figure, table names are bold and underlined, while primary keys are in italics. Arrows point from a foreign key to a primary key.) The Classes and Objects tables have been mentioned earlier in connection with the Object Dictionary approach. The Facts table stores a unique identifier, the Fact ID, and a textual narrative of the fact (for reasons described shortly). There is a one-to-many link between the Facts table and a Citations table (not shown in the figure).
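As a rough illustration of the one-to-many link between Facts and Citations, the following sketch builds the two tables in an in-memory SQLite database. Only the Fact ID and the textual narrative come from the text; every other name here (`facts`, `citations`, `citation_id`, `reference`, `citations_for`) is an assumption, the inserted rows are placeholders, and the remaining tables of Fig. 1 are deliberately left out because the figure is not reproduced here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK checks off by default

conn.executescript("""
CREATE TABLE facts (
    fact_id   INTEGER PRIMARY KEY,   -- the Fact ID
    narrative TEXT NOT NULL          -- textual narrative of the fact
);
-- Hypothetical layout: each citation row points back to exactly one fact.
CREATE TABLE citations (
    citation_id INTEGER PRIMARY KEY,
    fact_id     INTEGER NOT NULL REFERENCES facts(fact_id),
    reference   TEXT NOT NULL
);
""")

# Placeholder rows, for illustration only.
conn.execute("INSERT INTO facts VALUES (1, 'placeholder narrative')")
conn.executemany(
    "INSERT INTO citations (fact_id, reference) VALUES (?, ?)",
    [(1, "placeholder citation A"), (1, "placeholder citation B")],
)


def citations_for(fact_id):
    """Return all citation strings attached to one fact."""
    rows = conn.execute(
        "SELECT reference FROM citations WHERE fact_id = ? ORDER BY citation_id",
        (fact_id,),
    )
    return [reference for (reference,) in rows]
```

Here `citations_for(1)` returns both placeholder citations, showing that a single fact can carry several supporting references while each citation row belongs to exactly one fact.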

¹ While some systems, such as NCBI's Entrez, store pre-computed sequence and citation similarity scores for efficiency, such pre-computation must be repeated each time new sequences or citations are added to the database. It is justifiable only if the primary purpose of the database is to assist similarity searching (as in Entrez).
