23.01.2014 Views

7 - Indira Gandhi Centre for Atomic Research


thesauri and controlled vocabularies are widely used as a basis for both resource description/discovery and information management purposes. A minimum metadata set, such as Document Identifier, Title, Description, and Subjects, is required for harvesting.

OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a mechanism for harvesting XML-formatted metadata from distributed collections of metadata. The protocol provides a basic information discovery environment that relies on transferring metadata en masse from one server to another in a network of information systems, based on the open standards HTTP (Hypertext Transfer Protocol) and XML (Extensible Markup Language).
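The harvesting step described above can be sketched in Python. This is a minimal illustration, not the implementation used by any particular repository: the base URL, the function names, and the trimmed sample response are all hypothetical, and only the `ListRecords` verb, the `oai_dc` metadata prefix, and the XML namespaces come from the OAI-PMH and Dublin Core specifications. No network call is made; the sketch only builds the request URL and parses a response that is assumed to have been fetched already.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespaces defined by OAI-PMH 2.0 and unqualified Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Trimmed, hypothetical ListRecords response (a real one carries more elements).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:1234</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample report</dc:title>
          <dc:description>Hypothetical record used for illustration.</dc:description>
          <dc:subject>Metadata</dc:subject>
          <dc:subject>Harvesting</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build the HTTP GET URL for an OAI-PMH ListRecords request."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return f"{base_url}?{query}"


def parse_records(xml_text):
    """Extract the minimum metadata set from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": record.findtext(
                "oai:header/oai:identifier", default="", namespaces=NS),
            "title": record.findtext(".//dc:title", default="", namespaces=NS),
            "description": record.findtext(
                ".//dc:description", default="", namespaces=NS),
            "subjects": [s.text for s in record.iterfind(".//dc:subject", NS)],
        })
    return records


if __name__ == "__main__":
    print(list_records_url("http://repository.example.org/oai"))
    print(parse_records(SAMPLE_RESPONSE))
```

The four keys in each parsed record mirror the minimum metadata set named above (Document Identifier, Title, Description, Subjects).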

[Figure: several repositories (data providers) are harvested by a service provider into a metadata database; the service provider offers search and retrieval of metadata to the end user.]

Fig.6 Basic approach of OAI-PMH

The metadata that is harvested may be in any format agreed on by a community (or by any discrete set of data and service providers), although unqualified Dublin Core is specified to provide a basic level of interoperability. Thus, metadata from many sources can be gathered together in one database, and services can be provided based on this centrally harvested or "aggregated" data. The link between this metadata and the related content is not defined by the OAI protocol. It is important to realize that OAI-PMH does not provide a search across this data; it simply makes it possible to bring the data together in one place.
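The division of labour described above, where harvesting only brings records together and search is a separate service built on top, can be sketched as follows. The record layout, repository names, and both function names are assumptions made for illustration; OAI-PMH itself defines neither the store nor the search.

```python
def aggregate(harvests):
    """Merge per-repository record lists into one store keyed by identifier."""
    store = {}
    for repository, records in harvests.items():
        for record in records:
            # Remember which data provider each record came from.
            store[record["identifier"]] = {**record, "source": repository}
    return store


def find_by_subject(store, subject):
    """A service-provider feature layered on the store, not part of OAI-PMH."""
    wanted = subject.lower()
    return sorted(
        identifier
        for identifier, record in store.items()
        if wanted in (s.lower() for s in record.get("subjects", []))
    )


if __name__ == "__main__":
    # Hypothetical harvest results from two data providers.
    harvests = {
        "repo-a": [{"identifier": "oai:a:1", "title": "T1",
                    "subjects": ["Metadata"]}],
        "repo-b": [{"identifier": "oai:b:7", "title": "T2",
                    "subjects": ["Harvesting", "Metadata"]}],
    }
    store = aggregate(harvests)
    print(find_by_subject(store, "metadata"))
```

Keying the store by the record identifier means a record harvested again later simply overwrites its earlier copy; whether that is the right policy depends on the service provider.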

4. Storage & Retrieval

There is an increasing need to provide search over the full text of digitized documents in addition to the bibliographic level. This requires a storage and retrieval system that takes advantage of the structural knowledge carried by the index terms assigned to the various search fields. The impact of this capability also needs further research. Integrating structural and textual information can yield higher-quality retrieval results.
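One way to picture the integration of structural and textual information is a small inverted index whose postings are keyed by field as well as by term. The sketch below is an illustration under assumptions, not the system the text proposes: the class name, the field names "subject" and "fulltext", and the sample documents are all hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class FieldedIndex:
    """Inverted index that records which field each term occurred in."""

    def __init__(self):
        # (field, term) -> set of document ids
        self.postings = defaultdict(set)

    def add(self, doc_id, fields):
        for field, text in fields.items():
            for term in tokenize(text):
                self.postings[(field, term)].add(doc_id)

    def search(self, field, query):
        """Return ids of documents whose `field` contains every query term."""
        terms = tokenize(query)
        if not terms:
            return set()
        result = set(self.postings.get((field, terms[0]), set()))
        for term in terms[1:]:
            result &= self.postings.get((field, term), set())
        return result


if __name__ == "__main__":
    index = FieldedIndex()
    index.add("doc1", {"subject": "reactor safety",
                       "fulltext": "Sodium leak detection in fast reactors."})
    index.add("doc2", {"subject": "digital libraries",
                       "fulltext": "Harvesting metadata about reactor reports."})
    # Structural constraint (assigned index term) combined with full-text match.
    print(index.search("subject", "reactor") & index.search("fulltext", "sodium"))
```

Intersecting a field-restricted result with a full-text result is the simplest way to combine the two kinds of evidence; a production system would typically add ranking on top.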

Standard Generalized Markup Language (SGML) provides a very powerful tool for describing documents. Based on SGML, Hypertext Markup Language (HTML), Hypermedia/Time-based Structuring Language (HyTime), and Text Encoding Initiative
