7 - Indira Gandhi Centre for Atomic Research
7 - Indira Gandhi Centre for Atomic Research
7 - Indira Gandhi Centre for Atomic Research
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Digital Library: File Formats, Standards and Protocols<br />
Umesha Naik and Shivalingaiah D.<br />
Abstract<br />
The development of the WWW and other networked digital in<strong>for</strong>mation systems has<br />
provided in<strong>for</strong>mation professionals with many opportunities, while at the same time<br />
requiring them to confront issues that they have not had occasion to explore<br />
previously. Now the mode of conversion, storage and dissemination of in<strong>for</strong>mation<br />
are changing in electronic era. This paper highlights the various file <strong>for</strong>mats, standards<br />
and protocols which may act as a guideline <strong>for</strong> preservation, maintenance, storage and<br />
access of digital materials. This paper also considers in depth the issue surrounding<br />
the file <strong>for</strong>mats, standards and protocols of the digital library materials. As with so<br />
many issues surrounding digitization the question of choosing which file <strong>for</strong>mat,<br />
standardisation and protocols to use both store and access the in<strong>for</strong>mation is<br />
somewhat vexed one. There is a great need <strong>for</strong> adopting various standards and best<br />
practices to build interoperable digital libraries.<br />
Keywords: Digital Library, File Formats, Standards, Protocols, Metadata, Z39.50,<br />
MARC 21, Image files, Text files, Program Files, Structured files.<br />
1. Introduction<br />
Digital library is a collection of digital objects, including text, video and audio along with<br />
methods <strong>for</strong> access, retrieval, selection, organisation and maintenance of the collection.<br />
The important point is that a digital library has materials stored in a computer system in a<br />
<strong>for</strong>mat that allows it to be manipulated and delivered in ways that the conventional version<br />
of the materials cannot be. File <strong>for</strong>mats vary in terms of resolution, bit-depth, colour<br />
capabilities, and support <strong>for</strong> compression and metadata. As with all matters to do with<br />
computers there are standards which impinge on the area of digital libraries.<br />
The standards fall into three areas - material description, user access and system<br />
architecture. The quality and the <strong>for</strong>mats of the items may vary Lecturer considerably,<br />
particularly where results whether they are hit lists or the final required objects, come from<br />
a number of sources. The goal is to access relevant in<strong>for</strong>mation seamlessly, regardless of<br />
its type and location. A protocol is the special set of rules that end points in a<br />
telecommunication connection use when they communicate.<br />
2. Text and Document Files<br />
A file that consists of text characters without any <strong>for</strong>matting in<strong>for</strong>mation it is also known<br />
as an ASCII file. A text file can be read by any word processor and file that contains<br />
characters in a plain human-readable <strong>for</strong>mat.<br />
2.1 Text file<br />
The lowest common denominator of text file <strong>for</strong>mats contains only ASCII characters<br />
without any special <strong>for</strong>matting. A text is not an undifferentiated sequence of words, much<br />
less of bytes. There are actually two <strong>for</strong>ms of ASCII: standard and extended. Standard<br />
ASCII contains codes <strong>for</strong> 128 characters (i.e. a 7 bit binary code). Ed<br />
________________________________________________________________________<br />
Mangalore University, Mangalagangothri-574199; E-mail:umeshai@yahoo.com<br />
94