READIT - 2009 - Indira Gandhi Centre for Atomic Research
READIT - 2009 - Indira Gandhi Centre for Atomic Research
READIT - 2009 - Indira Gandhi Centre for Atomic Research
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Proceedings of <strong>READIT</strong>-<strong>2009</strong>, Kalpakkam - 603 102, 29-30 Dec <strong>2009</strong><br />
- Creating backup copies of all files on a stable medium.<br />
- Creating meaningful metadata <strong>for</strong> image files or collections.<br />
- Storing media in an appropriate environment.<br />
- Monitoring an recopying data as necessary.<br />
- Outlining a migration strategy <strong>for</strong> transferring data across generations of technology.<br />
- Anticipating and planning <strong>for</strong> future technological developments.<br />
Since the different steps in digitization listed below must be identified and a workflow<br />
carefully designed and executed by each library, it is only possible to highlight the major<br />
challenges under each of the steps here. Deciding on appropriate <strong>for</strong>mats <strong>for</strong> incorporating<br />
the contents into CD or on the Web will be dependent on the presence of different software<br />
required, and the means to support any extra systems needed as well as the size and features<br />
of materials to be digitized.<br />
5.2. Building-up digital collections<br />
The most important component of a digital library is the digital collection it holds or<br />
has access to. Viability and extent of usefulness of a digital library would depend upon the<br />
critical mass of digital collection that it has. A digital library can have a wide range of<br />
resources. It may contain library can have a wide range of resources. It may contain both<br />
paper-based conventional documents and in<strong>for</strong>mation contained in computer-processible<br />
<strong>for</strong>m. In<strong>for</strong>mation contents of a digital library, depending on the media type it contains, may<br />
include a combination of structured / unstructured text, numerical data, scanned images,<br />
graphics audio and video recordings. Different types of resources need to be handled<br />
differently in digital library.<br />
Rusbridge (1998) divided resources <strong>for</strong> a digital library in three distinct categories,<br />
i.e., Legacy, Transition, and New.<br />
Legacy resources, according to Rusbridge, are largely non-digital resources,<br />
including manuscript, print, slides, maps, audio and video recordings. In spite of the fact that<br />
large investments are being made in the process of digitization of resources, vast majority of<br />
existing legacy resources will remain outside the electronic domain <strong>for</strong> many years to come.<br />
These legacy resources are the major resources of existing libraries.<br />
Transition resources, primarily designed <strong>for</strong> another medium (mostly print), are<br />
those which are being or have been digitized, making the transition into the digital world.<br />
Such resources are converted <strong>for</strong> increased access and to reduce reliance on physical libraries.<br />
The transition resources are either digitized images or images that are converted to text by the<br />
process of Optical Character Recognition (OCR).<br />
New digital resources are either expressly created as digital or are created in parallel<br />
to print. Data files created in the process of electronic publishing are used <strong>for</strong> generating<br />
outputs suitable <strong>for</strong> the Internet and the web. These data files are converted into SGML,<br />
HTML, PostScript and PDF. While HTML and PDF (sometimes PostScript) is posted on the<br />
web. The SGML version which is a rich archive <strong>for</strong>mat is used <strong>for</strong> preservation.<br />
6. LIMITATIONS<br />
Some of the key issues facing digital libraries today:<br />
- Technological obsolescence: Hardware and Software<br />
- Cost of Content refreshing: Digital preservation will be an ongoing operation that<br />
requires considerable recurring expense.<br />
29