READIT-2007 - Indira Gandhi Centre for Atomic Research

More documents

Recommendations

Info

Multimedia Data Mining in Digital Libraries: Standards and Features Sanjeevkumar R. Jadhav * , and Praveenkumar Kumbargoudar * Abstract The digital library retrieves, collects, stores and preserves the digital data. For this purpose, there is need to convert different formats of information such as text, images, video, audio, etc. The data mining techniques are popular while conversion of the multimedia files in the libraries. The present paper attempted to define the term data mining. It also covered different data mining features and standards. The paper explained about the Architecture of data mining, which contains the stages of the data mining such as (1) domain understanding; (2) data selection; (3) cleaning and preprocessing; (4) discovering patters; (5) interpretation; and (6) reporting and using discovered knowledge. It is emphasized that there is need to develop multimedia data mining techniques and standards in the library for conversion of multimedia information. 1. INTRODUCTION Over the past few decades, rapid changes in information technology have drastically changed the functions and activities of the libraries. The Information and Communication Technology created a new type of work culture, new forms of information storage, and new means of communication and dissemination of information. The advent of electronic resources and their increased use in libraries has brought about significant changes in Storage and Communication of Information. As a Result, the Conventional libraries are transforming into digital libraries. Majority of the libraries have computerized already and digitizing their printed collection. In India, the process of digitization is slow compared to other developed countries. This is so because, only 21% of the Indian population is computer literate and only 14% of the Indian Population is using Internet. Due to the development in digitization, many of the libraries are digitizing their collection by transforming their printed materials into digital form. A fully developed digital library environment involves the following elements 1 : 1. Initial Conversion of Content from Physical to Digital form. 2. The extraction or creation of metadata or indexing information describing the content to facilitate searching and discovery, as well as administrative and structural metadata to assist in object viewing, management and preservation. 3. Storage of digital content and metadata in appropriate multimedia repository. The repository will include rights management capabilities to enforce Intellectual Property Rights, if required. e-commerce functionality may also be present if needed to handle accounting and billing. 4. Client Services for the browser, including repository querying and workflow. 5. Content delivery via file transfer or streaming media. 6. Patron access through a browser or dedicated client. * Gulbarga University, GULBARGA: 585 106. Karnataka. E-Mail: kumbargoudar@rediffmail.com 54
7. A private or public network. 2. DIGITIZATION AND DATA MINING Digitization refers to the conversion of an item – be it printed text, manuscript, image or sound, film and video recording – from one format (usually print or analogue) into digital. The process basically involves taking a physical object and essentially making an ‘electronic photograph’ of it. An image of the physical object is captured- using a scanner or digital camera – and converted to digital format that can be stored electronically and accessed via a computer 2 . It is noted that the data and information available in different formats. These formats include Text, Images, Video, Audio, Picture, Maps, etc. It is noted that in case of text information, there is needed to scan the printed text through scanners and provide different links to access it. But in case of multimedia formats like images, Audio, Picture, Maps, Video etc, the conversion and systematic presentation is not easy. Further, there is needed to make automatic search for easy accessibility. The easy search, effective and systematic presentation of the data is essential in case of multimedia information. For this purpose, there is need to adopt data mining techniques in the library. Data mining techniques are basically from logic, Multimedia and Artificial Intelligence techniques. Data mining is the automatic extraction of patterns of information from historical data, enabling companies to focus on the next important aspects of their business—telling them what they did not know and had not even thought of asking 3 . Data mining is that it “is the process of automating information discovery” 4 , which improves decision making and gives a company advantages on the market. Another definition is that is “is the exploration and analysis, by automatic or semiautomatic means, of large quantities of data in order to discover meaningful patterns and rules: 5 Data mining is an applied discipline, which grew our of the statistical pattern recognition, machine learning, and artificial intelligence and coupled with business decision making to optimize and enhance it. Initially, data mining techniques have been applied to structured data from databases. Recently two branches of data mining, text data mining and Web data mining, have emerged 6&7 . They have their own research agenda, communities of researchers, and supporting companies that develop technologies and tools. Unfortunately, today multimedia data mining is in beginning stage and still there is need for developments to make effective presentation of multimedia information. There are four types of multimedia data: audio data, which includes sound , speech, and music; image data (black-and-white and colour images); video data, which include timealigned sequences of images; and electronic or digital, which is sequences of time aligned 2D or 3D coordinates of a stylus, a light per, data glove sensors, or a similar device. All this data is generated by specific kind of sensors. The concept of mining in multimedia is also referred to as automatic annotation or annotation mining. There appears to be three main pattern discovery approaches that have been used for automatic annotation in multimedia data mining. These approaches primarily differ in terms of how external knowledge is provided to mine concepts. The first approach includes assigning key words or classifying the data. The second approach for automatic annotation is through clustering and here multimedia documents are clustered first and then the resulting clusters are assigned keywords by annotator. The third approach does not rely on manual annotator and it tries to mine concepts by knowing the contextual information. 55
Page 1 and 2:
Conference on Recent Advances in In
Page 3 and 4:
CONTENTS INVITED TALKS “Bodhi”
Page 5 and 6:
Dr. Sanjay K Kaushik and Vijendra S
Page 7 and 8:
Tacit Knowledge Contextual Mental P
Page 9 and 10:
documents can become knowledge repo
Page 11 and 12:
objects to form a conceptual networ
Page 13 and 14:
Global highly competitive fast ch
Page 15 and 16:
knowledge would always be packaged
Page 17 and 18:
If our human filters are all too mu
Page 19 and 20:
6. Booch, G. (1993). Object-Oriente
Page 21 and 22:
development (Balyan, R.K. 2007). Th
Page 23 and 24:
4. NEW FOUND ROLES / RESPONSIBILITI
Page 25 and 26: human resource development in the k
Page 27 and 28: Those librarians who have the neede
Page 29 and 30: of information using formal and inf
Page 31 and 32: high quality information and by net
Page 33 and 34: and underpin information management
Page 35 and 36: everywhere on the Internet, existin
Page 37 and 38: 11. RESOURCES SHARING AND NETWORKIN
Page 39 and 40: influence the organization’s know
Page 41 and 42: KNOWLEDGE MANAGEMENT IN DIGITAL INF
Page 43 and 44: Thirdly Knowledge Management promot
Page 45 and 46: 6. KNOWLEDGE MANAGEMENT: COMMON AND
Page 47 and 48: 7. KNOWLEDGE MANAGEMENT SOLUTION IN
Page 49 and 50: Knowledge and Information Managemen
Page 51 and 52: to a broad collection of organizati
Page 53 and 54: Information technology is a tool fo
Page 55 and 56: In the library world, there is a le
Page 57 and 58: more science in public domain. Seve
Page 59 and 60: Web-of Science which covers Science
Page 61 and 62: Creation of digital contents and in
Page 66 and 67: University Libraries - Metamorphosi
Page 68 and 69: • Management skills - motivating,
Page 70 and 71: KM will play an essential role, and
Page 72 and 73: 4. INDUSTRIAL SAFETY AND HEALTH INF
Page 74 and 75: Safety Knowledge Management System
Page 78 and 79: The Multimedia Data Mining (MDM) is
Page 80 and 81: performed at the global or local le
Page 82 and 83: REFERENCES 1. Sinha, Manojkumar and
Page 84 and 85: - Information explosion - Prolifera
Page 86 and 87: also go together for adding visual
Page 88 and 89: 6.2 Format This refers to the abili
Page 90 and 91: Using Knowledge mapping to support
Page 92 and 93: - Be aware of organizational level
Page 94: application of knowledge providing
Page 99 and 100: By implementing these strategies, t
Page 101 and 102: detectors are suitable for detectin
Page 103 and 104: iii) Response Indicator II. Confirm
Page 105 and 106: 5. CONCLUSION Present day libraries
Page 107 and 108: The objective of the archives is: 1
Page 109 and 110: custody and two copies are made for
Page 111 and 112: DIGITAL PRESERVATION ISSUES AND FAL
Page 113 and 114: epresentation, provenance, fixity a
Page 115 and 116: ) Technical c) Resource challenges.
Page 117 and 118: There are many cost factors to cons
Page 119 and 120: This knowledge would vanish from th
Page 121 and 122: The key issues were: - Development
Page 123 and 124: as there would be changes in compet
Page 125 and 126: Digital Preservation and Online Acc
Page 127 and 128:
Physical deterioration of digital m
Page 129 and 130:
7. POLICIES FOR DIGITIZATION 1. Meg
Page 134 and 135:
Library 2.0: Myth or Reality ? E.So
Page 136 and 137:
Web 1.0 Web 2.0 DoubleClick --> Goo
Page 138 and 139:
and chooses tagging, tag clouds, fo
Page 140 and 141:
and access can be given through com
Page 142 and 143:
writes, injures, defaces, cuts, mut
Page 144 and 145:
to have access to restricted areas.
Page 146 and 147:
Internet Usage by Research Scholars
Page 148 and 149:
Kumar and Amritpal Kaur (2004) 5 st
Page 150 and 151:
whereas fifteen respondents use it
Page 152 and 153:
Fully satisfied Table 12 Satisfacti
Page 154 and 155:
Information Security: The role of D
Page 156 and 157:
Identifier Owner suffix 10-1003 / 0
Page 158 and 159:
is costly affair. However, it is a
Page 160 and 161:
(IR) and providing their own digita
Page 162 and 163:
As per the data of Home Ministry of
Page 164 and 165:
4.8 Cyber Defamation: Cyber defamat
Page 166 and 167:
4.18 Web jacking Forceful taking of
Page 168 and 169:
Under the circumstances, the Librar
Page 170 and 171:
10. CONCLUSION Cyber crimes in Indi
Page 172 and 173:
2. STATE AGRICULTURAL UNIVERSITIES
Page 174 and 175:
Table- 2: Library Network facilitie
Page 176 and 177:
Table- 4: OPAC / Web OPAC Facilitie
Page 178 and 179:
6.6 Electronic surveillance Librari
Page 182 and 183:
Knowledge Management in Academic In
Page 184 and 185:
summarizes evolution of Knowledge M
Page 186 and 187:
To prove their relevance and value,
Page 188 and 189:
functions of the organization. Patr
Page 190 and 191:
Knowledge Communication in Academic
Page 192 and 193:
y way of conferring it the status o
Page 194 and 195:
through face-to-face conversation.
Page 196 and 197:
hurdles. Instant message are delive
Page 198 and 199:
Knowledge communication network of
Page 200 and 201:
As KM and learning activities for o
Page 202 and 203:
• Operation personnel are require
Page 204 and 205:
Figure-2: Home page for Centralised
Page 206 and 207:
Glossary Development Figure-4: Home
Page 208 and 209:
Extension of Qualification Systems
Page 210 and 211:
Nuclear Knowledge Management (NKM)
Page 212 and 213:
Knowledge Sharing Environment (1) B
Page 214 and 215:
already possess it. As those worker
Page 216 and 217:
- identify critical “at risk” k
Page 218 and 219:
open literature, content and TOC cr
Page 220 and 221:
FROM MANAGEMENT OF ORGANIZATIONAL K
Page 222 and 223:
Organizational Knowledge Organizati
Page 224 and 225:
Central Repository Data + Informati
Page 226 and 227:
The infrastructure based approach i
Page 228 and 229:
The ‘System Quality’ refers to
Page 230 and 231:
KNOWLEDGE: A MARKETABLE COMMODITY A
Page 232 and 233:
APPLYING E-COMMERCE TO E-LEARNING E
Page 234 and 235:
Role of Libraries in the Knowledge
Page 236 and 237:
their scholars to post their articl
Page 238 and 239:
Author Index Amudhavalli, A., 143 A
show all

READIT-2007 - Indira Gandhi Centre for Atomic Research

Create successful ePaper yourself

Delete template?

Save as template?