RBU_JR_LIS_V23_2021-FULL_TEXT-E-Copy

More documents

Recommendations

Info

Sarkar & Bhattacharya: Library of Congress Subject Headings …In case of overlapping (as displayed Table: 1) out of theseLCSH descriptors and social tags only 51 terms wereoverlapped i.e. these 51 terms used by both experts andgeneral users in whole collection. Thus, it appears that theoverlapping terms cover only 3.06% for social tags and27.86% for LCSH descriptors. In other words it can be saythat a large portion (100%-3.06%=96.94%) of the socialtags are not available in the LCSH descriptor, but incontrast, about 28 % of the LCSH descriptors are likely tobe accepted by users as social tags.4 Spearman’s Rank Correlation of overlappingtermsFurthermore, the study attempts to show the frequency ofusage of the overlapping terms when they are used asLibraryThing tags and as LCSH descriptors. For thispurpose the overlapping terms were ranked as frequencywise in both datasets (highest to lowest). Spearman's rankcorrelation was used to perform the task.The formula of Spearman’s correlation coefficient asfollows:6Σd 2r = 1---------------------n(n 2 – 1)In this study Spearman’s correlation coefficient of the twosets of terms rankings is 0.83 that shows there is a strongrelation between them. It also concludes that when anoverlapping word is used as subject headings by experts,users have an 83 percent chance of using it as a social tag.5 Top ten used social tags & LCSH descriptorsThis section focused on the comparison between the topten frequent LCSH descriptors and LibraryThing socialtags in both dataset.Quantumphysics14 Magnetism 5Reference 12 Fluid mechanics 3Particlephysics11Quantum theory-Popular worksTable-2: Top 10 (ten) frequent social tags & LCSHdescriptors relating to PhysicsThis study attempted to measure subject oriented terms(particularly for this domain i.e. physics) and non-subjectoriented terms (not related to particular this domain, butsubject element) within the top frequent terms in bothdatasets. Above Table-2 shows that LCSH descriptorscontains ten (10) purely subject-based terms whereasLibraryThing social tags contains six (06) purely subjectbasedterms and remaining four (04) personal terms (e.g.:non-fiction, textbook, reference and to-read). Above tablealso shows that out of both set of terms only two (02)terms are common, which are Physics and Mathematicalphysics. These three terms used by both domain expertsand users (bold in the above table). From the frequencyanalysis it is also clear that the word 'Physics' has thehighest frequency (98) in the social tag vocabulary andalso in the LCSH vocabulary with 16 frequencies. It canbe saying that the term ‘Physics’ is used in 98 titles out of100 titles by taggers (users) whereas used in 16 titles byexperts. Next the word ‘Mathematical physics' was used in22 titles in the tag vocabulary and 8 in the LCSHvocabulary. On other hand not similar but very close, theterm 'Relativity' was used in 28 titles in the social tagvocabulary and the word 'Relativity (Physics)' 11 titles inthe LCSH descriptors. This means that most users usegeneral topic or subject-based terminologies rather thanexpertise.6 Social tags compared with LCSH subdivisions(as per MARC tag)2Social tagsFrequencyLCSHdescriptorsPhysics 98 Physics 16Science 72Relativity(Physics)non-fiction 67 Physics textbook 10textbook 37MathematicalphysicsRelativity 28 Nuclear physics 8Mathematicalphysicsto-read 21 Thermodynamics 6Frequency118In MARC 21 bibliographic format 650 field known asSubject added entry-Topical term. This study comparesLibraryThing social tags with all subfields under MARC21 field 650 that have been used in selected books. Thiscomparison helps to know the ratio of used subfields byboth users and experts. For this study, Subfieldsconsidered under field 650 are: $a - Topical term orgeographic name entry element, $d - Active dates, $v -Form subdivision, $x - General subdivision, $y -Chronological subdivision and $z - Geographicsubdivision.In case of comparison of social tags with LCSH22 Mechanics 7descriptors from MARC subfield’s point of view table-3explores that under field 650, subfield $a used for the alltitles (100) and remaining are as follows $x (21), $v (10),$z (7) and $d (1) by the domain experts. In case of socialtags, terms were (at least one) matched with LCSH $a10subfield in 49https://lisrbu.wixsite.com/dlis/rbu-journal-of-lis
RBU Journal of library & Information Science, V. 23, 2021books, $v subfield in 5 books, $x subfield in 4 books, and$z subfield in 3 books. There is no matching seen undersubfield $d and $y. From comparison given in the table 3below, it can be concluded that the subfield $a are mostfamous to the users and the popularity of other subfieldsare as follows $v, $z, $x.Number of books/records = 100MARC Subfields used in LCSH(N=100)$a $x $v $z $dNumber of titles with this subfield in LCSHdescriptors100100%2121%1010%77%11%Number of titles which have at least one49 4 5 3 0matching with LCSH subfield termsPercentage 49% 19.04% 50% 42.85% 0%Table 3: Comparison of social tags with LCSH descriptors from MARC subfield’s point of view7 Similarity and distance measurement based onJaccard similarity coefficientIn this study top frequently used social tags and topfrequently used LCSH descriptors were analyzed in orderto identify if any similarities and distances exist at thelevel of use. For this purpose Jaccard similarity index wasused. “The Jaccard Index, also known as the Jaccardsimilarity coefficient, is a statistic used in understandingthe similarities between sample sets. The measurementemphasizes similarity between finite sample sets, and isformally defined as the size of the intersection divided bythe size of the union of the sample sets”. This is a measureof similarity for two sets of data, with a range from 0% to100%. When the percentage is higher, that means moresimilarities can be found between the two populations(Statistics How To, n.d.).The formula is as follows:Jaccard Index = (the number in both sets) / (thenumber in either set)In details steps are:“Count the number of members which are shared betweenboth sets.Count the total number of members in both sets (sharedand un-shared).Divide the number of shared members by the totalnumber of members.Multiply the number you found in by 100 (This willproduce a percentage measurement of similarity betweenthe two sample sets)” (Statistics How To, n.d.).We know the formula is:Jaccard Index = (the number in both sets) / (thenumber in either set)The same formula in notation is:J(X, Y) = |X∩Y| / |X∪Y|[Where X= Social tags and Y= LCSH descriptors]For this study both data sets are as follows:X= {1, 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 14, 21, 22, 28, 37, 67,72, 98}Y= {1, 2, 3, 5, 6, 7, 8, 10, 11, 16}So,J(X, Y) = |X∩Y| / |X∪Y|11https://lisrbu.wixsite.com/dlis/rbu-journal-of-lisJ(X, Y) =|{1, 2, 3, 5, 7, 10, 11}| / |{1, 2, 3, 4, 5, 6, 7, 8, 9,10, 11, 12, 14, 16, 21, 22, 28, 37, 67, 72, 98}|J(X, Y) = 7/21 = 0.3333We know if the results would be closer to 100%, thatmeans high similarity presents (e.g. 90% is more similarthan 89%). If results would be 0%, that means nosimilarity presents.This study also shows the Jaccard distance between them.“The Jaccard distance, is a measure of how dissimilar twosets are. It is the complement of the Jaccard index and canbe found by subtracting the Jaccard Index from 1”(Statistics How To, n.d.).The formula is as follows:D(X, Y) = 1 – J(X,Y)Here, Jaccard distance is = 1- 0.3333 = 0.6667In this study, Jaccard similarity index becomes 0.3333 or33.33 (0.3333*100 = 33.33%) which indicate a littlesimilarity between social tags and descriptors. Jaccarddistance shows that the top frequent social tags used byusers and top frequent LCSH descriptors used by domainexperts are dissimilar.Suggestion and ConclusionsOverall comparison between social tags and LCSHdescriptors provides many results regarding thefunctionality and usability of social tags in the library.Overlapping of terms makes it clear that the vocabulary ofthe social tags is larger than the LCSHs database. Out oftotal LCSH descriptors and social tags only 51 terms wereoverlapped i.e. these 51 terms used by both experts andgeneral users in whole collection. Those overlapping termscover only 3.06% (very small portion) for social tags and27.86% for LCSH descriptors. This means that usersmostly use controlled terms as tags to describe books, butexperts rarely use social tags as descriptors. In terms ofoverlapping words, Spearman's rank correlation suggeststhat when the word is used as a tag (here used asLibraryThing tag), as a descriptor there is 83 percentchance of using it. However it is clear that there arevocabulary differences between the two datasets.
Page 1: e-Published on 14th November 2021
Page 4 and 5: About the JournalThe RBU Journal of
Page 6 and 7: Objective-1-2 linesMethodology-1-2
Page 8 and 9: UGC CARE enlisted JournalRBU JOURNA
Page 10 and 11: RBU Journal of Library & Informatio
Page 12 and 13: found various literary works regula
Page 14 and 15: Biswas & Mukherjee: Books for bulle
Page 20 and 21: Sarkar & Bhattacharya: Library of C
Page 28 and 29: Choudhury & Rath: Use of college li
Page 34 and 35: RBU Journal of library & Informatio
Page 42 and 43: Sharma & Karkee: Assessment of News
Page 50 and 51: Sainul: Information Management …I
Page 52 and 53: Sainul: Information Management …m
Page 54 and 55: Sainul: Information Management …s
Page 68 and 69: Majumdar: 110 years influence of
Page 70 and 71: Majumdar: 110 years influence of
Page 72 and 73:
Majumdar: 110 years influence of
Page 74 and 75:
Majumdar: 110 years influence of
Page 76 and 77:
Saha & Hatua: Generalities Class (0
Page 78 and 79:
Page 80 and 81:
Page 82 and 83:
RBU Journal of library & Informatio
Page 84 and 85:
Page 86 and 87:
Page 88 and 89:
Sl. NoMode of ILPNo. ofRespondents(
Page 90 and 91:
Page 92 and 93:
Page 94 and 95:
India that have the most frequently
Page 96 and 97:
Page 98 and 99:
Page 100 and 101:
Page 102 and 103:
2010. Five document types were foun
Page 104 and 105:
Page 106 and 107:
Page 108 and 109:
Page 110 and 111:
Page 112 and 113:
Page 114 and 115:
Page 116 and 117:
Page 118 and 119:
On the basis of data collections, t
Page 120 and 121:
Page 122 and 123:
Page 124 and 125:
Page 126 and 127:
Page 128 and 129:
Page 130 and 131:
Page 132 and 133:
Page 134 and 135:
Page 136 and 137:
1 Changes of Nomenclatures andIncor
Page 138 and 139:
Page 140 and 141:
show all

RBU_JR_LIS_V23_2021-FULL_TEXT-E-Copy

Create successful ePaper yourself

Delete template?

Save as template?