Web Mining and Social Networking: Techniques and ... - tud.ttu.ee

More documents

Recommendations

Info

174 8 Web Mining and Recommendation Systemslearning is able to train the model (i.e. the item similarity matrix) before the real recommendationmaking, largely avoiding the computational difficulty and high time cost. Upon the learnedmodel, the further recommendation operation could be performed in a short time period, makingthe online recommendation feasible and operational. More importantly, such two-stagerecommendation scheme has become a well-adopted strategy for many recommender systemlater. Basically the computational complexity of such model-based CF systems requires anO(n 2 ) for a setting of n items.8.1.3 Performance EvaluationIn [218], comprehensive experiments are carried out to evaluate the proposed model-basedCF systems. In the following section, we briefly review the experimental setting and somerepresentative results.The dataset used in the experiments is from the MovieLens recommender system. Movie-Lens is a Web-based research recommender system built up in Fall 1997. Since Sagwar et al.published their work using the MovieLens dataset, later many researchers on recommendersystem research are continuing the use of this dataset for comparative studies. Even recentlysome latest recommendation work use it as a benchmark [222]. The dataset chosen for experimentscontains 943 users, 1682 movies and 100,000 ratings. For the chosen dataset, acertain percentage of whole dataset is separated for the model training purpose, while the restof dataset is left out for the test. In the experiments, different separation ratio values x areempirically investigated.Here we select several experimental results to present. Figure 8.3 depicts the impact oftrain/test separation ratio values and the neighborhood size on MAE using the two recommendationstrategies: item-item weighted sum and regression. Known from the figure, thebigger separation ratio values of train/test dataset always achieve the better recommendationperformance, indicating the larger training dataset is essential for an accurate recommendation.While for the selection of neighborhood size, the recommendation accuracy increaseswhen the neighborhood size is becoming bigger, and becomes stable after the neighborhoodsize reaches a certain value. The observed result implies that an appropriate neighborhoodsize achieves the best recommendation outcome, suggesting that the choosing a large numberof neighbors will only increase the computation cost but not benefit the recommendations.Similarly, in Fig.8.4, the recommendation comparisons of model-based and user-based CF al-Fig. 8.3. The impact of parameter x and neighborhood size [218]gorithms are carried out in terms of parameter x and neighborhood size. From the figure, it is
8.2 A Hybrid User-based and Item-based Web Recommendation System 175seen that the proposed model-based CF algorithms consistently outperform the user-based CFalgorithms.Fig. 8.4. The recommendation comparisons of item-based and user-based collaborative filteringalgorithms [218]8.2 A Hybrid User-based and Item-based Web RecommendationSystemIn this section, we will introduce a strategy which is the combination of the User-basedand Item-based Web recommendation system reported in [212]. Collaborative Filtering (CF)-based recommender systems are indispensable tools to find items of interest from the unmanageablenumber of available items. Moreover, companies who deploy a CF-based recommendersystem may be able to increase revenue by drawing customers’ attention to items thatthey are likely to buy. However, many algorithms proposed thus far, where the principal concernis recommendation quality, may be too expensive to operate in a large-scale system. Toaddress this, a hybrid strategy which combine user-based and item-based recommender systemis proposed by Rashid et al. [212]. Such strategy is simple and intuitive which is well suitedfor large data sets. In this section, we discuss CLUSTKNN, a hybrid CF algorithm based onclustering techniques, as a way to overcome this saclability challenge. By applyingcomplexity analysis, we analytically demonstrate the performance advantages thatCLUSTKNN has over traditional CF algorithms. In addition, we present some empiricalmeasurements of the performance and recommendation accuracy of CLUSTKNN andseveral other algorithms.8.2.1 Problem DomainAs it is introduced in the last section, a collaborative filtering domain consists of a setof n customers of users {u 1 ,u 2 ,...,u n }, a set of m products or items{a 1 ,a 2 ,...,a m },and users’ preferences on items. Typically, each user only expresses her preferencesfor a small number of items. In other words, the corresponding user × item matrix isvery sparse.
Page 2 and 3:
Web Mining and Social Networking
Page 4:
Guandong Xu • Yanchun Zhang • L
Page 8 and 9:
VIIIPrefacefollowing characteristic
Page 11:
Acknowledgements: We would like to
Page 14 and 15:
XIVContents3.1.2 Basic Algorithms f
Page 16 and 17:
XVIContentsPart III Social Networki
Page 19:
Part IFoundation
Page 22 and 23:
4 1 Introduction(3). Learning usefu
Page 24 and 25:
6 1 Introductioncalled computationa
Page 26 and 27:
8 1 Introduction• The data on the
Page 28 and 29:
10 1 Introductionin a broad range t
Page 31 and 32:
2Theoretical BackgroundsAs discusse
Page 33 and 34:
2.2 Textual, Linkage and Usage Expr
Page 35 and 36:
2.4 Eigenvector, Principal Eigenvec
Page 37 and 38:
2.5 Singular Value Decomposition (S
Page 39 and 40:
2.6 Tensor Expression and Decomposi
Page 41 and 42:
2.7 Information Retrieval Performan
Page 43 and 44:
2.8 Basic Concepts in Social Networ
Page 45:
2.8 Basic Concepts in Social Networ
Page 48 and 49:
30 3 Algorithms and TechniquesTable
Page 50 and 51:
32 3 Algorithms and TechniquesSpeci
Page 52 and 53:
34 3 Algorithms and Techniquesa sub
Page 54 and 55:
36 3 Algorithms and TechniquesMetho
Page 56 and 57:
38 3 Algorithms and TechniquesCusto
Page 58 and 59:
40 3 Algorithms and TechniquesTable
Page 60 and 61:
42 3 Algorithms and Techniquesa bSI
Page 62 and 63:
44 3 Algorithms and Techniques{a}10
Page 64 and 65:
46 3 Algorithms and Techniques3.2 S
Page 66 and 67:
48 3 Algorithms and TechniquesConce
Page 68 and 69:
50 3 Algorithms and TechniquesNaive
Page 70 and 71:
52 3 Algorithms and Techniquesuses
Page 72 and 73:
54 3 Algorithms and Techniquesin th
Page 74 and 75:
56 3 Algorithms and Techniques// Fu
Page 76 and 77:
58 3 Algorithms and Techniquesendd
Page 78 and 79:
60 3 Algorithms and Techniquesstart
Page 80 and 81:
62 3 Algorithms and TechniquesHere
Page 82 and 83:
64 3 Algorithms and Techniques3.8.2
Page 84 and 85:
66 3 Algorithms and Techniquesfor e
Page 86 and 87:
68 3 Algorithms and Techniquesthat
Page 89 and 90:
4Web Content MiningIn recent years
Page 91 and 92:
score(q,d)=4.2 Web Search 73V(q) ·
Page 93 and 94:
4.2 Web Search 75algorithm. The Web
Page 95 and 96:
4.3 Feature Enrichment of Short Tex
Page 97 and 98:
4.4 Latent Semantic Indexing 794.4
Page 99 and 100:
Notation4.5 Automatic Topic Extract
Page 101 and 102:
4.5 Automatic Topic Extraction from
Page 103 and 104:
4.6 Opinion Search and Opinion Spam
Page 105:
4.6 Opinion Search and Opinion Spam
Page 108 and 109:
90 5 Web Linkage Mining5.2 Co-citat
Page 110 and 111:
92 5 Web Linkage Mining{ /1 out deg
Page 112 and 113:
94 5 Web Linkage Mininga =(a(1),·
Page 114 and 115:
96 5 Web Linkage Mining5.4.1 Bipart
Page 116 and 117:
98 5 Web Linkage MiningNext, consid
Page 118 and 119:
100 5 Web Linkage Mining(5) Creatin
Page 120 and 121:
102 5 Web Linkage Miningpower-law d
Page 122 and 123:
104 5 Web Linkage MiningFig. 5.10.
Page 124 and 125:
106 5 Web Linkage Miningbetween use
Page 126 and 127:
6Web Usage MiningIn previous chapte
Page 129 and 130:
6.1 Modeling Web User Interests usi
Page 131 and 132:
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
6.2 Web Usage Mining using Probabil
Page 139 and 140:
6.2 Web Usage Mining using Probabil
Page 141 and 142: 6.2 Web Usage Mining using Probabil
Page 143 and 144: 6.3 Finding User Access Pattern via
Page 149 and 150: 6.4 Co-Clustering Analysis of weblo
Page 151 and 152: 6.5 Web Usage Mining Applications 1
Page 161: Part IIISocial Networking and Web R
Page 164 and 165: 146 7 Extracting and Analyzing Web
Page 188 and 189: 170 8 Web Mining and Recommendation
Page 208 and 209: 190 9 Conclusionsries commonly used
Page 210 and 211: 192 9 Conclusionsas computer scienc
Page 212 and 213: 194 9 Conclusionsresearches have de
Page 214 and 215: 196 References14. J. Ayres, J. Gehr
Page 216 and 217: 198 References49. D. Chakrabarti, R
Page 218 and 219: 200 References82. C. Dwork, R. Kuma
Page 220 and 221: 202 References119. J. Hou and Y. Zh
Page 222 and 223: 204 References151. A. N. Langville
Page 224 and 225: 206 References186. J. K. Mui and K.
Page 226 and 227: 208 References223. C. Shahabi, A. M
Page 228: 210 References260. G.-R. Xue, D. Sh
show all

Web Mining and Social Networking: Techniques and ... - tud.ttu.ee

Create successful ePaper yourself

Delete template?

Save as template?