Web Mining and Social Networking: Techniques and ... - tud.ttu.ee

More documents

Recommendations

Info

190 9 Conclusionsries commonly used in matrix-based analysis are presented accordingly, such as matrixeigenvalue, eigenvector; norm, singular value decomposition (SVD) of matrix aswell as three-way tensor expression and decomposition. In addition, a number of wellknown performance evaluation metrics in the context of information retrieval andrecommendation are reviewed, and the basic concepts in social networks are summarizedas well. The chapter of algorithms and techniques covers three main aspectsof contents - fundamental data mining algorithms, such as association rules, sequentialpatterns, Markov models and Bayesian networks, clustering and classification;Web recommendation algorithms, e.g. content-based, user-based, model-based andkNN; and the detection and evolution analysis algorithms of social networks. Then,the book presents comprehensive materials on two-level focuses of how to utilizeWeb data mining to capture the inherent cohesion between various Web pages andbetween pages and users, and how to exploit or extend Web data mining in socialand collaborative applications.The former is covered in Chapter.4, Chapter.5 and Chapter.6, in which we systematicallydiscuss the research and application issues of Web mining from the differentperspectives of Web content, linkage and usage mining. Chapter 4 presentsmaterials about Web content mining. Following the vector space model, Web searchis first addressed to cover the methodologies of crawling, archiving and indexingcontent, and searching strategies. To overcome the challenges of sparse and lowoverlapping of textual features embedded in pages, feature enrichment and latentsemantic analysis methods are given sequentially. Moreover, two extended applicationsof content analysis in automatic topic extraction and opinion mining from Webdocuments are demonstrated the application potentials in this domain. Chapter 5 ismainly talking about another important issue in Web mining, i.e. Web linkage mining.Starting with the principles of co-citation and bibliographic coupling, which isfrom information science, this chapter presents highly summarized materials on twowell-known algorithms in Web search, namely PageRank and HITS, which are thoroughlyand substantially investigated and cited in a large amount of literatures afterthe great success of Google search engine. In addition to these two algorithms, thischapter also present the topic of Web community discovery as well. The conceptsand algorithms like bipartite cores, network flow, cut-based notations of communitiesand Web community chart are substantially discussed along with the theoriesof Web graph measurement and modeling. An extended application of Web linkageanalysis for Web page classification is proposed to show how the linkage analysis facilitatesWeb page organization and presentation. Different from Chap.4 and Chap.5,in which the Web mining is mainly performed on Web pages standalone, Chapter 6reports the research and application progresses from the point of view of interactionbetween users and machines, i.e. Web usage mining. By introducing the usage datamodel of user-pageview matrix, this chapter first gives the idea of modeling usernavigation interest in terms of page-weight pair vector, and propose clustering-basedalgorithms to measure the similarity of user interest, and in turn, to find user navigationalpatterns. In additional to clustering, latent semantic analysis and its variantsare explored to be employed in Web usage mining. Two recently well studied latentsemantic analysis algorithms, namely PLSA and LDA, are presented and elaborated
9.2 Future Directions 191to show the procedures of capturing the underlying navigation tasks and forming theuser access patterns from Web logs via probability inference approaches. The combinationof clustering and latent semantic analysis is fully addressed as well. Then anumber of Web usage mining applications are reviewed in this chapter to emphasizethe application potentials along with some experimental studies.The latter mission of this book is reflected in Chapter.7 and Chapter.8, where tworecently active and popular topics - social networks and Web recommendation, arecovered. Following the basic backgrounds discussed in Chap.2 and Chap.3, Chapter7 concentrates mainly on a few important technical issues of social networking.It presents some algorithms with respect to detecting, extracting and analyzing theWeb community structures and social networks and their dynamic evolutions by usingWeb data mining. We discuss the approach of using Web archive and graph tocapture the Web community and its evolution, which is based on graph mining andWeb community discovery approaches. In addition, we report the studies of temporalanalysis of Web networked structures using three-way tensor analysis. The additionaldimension of temporal feature makes it possible to capture the dynamic change ofnetworked structures at a high level of spatial-temporal space. The reported work oncombining social network discovery and evolution analysis provides an instructivehint in a unified fashion of social network analysis. We aim to present the materialsof social network analysis from the perspectives of longitudinal, evolutionary andunified aspects. Apart from the algorithmic researches, we also give an real worldstudy of social network analysis in the context of societal and social behavior ine-commerce. Chapter 8 talks about the topic of Web recommendation. We aim topresent materials following a thread of Web data mining. After introducing algorithmsand techniques of the traditional recommender systems, such as user-base,item-based and a hybrid recommender systems, we aim to illustrate how Web miningis able to help improve the recommendation. Especially we report several studiesin Web recommendation via Web mining. In usage-based user profiling approaches,we discuss the algorithmic descriptions on using the user access patterns derivedwith Web usage mining to predict the users’ more interested contents. We also reviewthe study of combining Web archives and logs (i.e. the combination of Webcontent and usage mining) for Web query recommendation. With th potential capabilityof Web recommendation in improving user satisfaction and enterprise marketvalue, this chapter prepares a progressive landscaping of the start-of-the-art Web recommendation.Next, we will outline some future research directions in the area ofWeb mining and social networking, focusing on the issues of combination of thesetwo aspects, and social media and social network computing because it attracts alarge volume of attentions from various disciplines.9.2 Future DirectionsWith the coming era of Web 2.0 and propagation of related technologies and applications,social media mining and social network computing is becoming an activeinterdisciplinary area, which attracts attention from different research areas, such
Page 2 and 3:
Web Mining and Social Networking
Page 4:
Guandong Xu • Yanchun Zhang • L
Page 8 and 9:
VIIIPrefacefollowing characteristic
Page 11:
Acknowledgements: We would like to
Page 14 and 15:
XIVContents3.1.2 Basic Algorithms f
Page 16 and 17:
XVIContentsPart III Social Networki
Page 19:
Part IFoundation
Page 22 and 23:
4 1 Introduction(3). Learning usefu
Page 24 and 25:
6 1 Introductioncalled computationa
Page 26 and 27:
8 1 Introduction• The data on the
Page 28 and 29:
10 1 Introductionin a broad range t
Page 31 and 32:
2Theoretical BackgroundsAs discusse
Page 33 and 34:
2.2 Textual, Linkage and Usage Expr
Page 35 and 36:
2.4 Eigenvector, Principal Eigenvec
Page 37 and 38:
2.5 Singular Value Decomposition (S
Page 39 and 40:
2.6 Tensor Expression and Decomposi
Page 41 and 42:
2.7 Information Retrieval Performan
Page 43 and 44:
2.8 Basic Concepts in Social Networ
Page 45:
2.8 Basic Concepts in Social Networ
Page 48 and 49:
30 3 Algorithms and TechniquesTable
Page 50 and 51:
32 3 Algorithms and TechniquesSpeci
Page 52 and 53:
34 3 Algorithms and Techniquesa sub
Page 54 and 55:
36 3 Algorithms and TechniquesMetho
Page 56 and 57:
38 3 Algorithms and TechniquesCusto
Page 58 and 59:
40 3 Algorithms and TechniquesTable
Page 60 and 61:
42 3 Algorithms and Techniquesa bSI
Page 62 and 63:
44 3 Algorithms and Techniques{a}10
Page 64 and 65:
46 3 Algorithms and Techniques3.2 S
Page 66 and 67:
48 3 Algorithms and TechniquesConce
Page 68 and 69:
50 3 Algorithms and TechniquesNaive
Page 70 and 71:
52 3 Algorithms and Techniquesuses
Page 72 and 73:
54 3 Algorithms and Techniquesin th
Page 74 and 75:
56 3 Algorithms and Techniques// Fu
Page 76 and 77:
58 3 Algorithms and Techniquesendd
Page 78 and 79:
60 3 Algorithms and Techniquesstart
Page 80 and 81:
62 3 Algorithms and TechniquesHere
Page 82 and 83:
64 3 Algorithms and Techniques3.8.2
Page 84 and 85:
66 3 Algorithms and Techniquesfor e
Page 86 and 87:
68 3 Algorithms and Techniquesthat
Page 89 and 90:
4Web Content MiningIn recent years
Page 91 and 92:
score(q,d)=4.2 Web Search 73V(q) ·
Page 93 and 94:
4.2 Web Search 75algorithm. The Web
Page 95 and 96:
4.3 Feature Enrichment of Short Tex
Page 97 and 98:
4.4 Latent Semantic Indexing 794.4
Page 99 and 100:
Notation4.5 Automatic Topic Extract
Page 101 and 102:
4.5 Automatic Topic Extraction from
Page 103 and 104:
4.6 Opinion Search and Opinion Spam
Page 105:
4.6 Opinion Search and Opinion Spam
Page 108 and 109:
90 5 Web Linkage Mining5.2 Co-citat
Page 110 and 111:
92 5 Web Linkage Mining{ /1 out deg
Page 112 and 113:
94 5 Web Linkage Mininga =(a(1),·
Page 114 and 115:
96 5 Web Linkage Mining5.4.1 Bipart
Page 116 and 117:
98 5 Web Linkage MiningNext, consid
Page 118 and 119:
100 5 Web Linkage Mining(5) Creatin
Page 120 and 121:
102 5 Web Linkage Miningpower-law d
Page 122 and 123:
104 5 Web Linkage MiningFig. 5.10.
Page 124 and 125:
106 5 Web Linkage Miningbetween use
Page 126 and 127:
6Web Usage MiningIn previous chapte
Page 129 and 130:
6.1 Modeling Web User Interests usi
Page 131 and 132:
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
6.2 Web Usage Mining using Probabil
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
6.3 Finding User Access Pattern via
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
6.4 Co-Clustering Analysis of weblo
Page 151 and 152:
6.5 Web Usage Mining Applications 1
Page 153 and 154:
Page 155 and 156:
Page 157 and 158: 6.5 Web Usage Mining Applications 1
Page 159 and 160: 6.5 Web Usage Mining Applications 1
Page 161: Part IIISocial Networking and Web R
Page 164 and 165: 146 7 Extracting and Analyzing Web
Page 188 and 189: 170 8 Web Mining and Recommendation
Page 210 and 211: 192 9 Conclusionsas computer scienc
Page 212 and 213: 194 9 Conclusionsresearches have de
Page 214 and 215: 196 References14. J. Ayres, J. Gehr
Page 216 and 217: 198 References49. D. Chakrabarti, R
Page 218 and 219: 200 References82. C. Dwork, R. Kuma
Page 220 and 221: 202 References119. J. Hou and Y. Zh
Page 222 and 223: 204 References151. A. N. Langville
Page 224 and 225: 206 References186. J. K. Mui and K.
Page 226 and 227: 208 References223. C. Shahabi, A. M
Page 228: 210 References260. G.-R. Xue, D. Sh
show all

Web Mining and Social Networking: Techniques and ... - tud.ttu.ee

Create successful ePaper yourself

Delete template?

Save as template?