Download - Academy Publisher
Download - Academy Publisher
Download - Academy Publisher
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
ISBN 978-952-5726-09-1 (Print)<br />
Proceedings of the Second International Symposium on Networking and Network Security (ISNNS ’10)<br />
Jinggangshan, P. R. China, 2-4, April. 2010, pp. 182-185<br />
Research on Tag-based Collaborative Filtering<br />
Strategy<br />
Shuo Wan 1 , and Huizhong Qiu 2<br />
1 School of Software, University of<br />
Electronic Science and Technology of China, Chengdu, P.R.China<br />
Email: wanshuo168@gmail.com<br />
2 School of Computer Science and Engineering, University of<br />
Electronic Science and Technology of China, Chengdu, P.R.China<br />
Email: hzqiu@uestc.edu.cn<br />
Abstract—Recommendation technology is designed to take<br />
the initiative to recommend using the user's history<br />
behavior information, without requiring users to explicitly<br />
specify the query case information. Collaborative filtering is<br />
the most widely recommended technique. However, some<br />
problems of the traditional collaborative filtering<br />
recommendation system still exist, and these problems<br />
significantly affect the recommended results. Tag system as<br />
the essential functions of Web2.0 websites in recent years<br />
has been very widely used. This article will combine the tag<br />
information with the collaborative filtering recommendation,<br />
and recommend resources by recommending tags. By<br />
analyzing a problem of the traditional collaborative filtering<br />
strategy, this experiment proves tag-based recommendation<br />
strategy can effectively solve these problems and improve<br />
the accuracy of recommendation.<br />
Index Terms—tag, collaborative filtering, recommendation,<br />
similarity, preference<br />
I. TRADITIONAL USER-BASED SIMILARITY<br />
MODEL ALGORITHM<br />
Collaborative filtering assumes that a user in the<br />
system and his similar groups both have the similar<br />
preferences on the system resources, and the users have<br />
the similar preference on the similar resources. Thus<br />
mining the collective wisdom embedded in massive data,<br />
and predicting the individuals with the similar groups.<br />
The traditional user-based similarity collaborative<br />
filtering algorithm is mainly in two steps [6]:<br />
1. To calculate the user's similarity with other users,<br />
and receive the top N preference similar users.<br />
2. Based on the similar user's preferences to predict the<br />
user's preferences, and recommend resources.<br />
Formal description is as follows:<br />
U = { u1, u2,..., u m<br />
} stands for m users collection,<br />
I = { i1, i2,..., i n<br />
} stands for n resources collection, the<br />
preferences of the user on the resources can make a<br />
matrix R (shown as follows), r<br />
ui ,<br />
in matrix R stands<br />
for the preferences of user u on resource i , r<br />
ui ,<br />
= 0<br />
stands for the user u have not evaluated preferences on<br />
resource i .<br />
⎛r11 K r1<br />
n ⎞<br />
⎜<br />
⎟<br />
R = M O M<br />
⎜r<br />
r ⎟<br />
⎝ L ⎠<br />
m1<br />
mn<br />
1. The user’s similarity calculation is based on the row<br />
vector of the matrix R . Calculate cosine similarity of the<br />
vector.<br />
s<br />
uv ,<br />
=<br />
i∈I<br />
i∈I<br />
∑<br />
∑<br />
uv ,<br />
r<br />
r<br />
⋅ r<br />
ui , vi ,<br />
∑<br />
2 2<br />
ui , vi ,<br />
i∈I<br />
uv , uv ,<br />
I<br />
uv , is the intersection of user u and v ’s preference<br />
resources.<br />
2. Select the top N preference similar users. Predict the<br />
user’s preference.<br />
rui ,<br />
= ru +∂ ∑ suv ,<br />
⋅( rvi ,<br />
−rv)<br />
(2)<br />
v∈S<br />
u<br />
∩U<br />
S<br />
u<br />
is the similarity of the top-N users. i<br />
i<br />
r<br />
(1)<br />
U is the<br />
users’ evaluate preference on resource i ,<br />
∂= ∑ v∈ Su ∩ Ui.<br />
1/ s v uv ,<br />
II. THE PROBLEM EXISTED IN USER-BASED<br />
SIMILARITY MODEL<br />
The traditional user-based similarity model is based on<br />
two underlying assumptions:<br />
1. There is only one user's preferences similar model,<br />
and using the only value generated by this similarity<br />
model will be able to determine the similarity of the<br />
user's preferences.<br />
2. The preference similarity on one type of the<br />
resources produced by this model can also be applied to<br />
other types.<br />
But there are some problems in these two assumptions.<br />
The article will illustrate the unreasonableness with the<br />
following examples.<br />
© 2010 ACADEMY PUBLISHER<br />
AP-PROC-CS-10CN006<br />
182