12.01.2015 Views

Download - Academy Publisher

Download - Academy Publisher

Download - Academy Publisher

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

ISBN 978-952-5726-09-1 (Print)<br />

Proceedings of the Second International Symposium on Networking and Network Security (ISNNS ’10)<br />

Jinggangshan, P. R. China, 2-4, April. 2010, pp. 182-185<br />

Research on Tag-based Collaborative Filtering<br />

Strategy<br />

Shuo Wan 1 , and Huizhong Qiu 2<br />

1 School of Software, University of<br />

Electronic Science and Technology of China, Chengdu, P.R.China<br />

Email: wanshuo168@gmail.com<br />

2 School of Computer Science and Engineering, University of<br />

Electronic Science and Technology of China, Chengdu, P.R.China<br />

Email: hzqiu@uestc.edu.cn<br />

Abstract—Recommendation technology is designed to take<br />

the initiative to recommend using the user's history<br />

behavior information, without requiring users to explicitly<br />

specify the query case information. Collaborative filtering is<br />

the most widely recommended technique. However, some<br />

problems of the traditional collaborative filtering<br />

recommendation system still exist, and these problems<br />

significantly affect the recommended results. Tag system as<br />

the essential functions of Web2.0 websites in recent years<br />

has been very widely used. This article will combine the tag<br />

information with the collaborative filtering recommendation,<br />

and recommend resources by recommending tags. By<br />

analyzing a problem of the traditional collaborative filtering<br />

strategy, this experiment proves tag-based recommendation<br />

strategy can effectively solve these problems and improve<br />

the accuracy of recommendation.<br />

Index Terms—tag, collaborative filtering, recommendation,<br />

similarity, preference<br />

I. TRADITIONAL USER-BASED SIMILARITY<br />

MODEL ALGORITHM<br />

Collaborative filtering assumes that a user in the<br />

system and his similar groups both have the similar<br />

preferences on the system resources, and the users have<br />

the similar preference on the similar resources. Thus<br />

mining the collective wisdom embedded in massive data,<br />

and predicting the individuals with the similar groups.<br />

The traditional user-based similarity collaborative<br />

filtering algorithm is mainly in two steps [6]:<br />

1. To calculate the user's similarity with other users,<br />

and receive the top N preference similar users.<br />

2. Based on the similar user's preferences to predict the<br />

user's preferences, and recommend resources.<br />

Formal description is as follows:<br />

U = { u1, u2,..., u m<br />

} stands for m users collection,<br />

I = { i1, i2,..., i n<br />

} stands for n resources collection, the<br />

preferences of the user on the resources can make a<br />

matrix R (shown as follows), r<br />

ui ,<br />

in matrix R stands<br />

for the preferences of user u on resource i , r<br />

ui ,<br />

= 0<br />

stands for the user u have not evaluated preferences on<br />

resource i .<br />

⎛r11 K r1<br />

n ⎞<br />

⎜<br />

⎟<br />

R = M O M<br />

⎜r<br />

r ⎟<br />

⎝ L ⎠<br />

m1<br />

mn<br />

1. The user’s similarity calculation is based on the row<br />

vector of the matrix R . Calculate cosine similarity of the<br />

vector.<br />

s<br />

uv ,<br />

=<br />

i∈I<br />

i∈I<br />

∑<br />

∑<br />

uv ,<br />

r<br />

r<br />

⋅ r<br />

ui , vi ,<br />

∑<br />

2 2<br />

ui , vi ,<br />

i∈I<br />

uv , uv ,<br />

I<br />

uv , is the intersection of user u and v ’s preference<br />

resources.<br />

2. Select the top N preference similar users. Predict the<br />

user’s preference.<br />

rui ,<br />

= ru +∂ ∑ suv ,<br />

⋅( rvi ,<br />

−rv)<br />

(2)<br />

v∈S<br />

u<br />

∩U<br />

S<br />

u<br />

is the similarity of the top-N users. i<br />

i<br />

r<br />

(1)<br />

U is the<br />

users’ evaluate preference on resource i ,<br />

∂= ∑ v∈ Su ∩ Ui.<br />

1/ s v uv ,<br />

II. THE PROBLEM EXISTED IN USER-BASED<br />

SIMILARITY MODEL<br />

The traditional user-based similarity model is based on<br />

two underlying assumptions:<br />

1. There is only one user's preferences similar model,<br />

and using the only value generated by this similarity<br />

model will be able to determine the similarity of the<br />

user's preferences.<br />

2. The preference similarity on one type of the<br />

resources produced by this model can also be applied to<br />

other types.<br />

But there are some problems in these two assumptions.<br />

The article will illustrate the unreasonableness with the<br />

following examples.<br />

© 2010 ACADEMY PUBLISHER<br />

AP-PROC-CS-10CN006<br />

182

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!