12.01.2015 Views

Download - Academy Publisher

Download - Academy Publisher

Download - Academy Publisher

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

A. Problem Description<br />

In a small book recommendation system, the user<br />

collection is{ u0, u1, u2,..., u<br />

5}<br />

, the resource collection is<br />

{ i0, i1, i2,..., i<br />

5}<br />

, i 0<br />

, i 1<br />

, i 2<br />

are classified literature books,<br />

i3, i4,<br />

i<br />

5<br />

are classified mathematics books. The system<br />

use the evaluate-preference way to score the books, the<br />

rating interval is [1, 5]. The system needs to predict u<br />

0<br />

’s<br />

score to book i 5<br />

based on the other users’ score. And the<br />

system decides whether to recommend i 5<br />

tou 0<br />

.<br />

The score matrix table is shown as follows:<br />

TABLE I.<br />

SCORE MATRIX TABLE<br />

Res.<br />

User i i i i i i<br />

0 1 2 3 4 5<br />

u<br />

0 4 4 5 5 4 <br />

u<br />

1 5 3 4 1 2 1<br />

u<br />

2 3 4 5 2 2 1<br />

u<br />

3 4 3 3 3 1 1<br />

B. Problem Analysis<br />

The reason is that this method ignores the individuals'<br />

preference differences in various aspects of interest, and<br />

the obtained similarity is vague and not accurate. both<br />

like literature books and mathematics books, but under<br />

the user-based similarity model, the users that the system<br />

recommended are only interested in literature books.<br />

Therefore, using their preference degree of mathematics<br />

books to predict the preference degree of is not accurate.<br />

In other words, using traditional user-based similarity<br />

model to obtain the similar users is incomplete and does<br />

not cover the user's preferences.<br />

III. USER-BASED SIMILARITY MODEL<br />

Tags are a special kind of meta-data (metadata), and<br />

they are one of the necessary functions of Web2.0 sites<br />

[5]. They are from the tag-maker's subjective experience<br />

on resources, while they are also used to describe<br />

resources and classify resources by user. Tag information<br />

directly stands for the user's interest and preference, and<br />

it describes the relationship between users and resources<br />

and has a high potential value. The use of tag information<br />

can provides a useful supplement for collaborative<br />

filtering system, and it can improve the recommendation<br />

accuracy.<br />

u<br />

4 1 2 1 4 5 5<br />

u<br />

5 1 1 2 5 4 5<br />

Using the user-based similarity model collaborative<br />

filtering, each row of the score matrix is every user’s<br />

preference vector, for example, preference( u<br />

0) = {4,4,5,5,4} .<br />

Using Eq. (1), calculate the preference vectors<br />

similarity between user u 0<br />

and user u1...<br />

u<br />

5<br />

, and the<br />

results are sorted from high to low.<br />

su (<br />

0, u<br />

2)<br />

= 0.94;<br />

su (<br />

0, u<br />

3)<br />

= 0.94;<br />

su (<br />

0, u<br />

1)<br />

= 0.89;<br />

su (<br />

0, u<br />

5)<br />

= 0.87;<br />

su (<br />

0, u<br />

4)<br />

= 0.84;<br />

Suppose N=3, and select the top 3 preference similar<br />

users u1, u2,<br />

u<br />

3<br />

to predict u<br />

0<br />

’s score to book i 5<br />

. Using<br />

Eq. (2):<br />

ru (<br />

0, i<br />

5)<br />

= 2.73<br />

Therefore u<br />

0<br />

doesn’t like booki 5<br />

.<br />

This conclusion is clearly at odds with reality.<br />

Seeing fromu 0<br />

’s score to booki 3<br />

, i 4<br />

, u<br />

0<br />

is also very<br />

favorite to mathematics books. Therefore, u<br />

0<br />

is likely to<br />

also show a preference for booki 5<br />

.<br />

Figure 1. Traditional user-based similarity model<br />

Figure 2. User-based similarity model<br />

Tags classify the resources, and reflect the user's<br />

preferences. Through the user’s preference tag to<br />

recommend resources, the users can get a more accurate<br />

recommendation. The methods are as follows:<br />

The tags collections for user u and user v are T<br />

u<br />

and<br />

T<br />

v<br />

. The intersection T u<br />

∩ T v<br />

is the two users’ common<br />

183

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!