27.03.2014 Views

SEKE 2012 Proceedings - Knowledge Systems Institute

SEKE 2012 Proceedings - Knowledge Systems Institute

SEKE 2012 Proceedings - Knowledge Systems Institute

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

such as one week and gradually deteriorate as statistical<br />

period goes long, this may indicate that users’ preference<br />

varies fast, the most popular products this week will be different<br />

from those of next week. Item-to-item-CF performs<br />

better when time goes by because it really needs some time<br />

to collect users’ visiting or purchasing data for computing<br />

item similarity.<br />

5. Acknowledgment<br />

Figure 2. Top-5 precision of different time period<br />

Content-based method.<br />

User-based-CF: user-based-CF performs much worse<br />

than item-to-item-CF mainly because the key point of this<br />

algorithm is accurately computation of the user similarity.<br />

As mentioned in Data Set section, on average every user<br />

visits only 5 products in the site, so it is severely lack of<br />

co-visited products of users to compute user similarity. Another<br />

factor of user-based-CF’s bad performance is the same<br />

as content-based method, we have no ratings to know users’<br />

preference to items. To get rating information needs users’<br />

extra effort. Because the number of user is much large<br />

than that of item,computing user similarity costs more than<br />

computing item similarity, that is why User-based-CF costs<br />

more than Item-to-item-CF.<br />

Most-popular: This simple benchmark does a good job<br />

in our experiment because of its high cost performance.<br />

This method needs not much information such as item feature<br />

information, user rating and calculation cost is also low.<br />

It only needs the statical information of users’ visiting or<br />

purchasing information.<br />

Cheapest&Newest: These two methods derived from<br />

common business sense do not surprise us, but we still recommend<br />

to have a try on these simplest methods because<br />

sometimes simple does not mean ineffective. To make the<br />

recommendation process complex and obscure is not the<br />

objective of recommendation.<br />

What we also want to discuss is the time value of the<br />

data. We choose item-to-item-CF and most popular which<br />

get relatively better performance to test how time factor affect<br />

results. Figure 2 shows the result of these two methods<br />

in different time period.<br />

We can figure out data from different time period will<br />

affect result a lot, and also the effect to different methods<br />

is different. Most-popular perform better in a short period<br />

The work described in this article was partially supported<br />

by the Humanities and Social Sciences Foundation<br />

of Ministry of Education of China (10YJC870020,<br />

10YJC630283), the National Natural Science Foundation<br />

of China(11171148, 61003024). The authors would like to<br />

thank the industrial partner for sharing data.<br />

References<br />

[1] G. Adomavicius and A. Tuzhilin. Toward the next generation<br />

of recommender systems: A survey of the stateof-the-art<br />

and possible extensions. IEEE Trans. Knowl.<br />

Data Eng., 17(6):734–749, 2005.<br />

[2] M. Balabanovic and Y. Shoham. Fab: Contentbased,<br />

collaborative recommendation. Commun. ACM,<br />

40(3):66–72, November 1997.<br />

[3] W. Hill, L. Stead, M. Rosenstein, and G. Furnas. Recommending<br />

and evaluating choices in a virtual community<br />

of use. CHI ’95, pages 194–201, New York, NY,<br />

USA, 1995.<br />

[4] J. A. Konstan, B. N. Miller, D. Maltz, J. L. Herlocker,<br />

and L. R. G. J. Riedl. Grouplens: Applying collaborative<br />

filtering to usenet news. Commun. ACM, 40(3):77–<br />

87, 1997.<br />

[5] G. Linden, B. Smith, and J. York. Amazon.com recommendations:<br />

Item-to-item collaborative filtering. IEEE<br />

INTERNET COMPUTING, 7(1):76–80, 2003.<br />

[6] E. Ngai, L. Xiu, and D. Chau. Application of data<br />

mining techniques in customer relationship management:<br />

A literature review and classification. Expert<br />

Syst. Appl., 36(2):2592–2602, 2009.<br />

[7] X. Su and T. M. Khoshgoftaar. A survey of collaborative<br />

filtering techniques. Adv. in Artif. Intell., January<br />

2009.<br />

[8] A. Toscher and M. Jahrer. The bigchaos solution to the<br />

netflix grand prize. 2001.<br />

142

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!