UNIVERSITY OF CALIFORNIA Los Angeles - Users - UCLA


innovative ways of visualizing data with other content, rather than simply directly representing data, and using computer screens as the dominant form.

Weekly YouTube⁴⁴ was created as a group project during the Statistical Computing class with Statistics Department students at UCLA. Interesting starting points for this project included using text-based data instead of numeric data, and representing it in an old media form (a paper), although the data comes from a new media platform, the YouTube.com website. We stored data from the website for one week, and were able to analyze a portion of each of the main categories, the most viewed videos, and people’s comments about particular videos during that period. We mainly used the Python Beautiful Soup⁴⁵ software to collect and store the data, along with other tools such as MySQL and R to analyze the stored data. Possible tools for this kind of project will be discussed in the next chapter.
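The collect, store, and analyze workflow described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's actual code: the sample markup, the class name `VideoParser`, and the functions `scrape`, `store`, and `category_shares` are all hypothetical, Python's standard `html.parser` stands in for Beautiful Soup, and an in-memory SQLite database stands in for MySQL so that the example runs without extra software. The final step computes per-category shares of the kind that could drive a layout like the one in Figure 28.

```python
import sqlite3
from html.parser import HTMLParser

# Hypothetical markup for illustration only; the real YouTube pages
# scraped in 2007 were structured differently.
SAMPLE_PAGE = """
<div class="video" data-category="Music"><a class="title">Song A</a></div>
<div class="video" data-category="Comedy"><a class="title">Sketch B</a></div>
<div class="video" data-category="Music"><a class="title">Song C</a></div>
<div class="video" data-category="News"><a class="title">Report D</a></div>
"""


class VideoParser(HTMLParser):
    """Collects (title, category) pairs from the hypothetical markup above."""

    def __init__(self):
        super().__init__()
        self.videos = []
        self._category = None
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div" and attrs.get("class") == "video":
            self._category = attrs.get("data-category")
        elif tag == "a" and attrs.get("class") == "title":
            self._in_title = True

    def handle_data(self, data):
        # Only text inside a title link counts as a video title.
        if self._in_title and self._category is not None:
            self.videos.append((data.strip(), self._category))

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_title = False


def scrape(html):
    """Parse one page and return a list of (title, category) tuples."""
    parser = VideoParser()
    parser.feed(html)
    return parser.videos


def store(conn, videos):
    """Save the scraped rows (SQLite here, standing in for MySQL)."""
    conn.execute("CREATE TABLE IF NOT EXISTS videos (title TEXT, category TEXT)")
    conn.executemany("INSERT INTO videos VALUES (?, ?)", videos)
    conn.commit()


def category_shares(conn):
    """Return each category's fraction of all stored videos."""
    rows = conn.execute(
        "SELECT category, COUNT(*) FROM videos GROUP BY category"
    ).fetchall()
    total = sum(count for _, count in rows)
    if total == 0:
        return {}
    return {category: count / total for category, count in rows}


conn = sqlite3.connect(":memory:")
store(conn, scrape(SAMPLE_PAGE))
shares = category_shares(conn)
```

With the sample page above, `shares` maps each category to its fraction of the stored videos; in a paper layout, a category holding half of the videos would then receive roughly half of the page space.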

Figure 27. Analyzed categories from the YouTube website during a one-week period

Figure 28. The portions of pages depend on the relevant percentages of the categories<br />

44 Fall, 2007 (Prof. Mark Hansen)

45 http://www.crummy.com/software/BeautifulSoup/ (accessed March 12, 2008). Beautiful Soup is a Python HTML/XML parser designed for quick-turnaround projects like screen-scraping.

J.KIM_A LANDSCAPE OF EVENTS 36
