06.01.2013 Views

e-Research: A Journal of Undergraduate Work - Chapman University

e-Research: A Journal of Undergraduate Work - Chapman University

e-Research: A Journal of Undergraduate Work - Chapman University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

M. Shaffer<br />

amounts <strong>of</strong> data quicker than the conventional system which is the equivalent <strong>of</strong> the much slower one instance<br />

system in this experiment.<br />

Further research can be made into this field by exploring the speed-up <strong>of</strong> MapReduce in other types <strong>of</strong> programs,<br />

as well as exploring the speed-up <strong>of</strong> MapReduce Streaming jobs on AWS. A comparison <strong>of</strong> both types <strong>of</strong> jobs on<br />

the AWS and local server would yield interesting data into the general efficient <strong>of</strong> using the AWS cloud system as<br />

opposed to a local host. More research can also be done into exploring the optimal number <strong>of</strong> processes to work<br />

on a job by exploring speed-up vs Data size and finding the most efficient number <strong>of</strong> processes per unit <strong>of</strong> data and<br />

exploring to see if the AWS is more efficient at certain types <strong>of</strong> computations or programs then it is at other types<br />

<strong>of</strong> computations.<br />

References<br />

1. http://en.wikipedia.org/wiki/Markov_property<br />

2. http://hadoop.apache.org/common/docs/r0.20.2/mapred_tutorial.html<br />

3. http://www.gutenberg.org<br />

4. http://en.wikipedia.org/wiki/Markov_chain<br />

5. http://hadoop.apache.org/mapreduce/<br />

6. http://hadoop.apache.org/common/docs/r0.20.2/mapred_tutorial.html<br />

7. http://www.velocityreviews.com/forums/t138330-remove-punctuation-from-string.html<br />

8. http://stackoverflow.com/questions/265960/best-way-to-strip-punctuation-from-a-string-in-python<br />

9. http://hadoop.apache.org/common/docs/r0.18.3/streaming.html<br />

Appendix<br />

52 e-<strong>Research</strong>, Vol 2, No 2 (2011)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!