15.07.2013 Views

EE592 Robocode Project 2009 - Courses

EE592 Robocode Project 2009 - Courses

EE592 Robocode Project 2009 - Courses

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

EECE 592 <strong>Robocode</strong> <strong>Project</strong><br />

Report Guide<br />

As mentioned earlier, function approximation of an RL value or Q-function is in fact a<br />

topic of research (for example see http://rlai.cs.ualberta.ca/RLAI/RLFA.html). That it,<br />

there is no clear or well-defined solution guaranteed to work in all cases. In fact,<br />

successful application of function approximation to RL is a delicate art! There is<br />

obviously then, no correct or right answer that I will be looking for. Your understanding<br />

and expertise expressed through your report will be key to attaining a good mark.<br />

Your report should be well structured, written clearly and demonstrate an understanding<br />

of the backpropagation and reinforcement learning paradigms. For each of these, it<br />

should describe the problem being addressed and provide an analysis of how that learning<br />

mechanism was applied. It is important that you describe how your solution was<br />

evaluated and offer a conclusion. Pay attention to your results and be scientific in your<br />

approach.<br />

To help you, the following set of questions provides a guide for what your report should<br />

contain and how it will be marked. Try to be as thorough and clear as possible with your<br />

answers. The answers to many of these questions are not unique. You'll be judged based<br />

on what you can deduce from your experiments and how well you understand the theory.<br />

Please also format your report such that you precede each answer with a copy of the<br />

question in italics.<br />

Important Note: Inline with Departmental policy, while I encourage you to discuss your<br />

project work with your fellow students, the entire report must be written<br />

IN YOUR OWN WORDS. In previous years students have been<br />

penalized for sections of text which where copied, verbatim, from other<br />

assignments done in either current or previous years. If you have any<br />

concerns or questions, please do not hesitate to ask.<br />

Schedule and Weighting<br />

Part 1 BP Learning Due 11 Nov <strong>2009</strong> 20%<br />

Part 2 RL with LUT Due 02 Dec <strong>2009</strong> 40%<br />

Part 3 RL with Backpropagation Due 16 Dec <strong>2009</strong> 40%<br />

Important note:<br />

Please consider the above weightings as a guide only. The final weightings will not<br />

be known until I have completed the marking schedule.<br />

S. Sarkaria <strong>2009</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!