01.12.2012 Views

c11.pdf

c11.pdf

c11.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Algoritmi de invatare<br />

Invatarea SARSA<br />

1. SSe iinitializeaza iti li Q( Q(s, a) ) iin mod d aleator l t<br />

2. Repeta<br />

1. Initializeaza starea s<br />

2. Repeta<br />

11. Alege actiunea a in functie de strategia aleasa (ε (ε-greedy greedy sau Softmax)<br />

2. Alege actiunea a’ din s’ folosing aceeasi strategie<br />

3. Q(s, a) = Q(s, a) + α[r + γQ(s’, a’) – Q(s, a)]<br />

4. s = s’; a = a’;<br />

3. Pana cand s este stare terminala<br />

3. Pana cand se intalneste conditia de oprire (un numar de iteratii)<br />

20

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!