23.02.2015 Views

Machine Learning - DISCo

Machine Learning - DISCo

Machine Learning - DISCo

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

102 MACHINE LEARNING<br />

chain rule to write<br />

Given Equation (4.22), our remaining task is to derive a convenient expression<br />

for z. We consider two cases in turn: the case where unit j is an output unit<br />

for the network, and the case where j is an internal unit.<br />

Case 1: raini in^ Rule for Output Unit Weights. Just as wji can influence the<br />

rest of the network only through net,, net, can influence the network only through<br />

oj. Therefore, we can invoke the chain rule again to write<br />

To begin, consider just the first term in Equation (4.23)<br />

The derivatives &(tk - ok12 will be zero for all output units k except when k = j.<br />

We therefore drop the summation over output units and simply set k = j.<br />

Next consider the second term in Equation (4.23). Since oj = a(netj), the<br />

derivative $ is just the derivative of the sigmoid function, which we have<br />

already noted is equal to a(netj)(l - a(netj)). Therefore,<br />

Substituting expressions (4.24) and (4.25) into (4.23), we obtain

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!