Tiled Matrix Multiplication lecture 21 jan 2013.pdf
Tiled Matrix Multiplication lecture 21 jan 2013.pdf
Tiled Matrix Multiplication lecture 21 jan 2013.pdf
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Breaking Md and Nd into Tiles• Break up the innerproduct loop of eachthread into phases• At the beginning of eachphase, load the Md andNd elements thateveryone needs duringthe phase into sharedmemory• Everyone access the Mdand Nd elements from theshared memory duringthe phaseNd 1,0 Nd 0,0Nd 0,1 Nd 1,1Nd 0,2 Nd 1,2Nd 0,3 Nd 1,3Md 0,0 Md 1,0 Md 2,0 Md 3,0 Pd 0,0 Pd 1,0 Pd 2,0 Pd 3,0Md 0,1 Md 1,1 Md 2,1 Md 3,1 Pd 0,1 Pd 1,1 Pd 2,1 Pd 3,1Pd 0,2 Pd 1,2 Pd 2,2 Pd 3,2Pd 0,3 Pd 1,3 Pd 2,3 Pd 3,34