Dense Matrix Algorithms -- Chapter 8 Introduction
Diagram Of The Computational And Communication Process For 2-D Row Partition Matrix-Vector Multiplication
5/6/2003 densematrix 9<br />
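Since the diagram itself is not reproduced here, the process it depicts can be sketched in code. This is a minimal sequential simulation, assuming an n x n logical process grid with one matrix element per process and the vector initially stored in the last column of processes. The function name `matvec_2d` and the `vec` grid are hypothetical names chosen for illustration, and the loops stand in for messages that real processes would exchange.

```python
def matvec_2d(A, x):
    """Simulate y = A x on a logical n x n process grid (one element per process)."""
    n = len(A)
    # vec[i][j] is the vector element currently held by process (i, j).
    vec = [[None] * n for _ in range(n)]

    # Assumed initial distribution: x[i] is stored at process (i, n-1).
    for i in range(n):
        vec[i][n - 1] = x[i]

    # Communication step 1: process (i, n-1) sends x[i] to diagonal process (i, i).
    for i in range(n):
        vec[i][i] = x[i]

    # Communication step 2: diagonal process (j, j) broadcasts x[j] along column j.
    for j in range(n):
        for i in range(n):
            vec[i][j] = vec[j][j]

    # Computation: every process performs one local multiplication.
    prod = [[A[i][j] * vec[i][j] for j in range(n)] for i in range(n)]

    # Communication step 3: sum-reduce each row; y[i] ends up in the last column.
    return [sum(prod[i]) for i in range(n)]


print(matvec_2d([[1, 2], [3, 4]], [5, 6]))  # prints [17, 39]
```

The sketch only checks that the data movement yields the correct product; it does not model timing.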
Parallel Runtime Of 2-D Partition Algorithm
• The algorithm has three communication steps:
• The first (moving the vector from the last column to the diagonal) takes Θ(1)
• The second and third (broadcast and reduction) each take Θ(log n) -- see Table 4.1 with m = 1
• The computation takes Θ(1)<br />
• Thus, the parallel runtime is Θ(log n)
• The cost (process-time product) with n² processes is n² × Θ(log n) = Θ(n² log n)
• The serial algorithm takes Θ(n²), so the parallel algorithm is not cost optimal
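The cost argument above can be checked numerically with a rough sketch. It assumes unit constants for every Θ term (1 step for the alignment, ceil(log2 n) steps each for the broadcast and the reduction, 1 step for the multiplication) and n² processes. The function names are hypothetical, and the exact constants are illustrative only.

```python
import math


def parallel_time(n):
    """Parallel steps under the assumed unit constants."""
    log_n = math.ceil(math.log2(n))
    # alignment + broadcast + reduction + local multiplication
    return 1 + log_n + log_n + 1


def cost(n):
    """Process-time product with n * n processes."""
    return n * n * parallel_time(n)


def serial_work(n):
    """Multiplications in the serial n x n matrix-vector product."""
    return n * n


for n in (4, 16, 64):
    # The ratio of cost to serial work grows with log n instead of staying constant.
    print(n, cost(n) // serial_work(n))
# prints:
# 4 6
# 16 10
# 64 14
```

A cost-optimal algorithm would keep this ratio bounded by a constant as n grows; here it grows like log n.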