Dense Matrix Algorithms -- Chapter 8 Introduction

More documents

Recommendations

Info

2 <strong>Matrix</strong>-Vector Multiplication y = Ax • Assumptions: • The matrix A is of size n×n • The number of processors is n, arranged in a linear array • The matrix A is distributed by rows, 1 row per processor • The vector x is initially distributed rowwise -- that is, one element per processor • Algorithm • Perform an all-to-all broadcast of x to all processors • Perform the vector times row dot product to produce a rowdistributed value for the vector y 5/6/2003 densematrix 3 Parallel Runtime • The all-to-all broadcast takes Θ(n) • n processors (p=n) and message of length m=1 from Table 4.1 on most networks • The operation time is also Θ(n) • Thus, the overall parallel time is Θ(n) • The cost or process-product time is thus Θ(n 2 ) • Because the serial time is also Θ(n 2 ), the algorithm is cost-optimal 5/6/2003 densematrix 4
3 Fewer Than n Processors • Suppose the number of processors p < n • Place n/p rows per processor of both A and x • An all-to-all broadcast distributed n/p elements of x • Each processor computes n/p elements of y • No further distribution of y is needed • The broadcast takes: t s log p + t w (n/p)(p–1) ≈ t s log p + t w n • The computation per processor is: (n/p)n = n 2 /p • The total parallel runtime T P is: n 2 /p + t s log p + t w n • The cost is: n 2 + t s p log p + t w np = Θ(n 2 ) • The work W = n 2 • The algorithm is cost-optimal, provided p = O(n) 5/6/2003 densematrix 5 Diagram Of The Computational And Communication Process For 1-D Row Partition <strong>Matrix</strong>-Vector Multiplication 5/6/2003 densematrix 6
Page 1: 1 Dense Matrix Algorithms -- Chapte
Page 5 and 6: 5 Diagram Of The Computational And
Page 7 and 8: 7 Determining The Cost-Optimal Cons
Page 9 and 10: 9 Performance And Analysis for the
Page 11 and 12: 11 The DNS Algorithm • Partitions
Page 13: 13 The DNS Algorithm Continued 5/6/

Dense Matrix Algorithms -- Chapter 8 Introduction

Create successful ePaper yourself

Delete template?

Save as template?