A Compiler for Parallel Exeuction of Numerical Python Programs on ...
A Compiler for Parallel Exeuction of Numerical Python Programs on ...
A Compiler for Parallel Exeuction of Numerical Python Programs on ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
List <str<strong>on</strong>g>of</str<strong>on</strong>g> Tables2.1 Comparisi<strong>on</strong> <str<strong>on</strong>g>of</str<strong>on</strong>g> bandwidth available to Rade<strong>on</strong> 4870 and Phenom II X4 940 . 177.1 Executi<strong>on</strong> time <str<strong>on</strong>g>for</str<strong>on</strong>g> matrix multiplicati<strong>on</strong> benchmark <str<strong>on</strong>g>for</str<strong>on</strong>g> 32-bit floating point(sec<strong>on</strong>ds) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 547.2 Speedups <str<strong>on</strong>g>for</str<strong>on</strong>g> matrix multiplicati<strong>on</strong> using GPU <str<strong>on</strong>g>for</str<strong>on</strong>g> 32-bit floating point overATLAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 547.3 Executi<strong>on</strong> time <str<strong>on</strong>g>for</str<strong>on</strong>g> matrix multiplicati<strong>on</strong> benchmark <str<strong>on</strong>g>for</str<strong>on</strong>g> 64-bit floating point(sec<strong>on</strong>ds) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 557.4 Speedups <str<strong>on</strong>g>for</str<strong>on</strong>g> matrix multiplicati<strong>on</strong> using GPU <str<strong>on</strong>g>for</str<strong>on</strong>g> 64-bit floating point overATLAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 557.5 Executi<strong>on</strong> time <str<strong>on</strong>g>for</str<strong>on</strong>g> CP benchmark (sec<strong>on</strong>ds) . . . . . . . . . . . . . . . . . . . 567.6 Speedups <str<strong>on</strong>g>for</str<strong>on</strong>g> CP using GPU over OpenMP . . . . . . . . . . . . . . . . . . . 567.7 Executi<strong>on</strong> time <str<strong>on</strong>g>for</str<strong>on</strong>g> Black-Scholes benchmark (sec<strong>on</strong>ds) . . . . . . . . . . . . . 577.8 Speedups <str<strong>on</strong>g>for</str<strong>on</strong>g> Black-Scholes benchmark using GPU over OpenMP . . . . . . . 577.9 Executi<strong>on</strong> time <str<strong>on</strong>g>for</str<strong>on</strong>g> 5-point stencil benchmark (millisec<strong>on</strong>ds) . . . . . . . . . . 58