TY - GEN
T1 - Executing optimized irregular applications using task graphs within existing parallel models
AU - Krieger, Christopher D.
AU - Strout, Michelle Mills
AU - Roelofs, Jonathan
AU - Bajwa, Amanreet
PY - 2012
Y1 - 2012
N2 - Many sparse or irregular scientific computations are memory bound and benefit from locality improving optimizations such as blocking or tiling. These optimizations result in asynchronous parallelism that can be represented by arbitrary task graphs. Unfortunately, most popular parallel programming models with the exception of Threading Building Blocks (TBB) do not directly execute arbitrary task graphs. In this paper, we compare the programming and execution of arbitrary task graphs qualitatively and quantitatively in TBB, the OpenMP doall model, the OpenMP 3.0 task model, and Cilk Plus. We present performance and scalability results for 8 and 40 core shared memory systems on a sparse matrix iterative solver and a molecular dynamics benchmark.
AB - Many sparse or irregular scientific computations are memory bound and benefit from locality improving optimizations such as blocking or tiling. These optimizations result in asynchronous parallelism that can be represented by arbitrary task graphs. Unfortunately, most popular parallel programming models with the exception of Threading Building Blocks (TBB) do not directly execute arbitrary task graphs. In this paper, we compare the programming and execution of arbitrary task graphs qualitatively and quantitatively in TBB, the OpenMP doall model, the OpenMP 3.0 task model, and Cilk Plus. We present performance and scalability results for 8 and 40 core shared memory systems on a sparse matrix iterative solver and a molecular dynamics benchmark.
KW - irregular applications
KW - parallel programming models
KW - task graphs
UR - http://www.scopus.com/inward/record.url?scp=84876592412&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84876592412&partnerID=8YFLogxK
U2 - 10.1109/SC.Companion.2012.43
DO - 10.1109/SC.Companion.2012.43
M3 - Conference contribution
AN - SCOPUS:84876592412
SN - 9780769549569
T3 - Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012
SP - 261
EP - 268
BT - Proceedings - 2012 SC Companion
T2 - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012
Y2 - 10 November 2012 through 16 November 2012
ER -