TY - GEN
T1 - A holistic dataflow-inspired system design
AU - Zuckerman, Stephane
AU - Wei, Haitao
AU - Gao, Guang R.
AU - Wong, Howard
AU - Gaudiot, Jean Luc
AU - Louri, Ahmed
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/4/17
Y1 - 2014/4/17
N2 - Computer systems have undergone a fundamental transformation recently, from single-core processors to devices with increasingly higher core counts within a single chip. The semi-conductor industry now faces the infamous power and utilization walls. To meet these challenges, heterogeneity in design, both at the architecture and technology levels, will be the prevailing approach for energy efficient computing as specialized cores, accelerators, etc., can eliminate the energy overheads of general-purpose homogeneous cores. However, with future technological challenges pointing in the direction of on-chip heterogeneity, and because of the traditional difficulty of parallel programming, it becomes imperative to produce new system software stacks that can take advantage of the heterogeneous hardware. As a case in point, the core count per chip continues to increase dramatically while the available on-chip memory per core is only getting marginally bigger. Thus, data locality, already a must-have in high-performance computing, will become even more critical as memory technology progresses. In turn, this makes it crucial that new execution models be developed to better exploit the trends of future heterogeneous computing in many-core chips. To solve these issues, we propose a cross-cutting cross-layer approach to address the challenges posed by future heterogeneous many-core chips.
AB - Computer systems have undergone a fundamental transformation recently, from single-core processors to devices with increasingly higher core counts within a single chip. The semi-conductor industry now faces the infamous power and utilization walls. To meet these challenges, heterogeneity in design, both at the architecture and technology levels, will be the prevailing approach for energy efficient computing as specialized cores, accelerators, etc., can eliminate the energy overheads of general-purpose homogeneous cores. However, with future technological challenges pointing in the direction of on-chip heterogeneity, and because of the traditional difficulty of parallel programming, it becomes imperative to produce new system software stacks that can take advantage of the heterogeneous hardware. As a case in point, the core count per chip continues to increase dramatically while the available on-chip memory per core is only getting marginally bigger. Thus, data locality, already a must-have in high-performance computing, will become even more critical as memory technology progresses. In turn, this makes it crucial that new execution models be developed to better exploit the trends of future heterogeneous computing in many-core chips. To solve these issues, we propose a cross-cutting cross-layer approach to address the challenges posed by future heterogeneous many-core chips.
KW - Codelets
KW - Dataflow
KW - Heterogeneous architecture
KW - Streaming
UR - http://www.scopus.com/inward/record.url?scp=84949924625&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84949924625&partnerID=8YFLogxK
U2 - 10.1109/DFM.2014.16
DO - 10.1109/DFM.2014.16
M3 - Conference contribution
AN - SCOPUS:84949924625
T3 - Proceedings - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014
SP - 46
EP - 49
BT - Proceedings - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014
Y2 - 24 August 2014
ER -