TY - GEN
T1 - Performance analysis of IBM Cell Broadband Engine on sequence alignment
AU - Song, Yang
AU - Striemer, Gregory M.
AU - Akoglu, Ali
PY - 2009
Y1 - 2009
N2 - The Smith-Waterman (SW) algorithm is the most accurate sequence alignment approach used by computational biologists for DNA matching. However it's computational complexity makes SW impractical to use in clinical environment compared to much faster but less accurate sequence alignment technique such as BLAST. High performance computing community is examining alternative multi core architectures such as IBM Cell Broadband Engine (BE) and Graphics Processing Units (GPUs) that address the limitations of modern cache-based designs. In this paper we investigate the performance of IBM Cell BE architecture in the context of SW. We present an analysis on architectural features of the Cell BE, study the architecture's fitness for accelerating sequence alignment based on its parallel processing power, interconnect structure and communication protocols among the processing cores. We then evaluate the performance of Cell BE against the state of art implementation of SW on NVIDIA's Tesla GPU. Results show that based on the memory architecture of the SW algorithm, Cell BE performs much better than Tesla GPU in terms of both cycle count and execution time metrics. Compared to purely serial implementation, in terms of cycle count, while state of the art GPU implementation delivers 15x speedup, our solution achieves 64x speedup.
AB - The Smith-Waterman (SW) algorithm is the most accurate sequence alignment approach used by computational biologists for DNA matching. However it's computational complexity makes SW impractical to use in clinical environment compared to much faster but less accurate sequence alignment technique such as BLAST. High performance computing community is examining alternative multi core architectures such as IBM Cell Broadband Engine (BE) and Graphics Processing Units (GPUs) that address the limitations of modern cache-based designs. In this paper we investigate the performance of IBM Cell BE architecture in the context of SW. We present an analysis on architectural features of the Cell BE, study the architecture's fitness for accelerating sequence alignment based on its parallel processing power, interconnect structure and communication protocols among the processing cores. We then evaluate the performance of Cell BE against the state of art implementation of SW on NVIDIA's Tesla GPU. Results show that based on the memory architecture of the SW algorithm, Cell BE performs much better than Tesla GPU in terms of both cycle count and execution time metrics. Compared to purely serial implementation, in terms of cycle count, while state of the art GPU implementation delivers 15x speedup, our solution achieves 64x speedup.
UR - http://www.scopus.com/inward/record.url?scp=72849150749&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=72849150749&partnerID=8YFLogxK
U2 - 10.1109/AHS.2009.16
DO - 10.1109/AHS.2009.16
M3 - Conference contribution
AN - SCOPUS:72849150749
SN - 9780769537146
T3 - Proceedings - 2009 NASA/ESA Conference on Adaptive Hardware and Systems, AHS 2009
SP - 439
EP - 446
BT - Proceedings - 2009 NASA/ESA Conference on Adaptive Hardware and Systems, AHS 2009
T2 - 2009 NASA/ESA Conference on Adaptive Hardware and Systems, AHS 2009
Y2 - 29 July 2009 through 1 August 2009
ER -