TY - GEN
T1 - Load redundancy elimination on executable code
AU - Fernández, Manel
AU - Espasa, Roger
AU - Debray, Saumya
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2001.
PY - 2001
Y1 - 2001
N2 - Optimizations performed at linkt ime or directly applied to final program executables have received increased attention in recent years. This paper discuss the discovery and elimination of redundant load operations in the context of a linkt ime optimizer, an optimization that we call Load Redundancy Elimination (LRE). Our experiments show that between 50% and 75% of a program’s memory references can be considered redundant because they are accessing memory locations that have been referenced less than 200–400 instructions away. We then present three profile-based LRE algorithms targeted at optimizing away these redundancies. Our results show that between 5% and 30% of the redundancy detected can indeed be eliminated, which translates into program speedups in the range of 3% to 8%. We also test our algorithm assuming different cache latencies, and show that, if latencies continue to grow, the load redundancy elimination will become more important.
AB - Optimizations performed at linkt ime or directly applied to final program executables have received increased attention in recent years. This paper discuss the discovery and elimination of redundant load operations in the context of a linkt ime optimizer, an optimization that we call Load Redundancy Elimination (LRE). Our experiments show that between 50% and 75% of a program’s memory references can be considered redundant because they are accessing memory locations that have been referenced less than 200–400 instructions away. We then present three profile-based LRE algorithms targeted at optimizing away these redundancies. Our results show that between 5% and 30% of the redundancy detected can indeed be eliminated, which translates into program speedups in the range of 3% to 8%. We also test our algorithm assuming different cache latencies, and show that, if latencies continue to grow, the load redundancy elimination will become more important.
UR - http://www.scopus.com/inward/record.url?scp=84937424248&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84937424248&partnerID=8YFLogxK
U2 - 10.1007/3-540-44681-8_33
DO - 10.1007/3-540-44681-8_33
M3 - Conference contribution
AN - SCOPUS:84937424248
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 221
EP - 229
BT - Euro-Par 2001 Parallel Processing - 7th International Euro-Par Conference, Proceedings
A2 - Sakellariou, Rizos
A2 - Gurd, John
A2 - Freeman, Len
A2 - Keane, John
PB - Springer-Verlag
T2 - 7th European Conference on Parallel Computing, Euro-Par 2001
Y2 - 28 August 2001 through 31 August 2001
ER -