TY - GEN
T1 - Characterizing Open Addressing Hash Functions
AU - Luo, Wenbin
AU - Heileman, Gregory L.
N1 - Publisher Copyright:
© 18th International Conference on Computers and Their Applications 2003, CATA 2003. All rights reserved.
PY - 2003
Y1 - 2003
N2 - In [1], we showed that different open addressing hash functions perform differently when the data elements are not uniformly distributed. So, it is tempting to attribute their difference to some mechanism governing the behavior of the hash functions. In this paper, a simple method of characterizing open addressing hash functions is presented. We showed that, indeed, the nature of data spreading ability characterizes the behavior of different open addressing hash functions. We measured and analyzed the spreading speed of a cluster of data elements under different open addressing hash functions. Our experimental results and theoretical analysis showed that different hash functions have different abilities of spreading out clustered data elements. The hash function, which spreads out clustered data elements over the whole table space more uniformly and faster, has better performance when it is applied to clustered data. Experimental results are presented to support our claims, which is followed by some theoretic analysis.
AB - In [1], we showed that different open addressing hash functions perform differently when the data elements are not uniformly distributed. So, it is tempting to attribute their difference to some mechanism governing the behavior of the hash functions. In this paper, a simple method of characterizing open addressing hash functions is presented. We showed that, indeed, the nature of data spreading ability characterizes the behavior of different open addressing hash functions. We measured and analyzed the spreading speed of a cluster of data elements under different open addressing hash functions. Our experimental results and theoretical analysis showed that different hash functions have different abilities of spreading out clustered data elements. The hash function, which spreads out clustered data elements over the whole table space more uniformly and faster, has better performance when it is applied to clustered data. Experimental results are presented to support our claims, which is followed by some theoretic analysis.
UR - http://www.scopus.com/inward/record.url?scp=85132022804&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132022804&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85132022804
T3 - 18th International Conference on Computers and Their Applications 2003, CATA 2003
SP - 21
EP - 24
BT - 18th International Conference on Computers and Their Applications 2003, CATA 2003
A2 - Debnath, Narayan C.
PB - The International Society for Computers and Their Applications (ISCA)
T2 - 18th International Conference on Computers and Their Applications, CATA 2003
Y2 - 26 March 2003 through 28 March 2003
ER -