TY - GEN
T1 - Cache-aware load balancing for question answering
AU - Dominguez-Sal, David
AU - Aguilar-Saborit, Josep
AU - Surdeanu, Mihai
AU - Larriba-Pey, Josep Lluis
PY - 2008
Y1 - 2008
N2 - The need for high performance and throughput Question Answering (QA) systems demands for their migration to distributed environments. However, even in such cases it is necessary to provide the distributed system with cooper- ative caches and load balancing facilities in order to achieve the desired goals. Until now, the literature on QA has notconsidered such a complex system as a whole. Currently, the load balancer regulates the assignment of tasks based only on the CPU and I/O loads without considering the status of the system cache. This paper investigates the load balancing problem propos- ing two novel algorithms that take into account the dis- tributed cache status, in addition to the CPU and I/O load in each processing node. We have implemented, and tested the proposed algorithms in a fully fledged distributed QA system. The two algorithms show that the choice of using the status of the cache was determinant in achieving good performance, and high throughput for QA systems.
AB - The need for high performance and throughput Question Answering (QA) systems demands for their migration to distributed environments. However, even in such cases it is necessary to provide the distributed system with cooper- ative caches and load balancing facilities in order to achieve the desired goals. Until now, the literature on QA has notconsidered such a complex system as a whole. Currently, the load balancer regulates the assignment of tasks based only on the CPU and I/O loads without considering the status of the system cache. This paper investigates the load balancing problem propos- ing two novel algorithms that take into account the dis- tributed cache status, in addition to the CPU and I/O load in each processing node. We have implemented, and tested the proposed algorithms in a fully fledged distributed QA system. The two algorithms show that the choice of using the status of the cache was determinant in achieving good performance, and high throughput for QA systems.
UR - http://www.scopus.com/inward/record.url?scp=70349229911&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349229911&partnerID=8YFLogxK
U2 - 10.1145/1458082.1458250
DO - 10.1145/1458082.1458250
M3 - Conference contribution
AN - SCOPUS:70349229911
SN - 9781595939913
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 1271
EP - 1280
BT - Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM'08
T2 - 17th ACM Conference on Information and Knowledge Management, CIKM'08
Y2 - 26 October 2008 through 30 October 2008
ER -