Improving Cloud Service Resilience using Brownout-Aware Load-Balancing
Författare
Summary, in English
Combining resilience with the brownout paradigm is to date an open practical problem. The issue is to ensure that replica self-adaptivity would not confuse the load-balancing algorithm, overloading replicas that are already struggling with capacity shortage. For example, load-balancing strategies based on response times are not able to decide which replicas should be selected, since the response times are already controlled by the brownout paradigm.
In this paper we propose two novel brownout-aware load-balancing algorithms. To test their practical applicability, we extended the popular lighttpd web server and load-balancer, thus obtaining a production-ready implementation. Experimental evaluation shows that the approach enables cloud services to remain responsive despite cascading failures. Moreover, when compared to Shortest Queue First (SQF), believed to be near-optimal in the non-adaptive case, our algorithms improve user experience by 5%, with high statistical significance, while preserving response time predictability.
Avdelning/ar
Publiceringsår
2014
Språk
Engelska
Sidor
31-40
Publikation/Tidskrift/Serie
[Host publication title missing]
Fulltext
- Available as PDF - 425 kB
- Download statistics
Dokumenttyp
Konferensbidrag
Förlag
IEEE - Institute of Electrical and Electronics Engineers Inc.
Ämne
- Control Engineering
Conference name
33rd IEEE International Symposium on Reliable Distributed Systems
Conference date
2014-10-07
Conference place
Nara, Japan
Status
Published
Projekt
- EIT_VR CLOUD Cloud Control
Forskningsgrupp
- LCCC