Delay versus Stickiness Violation Trade-offs for Load Balancing in Large-Scale Data Centers

  • Borst S.
  • Liang Q.

Load balancing provides a key mechanism for achieving efficient resource allocation in data centers, ensuring high levels of server utilization and robust application performance. The load balancing techniques implemented in current data centers tend to rely on mapping packets to a server IP address through a hash value calculated from the flow five-tuple in the packet header. The hash calculation allows extremely fast packet forwarding (at line speed) and provides flow 'stickiness', meaning that all packets belonging to the same flow get dispatched to the same server. Unfortunately, a nominal static hashing operation may not always yield an optimal degree of load balancing, e.g. due to variations in server processing speeds or in traffic characteristics of flows. Dynamic weighted hashing, e.g. implemented via 'bins' to add a level of indirection, provides a natural way to mitigate load imbalances. Specifically, bin reassignment to adjust the hashing weights and redirect flows can improve the degree of load balancing and hence the delay performance, but at the expense of flow stickiness violation and possible disruption of active flows. In the present paper we examine the fundamental trade-off between flow stickiness violation and "delay" performance, where "delay" refers to either bin reassignment delay or packet-level latency. We establish that relaxing the stickiness requirement by a minuscule amount yields a notable reduction in the bin reassignment delay, translating into a significant speed-up of the bin reassignment process. We further demonstrate that flow stickiness violation can help improve packet latency performance even once the bin reassignment process has reached an equilibrium where the structural mismatches between traffic loads and server capacities have been resolved. In particular, a minor level of stickiness violation tolerance is highly effective in clipping the tail of the latency distribution.

Recent Publications

January 01, 2019

Friendly, appealing or both? Characterising user experience in sponsored search landing pages

  • Bron M.
  • Chute M.
  • Evans H.
  • Lalmas M.
  • Redi M.
  • Silvestri F.

© 2017 International World Wide Web Conference Committee (IW3C2), published under Creative Commons CC BY 4.0 License. Many of today's websites have recognised the importance of mobile friendly pages to keep users engaged and to provide a satisfying user experience. However, next to the experience provided by the sites themselves, ...

January 01, 2019

Analyzing uber's ride-sharing economy

  • Aiello L.
  • Djuric N.
  • Grbovic M.
  • Kooti F.
  • Lerman K.
  • Radosavljevic V.

© 2017 International World Wide Web Conference Committee (IW3C2), published under Creative Commons CC BY 4.0 License. Uber is a popular ride-sharing application that matches people who need a ride (or riders) with drivers who are willing to provide it using their personal vehicles. Despite its growing popularity, there exist ...

January 01, 2019

The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race

  • Cresci S.
  • Petrocchi M.
  • Pietro R.
  • Spognardi A.
  • Tesconi M.

© 2017 International World Wide Web Conference Committee (IW3C2), published under Creative Commons CC BY 4.0 License. Recent studies in social media spam and automation provide anecdotal argumentation of the rise of a new generation of spambots, so-called social spambots. Here, for the first time, we extensively study this novel ...