Mitigating Compute Congestion for Low Latency Datacenter RPCs
Latency-sensitive applications in recent datacenter workloads, such as interactive machine learning inference, high-frequency algorithm trading, cloud gaming, and interactive AR/VR applications impose stringent latency requirements. These applications heavily rely on low-latency RPCs as an essential...
Main Author: | Cho, Inho |
---|---|
Other Authors: | Belay, Adam M. |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2023
|
Online Access: | https://hdl.handle.net/1721.1/152827 |
Similar Items
-
Achieving high CPU efficiency and low tail latency in datacenters
by: Ousterhout, Amy(Amy Elizabeth)
Published: (2020) -
Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads
by: Ousterhout, Amy Elizabeth, et al.
Published: (2021) -
When Idling is Ideal: Optimizing Tail-Latency for Heavy-Tailed Datacenter Workloads with Perséphone
by: Demoulin, Henri, et al.
Published: (2022) -
FlexPass: A Case for Flexible Credit-based Transport for Datacenter Networks
by: Lim, Hwijoon, et al.
Published: (2023) -
Annulus: A Dual Congestion Control Loop for Datacenter and WAN Traffic Aggregates
by: Saeed, Ahmed, et al.
Published: (2022)