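Hedged requests, a technique inspired by the Tail at Scale paper, have been applied to multi-channel DRAM within a single machine: read requests are issued to all channels concurrently and the fastest response is used, cutting p99.99 read latency by roughly 5-15x. The tailslayer project that implements this approach is open source.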
Hedged requests (apparently inspired by the Tail at Scale paper by myself and Luiz Barroso), applied within a single machine: data is replicated across DRAM channels, reads are issued to all channels, and whichever one comes back first is used. ~5-15X reduction in p99.99 read latency.
https://github.com/LaurieWired/tailslayer/blob/main/README.md
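For readers new to the idea, here is a rough sketch of a hedged read in Python. It is only a thread-level analogy and does not reflect how tailslayer itself works: each "replica" is a function that sleeps for a fixed time to stand in for one copy of the data, and `make_replica`, `hedged_read`, the channel names, and the delays are all hypothetical, chosen for illustration.

```python
import time
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait


def make_replica(name, value, delay_s):
    """Build a read function for one copy of the data.

    Hypothetical stand-in for one replica: the sleep imitates that
    copy's access latency (a long delay models a tail-latency stall).
    """
    def read():
        time.sleep(delay_s)
        return name, value
    return read


def hedged_read(replicas, pool):
    """Issue the same read to every replica and return the first result."""
    futures = [pool.submit(read) for read in replicas]
    # Block only until the fastest replica answers.
    done, _still_running = wait(futures, return_when=FIRST_COMPLETED)
    # Slower reads are not cancelled here; they finish in the background
    # and their results are simply ignored.
    return next(iter(done)).result()


if __name__ == "__main__":
    with ThreadPoolExecutor(max_workers=2) as pool:
        replicas = [
            make_replica("channel0", 42, 0.5),   # assumed to be stalled
            make_replica("channel1", 42, 0.01),  # assumed to be healthy
        ]
        start = time.perf_counter()
        name, value = hedged_read(replicas, pool)
        elapsed_ms = (time.perf_counter() - start) * 1000
        print(f"served by {name}: value={value} in {elapsed_ms:.1f} ms")
```

The trade-off is the usual one for hedging: every read costs one access per replica and the data takes extra memory, in exchange for the caller seeing the minimum of the replicas' latencies rather than any single replica's tail.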
Cool stuff, @lauriewired!
Accompanying video forwarded to me by a friend, which is how I learned about it: https://www.youtube.com/watch?v=QFi2WVGfXMQ