NVIDIA SOFTWARE MOAT ALERT: the recently announced AWS Trainium <> Cerebras setup will still use a small amount of NVIDIA software. To transfer the KV cache between Trainium (prefill) and the Cerebras wafer (decode), AWS will use NVIDIA's NIXL KV cache transfer agent along with EFA. They will RDMA over EFA from Trainium into the Cerebras CPU host memory, and the CPU host then talks to the wafer via the wafer engine's FPGA.
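To make the two-hop path concrete, here is a toy Python sketch of the route described above. It is only a model: it does not call NIXL, EFA, or any Cerebras API, and every name in it (`Hop`, `KV_PATH`, `transfer_kv_cache`) is hypothetical and made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hop:
    """One leg of the KV cache path (labels only, no real transport)."""
    src: str
    dst: str
    transport: str


# Hop list mirroring the path described in the post.
# The strings are descriptive labels, not identifiers from any real library.
KV_PATH = [
    Hop("Trainium (prefill)", "Cerebras CPU host memory", "RDMA over EFA via NIXL"),
    Hop("Cerebras CPU host memory", "Cerebras wafer (decode)", "wafer engine FPGA"),
]


def transfer_kv_cache(kv_blocks: bytes, path=KV_PATH):
    """Toy model: copy the payload hop by hop and log each hop taken."""
    log = []
    payload = kv_blocks
    for hop in path:
        payload = bytes(payload)  # stand-in for the copy performed on this hop
        log.append(f"{hop.src} -> {hop.dst} [{hop.transport}]")
    return payload, log


if __name__ == "__main__":
    data, hops = transfer_kv_cache(b"\x00" * 16)
    for line in hops:
        print(line)
```

The point of the sketch is that the KV cache is staged in host memory first: the RDMA write does not land on the wafer directly, so the second hop through the FPGA is a separate step.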