AI 摘要
NVIDIA 比任何人都更了解其客户的需求。他们直接听到这些需求。这就是为什么解耦推理是未来,以及为什么 LPU 实际上在流水线的某些部分超越了 GPU。
NVIDIA knows more about what its customers need than anyone else. They hear the asks directly. That is why disaggregated inference is the future, and why the LPU actually surpasses the GPU in certain parts of the pipeline.