TPU警报:针对开源生产级Kubernetes分布式推理,Google刚为llm-d添加了夜间CI。这是Google推动更广泛ML社区使用TPU的重要一步。TPU在llm-d CI和代码质量方面正追赶NVIDIA。相比之下,尽管AMD官方推荐的生产级Kubernetes推理方案是llm-d,但@AnushElangovan尚未将任何AMD GPU或AMD网卡加入CI。
TPU ALERT: For OSS production Kubernetes distributed inferencing, Google just added nightly CI for llm-d. Great step by Google to start enabling the wider ML community for TPUs. TPU is catching up to NVIDIA for llm-d CI & code quality. In comparison, although AMD's official recommended production kubernetes inferencing solution is llm-d, @AnushElangovan has yet to add any AMD GPUs or AMD NICs into the CI.