@sgl_project 和 @radixark 团队在优化DeepSeek V4推理方面取得了惊人成果,包括在B200、B300上的优化,以及@ChengWan17近期在GB300上实现的4倍等交互吞吐量提升!正如@elonmusk所说,GB300是最佳AI计算机,而此类软件优化正展现其真正潜力!
Amazing work from the @sgl_project and @radixark team for their work optimizing DeepSeek V4 inference on B200, B300, and the recent 4x iso-interactivity throughput improvements on GB300 by @ChengWan17! As @elonmusk said, The GB300 is the best AI computer, and software optimizations like this show its true potential!