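NVIDIA has released Nemotron 3 Ultra (Nemotron-3-Ultra-550B-A55B-NVFP4) on Hugging Face: a 550B-parameter mixture-of-experts (MoE) open frontier large language model designed for long-running AI agents. Compared with other open frontier models, inference is 5x faster and complex agentic tasks cost 30% less. The model has strong agentic, reasoning, and conversational capabilities.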
NVIDIA 🔥: Nemotron 3 Ultra has been released on Hugging Face, with 5x faster inference and 30% lower costs than other open models.
Nemotron-3-Ultra-550B-A55B-NVFP4 is a frontier-scale large language model (LLM) trained by NVIDIA, designed to deliver strong agentic, reasoning, and conversational capabilities.