SemiAnalysis (@SemiAnalysis_)

2026-06-14 04:30

SITUATION DETECTED: The city of Rio de Janeiro has post-trained a model.

Based on Qwen 7/2, Rio 3.5 Open 397B adds SwiReasoning on top of the base Qwen model - a framework that dynamically switches between standard chain-of-thought and latent-space reasoning, guided by entropy-based confidence signals, so the model only "thinks out loud" when it needs to and otherwise reasons silently in hidden space for better token efficiency.
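To make "entropy-based confidence signals" concrete, here is a toy sketch, not the actual SwiReasoning rule. The function names, the fixed threshold of 1.0 nats, and the choice to map high entropy to explicit reasoning are all assumptions made for illustration; the post does not say how the switch is computed.

```python
import math


def shannon_entropy(probs):
    """Shannon entropy (in nats) of a next-token probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def pick_mode(probs, threshold=1.0,
              uncertain_mode="explicit", confident_mode="latent"):
    """Hypothetical switch: pick a reasoning mode from one step's entropy.

    Assumption (not from the post): entropy above `threshold` counts as low
    confidence and selects `uncertain_mode`; otherwise `confident_mode`.
    Swap the two mode arguments to model the opposite policy.
    """
    if shannon_entropy(probs) > threshold:
        return uncertain_mode
    return confident_mode


# A flat distribution is high-entropy; a peaked one is low-entropy.
flat = [0.25, 0.25, 0.25, 0.25]
peaked = [0.97, 0.01, 0.01, 0.01]
modes = [pick_mode(flat), pick_mode(peaked)]
```

A real system would likely track entropy trends over blocks of steps rather than a single fixed threshold; the sketch only shows how a token distribution can be reduced to a scalar confidence signal that drives a mode decision.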

Reasoning model release
