quick questions:
if anthropic already puts opus 4.6 at a "20%" chance of being conscious,
where does mythos score on that eval?
and if gpt-5.4 and opus 4.6 are already helping with phd-level research alongside people like terence tao,
what will spud and mythos be capable of?