Opus 4.8显然是个强模型,但我的印象是,Anthropic越来越像是在追赶OpenAI,而不是引领节奏。 感觉GPT-5.5再次改变了基准,如果OpenAI保持这个轨迹,GPT-5.6很可能成为整体更强的模型。 初步测试显示4.8表现尚可。
Opus 4.8 is clearly a strong model, but my impression is that Anthropic is increasingly playing catch-up with OpenAI rather than setting the pace.
It feels like GPT-5.5 has shifted the benchmark again, and if OpenAI keeps this trajectory, GPT-5.6 could very plausibly become the stronger overall model.
Initial testing is that 4.8 is good-ish