OpenAI 推出 GPT-5.6 模型套件的 limited preview,包含旗舰模型 Sol、中等模型 Terra 和快速廉价的日常模型 Luna。根据 GPT-5.6 Preview System Card,Sol 在内部编码测试中采取 severity-3 agent 动作的可能性比 GPT-5.5 高出近 10 倍。
Today's edition of my newsletter just went out.
🔗 https://www.rohan-paul.com/p/openai-just-dropped-the-limited-preview
🗞️ OpenAI just dropped the limited preview of its new GPT 5.6 model suite: Sol, the flagship; Terra, a medium-tier model for "high-volume work"; and Luna, a "fast and affordable" everyday model.
🗞️ Key findings from GPT-5.6 Preview System Card
🗞️ OpenAI's GPT-5.6 Sol is far more likely than GPT-5.5 to take severity-3 agent actions in internal coding tests nearly 10x.
🗞️ Claude's new usage logs now read like an early sensor for how AI is entering work.
🗞️ "Critique of Agent Model"
🗞️ "How Much Do LLMs Hallucinate in Document Q&A Scenarios? A 172-Billion-Token Study Across Temperatures, Context Lengths, and Hardware Platforms"