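This edition covers several AI developments: a new OpenAI paper shows the first version of office work where agents do most of the execution; the NYT reports that OpenAI is leaning toward a 2027 IPO; new OpenAI research finds that RL training grounded in real human scenarios makes models safer and more useful on future tasks; an MIT study shows code volume surging 300% while output grew only 30%; and Qwen released Qwen-AgentWorld, a 35B-parameter open-weight world model that learns how terminals, browsers, Android devices, code repositories, search systems, OS tools, and MCP servers respond to an AI agent's actions.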
Today's edition of my newsletter just went out.
🔗 https://www.rohan-paul.com/p/openais-new-paper-shows-how-they
🗞️ OpenAI's new paper shows how they are now seeing the first version of office work where agents do most of the execution.
🗞️ New report on "The State of the AI Economy"
🗞️ New York Times: OpenAI is now leaning toward a 2027 IPO because the public market is testing whether AI giants deserve trillion-dollar prices before they prove durable profits.
🗞️ Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention
🗞️ The Economist: AI has pushed the internet's content machine into a new phase, with books, lawsuits, research papers, apps, and songs now being produced at volumes that old review systems were not built to handle.