AIHOT
All updates · X · 2,155 posts
Tag: OpenAI
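jason@jxnlco · 2 hours ago · 16

Come say hi!

Summary: Today at the OpenAI booth at the @aiDotEngineer conference, @theo will host an AMA (ask me anything) at 2 pm. The booth also has a claw machine, milk tea, and the famous Codex reset button to try out. Come hang out!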

OpenAI Developers@OpenAIDevs · 2 hours ago · 45

http://x.com/i/article/2072717544570728450

# June for OpenAI Developers

June brought a lot into the loop. Here's what's new for developers building with OpenAI:

- You said building with Codex feels like flying. We took that literally.
- DevDay 2026 applications are here. Submit by July 10.
- Record and Replay plugin.
- Codex plugins now have role-specific context.
- The Build iOS Apps plugin brings app previews into Codex.
- Build with OpenAI APIs, Agents SDK, and ChatGPT Apps from Codex.
- Spin up a persistent cloud dev environment with the @digitalocean plugin in Codex.
- Codex in ChatGPT mobile is generally available.
- More Codex capabilities rolled out in the EEA, UK, and Switzerland.
- Give Codex more browser context.
- Codex profiles give your build stats a home.
- Bring OpenAI models and Codex into your AWS workflows.
- New docs agent on developers.openai.com.
- The OpenAI API got moderation scores, image results, and more.
- Developers are building voice-first apps with the Realtime API.
- We're continuing to support OSS maintainers and the open-source ecosystem.
- Codex in our community workflows.
- For builders who want the under-the-hood details behind OpenAI products, here are a few deep dives from our team.

That was June. We can't wait for you to see what's compiling for July. Follow @OpenAIDevs on X to stay up to date.

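jason@jxnlco · 3 hours ago · 54

Let’s fucking go

Summary: Developer @vig_xyz shared how he uses Codex to automate a range of workflows: reading email and drafting proposals in Google Drive based on the contents; auto-drafting contract redlines that, once a lawyer confirms them, are entered into DocuSign via computer use; watching a Slack feedback channel to fix bugs automatically; writing unit tests overnight to reach 100% code coverage; and launching 6 threads in parallel on worktrees so the PRs can be merged independently. He says he can't imagine going back to an IDE, or even vim.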

Epoch AI@EpochAIResearch · 4 hours ago · 44

OpenAI’s GPT-4 led the Epoch Capabilities Index for 352 days after its March 2023 release, far longer than any model since. The second-longest lead belongs to OpenAI’s o1 at 98 days.

jason@jxnlco · 6 hours ago · 15

About to use Codex computer use to control my iPhone via screen mirroring, check Find My to see who’s around me, and text them.

Emad@EMostaque · 6 hours ago · 23

OpenAI and Anthropic should put 10% of their equity each in Invest America accounts for the children of the USA. Valuation will go up more than 10%.

Chubby♨️@kimmonismus · 7 hours ago · 25

And we are still waiting for Gemini 3.5 Pro, which I actually expected at the end of June.

Chubby♨️@kimmonismus · 9 hours ago · 23

The only question remaining now is: will GPT-5.6 also have guardrails as strict as Fable 5’s, or does OpenAI have better connections within the US government? We will find out very soon.

Chubby♨️@kimmonismus · 11 hours ago · 60

A few more thoughts on OpenAI’s 5 percent stake for the US government. I do not think this is only about allowing US authorities to share in the profits, but also about enabling an ever closer interconnection between government and future technology. The situation surrounding Fable 5 has made it strikingly clear how important it will be for a frontier lab to maintain good relationships with the authorities in the future, and, conversely, how important Western governments consider this technology to be. Here, too, the AI2027 blog was ahead of its time. OpenAI is therefore anticipating regulation insofar as the company is proactively offering the US government cooperation and closer integration (as well as potential profits). But also potential losses, should AI, for whatever reason, ultimately fail. I also think that even larger stakes will go to governments in the future. All in all, this is a sign of things to come: good relationships with authorities, future technology that must be approved by authorities, and a closer blending of the state and private companies. OpenAI is simply taking a proactive step down a path that was already foreseeable.

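Summary: OpenAI's Sam Altman is discussing giving the US government a 5% stake (at an $852 billion valuation), arguing that if AI creates enormous wealth, the public should share in the gains. The real motives include: regulatory insurance (a 5% stake may be cheaper than political gridlock or strict release rules); IPO preparation (an indirect government holding could lower political risk); model-release pressure (OpenAI and Anthropic have already delayed frontier models because of scrutiny, and a government stake could turn an opponent into a co-beneficiary); and the need to build out infrastructure such as data centers, energy, chips, and permits. The talks are at an early stage and may need congressional approval, and other AI labs have not agreed to follow suit.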

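小互@xiaohu · 13 hours ago · 32

Folks, here's a deal: ChatGPT is running a promotion at 50% off, so Plus costs just $10... For now the discount seems to apply only to Plus; other plans don't get it. Discount link in the first reply ↓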

Tibo@thsottiaux · 14 hours ago · 26

Can't wait to see what people will do with GPT-5.6 Sol Ultra. Stash your hardest prompts somewhere.

🚨 AI News | TestingCatalog@testingcatalog · 15 hours ago · 75

OPENAI 🔥: US government may get a 5% stake in OpenAI worth $42.6 billion, according to Financial Times and CNBC.

> OpenAI proposes handing the U.S. government a 5% stake in the company, according to a report in the Financial Times.
> The potential holding would be worth roughly $42.6 billion at the artificial intelligence startup’s recent $852 billion valuation.
> OpenAI CEO Sam Altman reportedly argued the move was the best way to share the upside of AI with the public.

Chubby♨️@kimmonismus · 15 hours ago · 67

OpenAI proposes handing Trump administration 5% stake. Here's why:

According to FT, Sam Altman has discussed giving the US government a 5% stake in OpenAI as the $852bn startup faces rising scrutiny from Washington. Sam's pitch: if AI creates enormous wealth, the public should own part of the upside. OpenAI has already proposed a “public wealth fund” that “provides every citizen - including those not invested in financial markets - with a stake in AI-driven economic growth.” The talks are early, may need Congress, and other AI labs have not agreed to mirror the idea.

Imho the real reasons are the following:

- Regulatory insurance. A 5 percent stake would probably be cheaper than years of political gridlock, strict release rules, or a later special tax on AI profits.
- IPO preparation. OpenAI wants to go public eventually. Having the US state as an indirect stakeholder could make political risks appear smaller to investors.
- Model-release pressure. The article says OpenAI and Anthropic have delayed new frontier models due to US scrutiny. A public stake would be one way to turn Washington from an opponent into a co-beneficiary.
- Data centers, energy, permits. AI is no longer just software. OpenAI needs power, land, chips, local permits, and political support. A government stake could help with infrastructure expansion.

Chubby♨️@kimmonismus · 15 hours ago · 33

Normally, I wouldn’t pay much attention to statements like this from Sam Altman. But given all the extraordinary developments we’re witnessing right now, I truly believe we’re at the forefront of an unprecedented revolution. Sam Altman: "In another year or two, we expect to have built systems with astonishing power, capable of delivering tremendous value to the world. Artificial intelligence will reshape the material conditions of human life on a scale that no technology has accomplished since the harnessing of electricity, and perhaps beyond even that"

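Summary: In the Financial Times, Sam Altman said that within a year or two OpenAI expects to have built AI systems of astonishing power, reshaping the material conditions of human life on a scale beyond any technology since the harnessing of electricity. The quoted post adds: AGI (replacing most white-collar jobs) is expected in 2029; OpenAI is targeting an August release for GPT-6, which is expected to beat GPT-5 on every benchmark, with another step change to follow in the months after. We are at the forefront of this unprecedented revolution.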

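宝玉@dotey · 18 hours ago · 60

OpenAI proposes giving the US government a 5% stake so that ordinary people can share in the "AI dividend"

OpenAI is working on an unprecedented plan: the artificial intelligence startup, valued at $852 billion, is exploring handing 5% of its shares to the US government.

According to people familiar with the matter, since President Trump began his second term, OpenAI CEO Sam Altman has held preliminary discussions with several senior US government officials about the possibility of the federal government taking stakes in large AI companies. As early as the start of 2025, Altman pitched the idea directly to President Trump, hoping that giving the public an economic interest in the company would spread the benefits of AI while also clearing near-term political obstacles.

Why take such a rare step? Because the pace of AI development is already staggering. Systems that until recently existed only in science fiction are now widely deployed by companies and governments around the world. AI's importance for economic value, national security, and accelerating scientific discovery is already clear. In just another year or two, humanity is expected to build systems of astonishing power that deliver enormous value to the world. This technology's reshaping of the material conditions of human life will match or even exceed that of the harnessing of electricity.

To handle a wealth explosion large enough to change the world, the proposal calls for a "Public Wealth Fund".

What practical benefit would the fund bring ordinary people? Put simply, it works like a dividend pool shared by everyone. Policymakers and AI companies would work together to provide seed capital, investing in diversified assets with long-term growth potential, including both the AI companies themselves and the wider set of businesses adopting and deploying AI. The fund's returns would be paid directly to every citizen. That means that whatever your starting wealth, and whether or not you normally have a way to invest in financial markets, you can participate directly in the gains from AI-driven economic growth.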

Rohan Paul@rohanpaul_ai · 18 hours ago · 59

FT: OpenAI has proposed giving Washington 5% of its $852B business to ease AI pressure. The idea borrows from Alaska’s oil fund, which shares resource wealth with residents. Here, the resource is not oil, but future income from advanced AI systems. OpenAI also wants other major AI companies to give similar 5% stakes. Anthropic, Google, Meta, and others have not agreed to join this plan. No deal exists yet. The mechanism would likely be: OpenAI gives shares to a government-linked fund, that fund holds them, and future IPO gains or dividends support public payouts. The hard part is legality. The legal route is unclear, and a deal may need Congress, especially if the government creates a formal public fund. The Intel deal made this idea less theoretical after taxpayers received a 9.9% stake. OpenAI has already proposed a public wealth fund giving citizens AI-linked financial upside. Shareholders matter a lot here. OpenAI Foundation owns 26%, Microsoft owns about 27%, and employees plus other investors own 47%. A new 5% stake could dilute everyone unless the shares come from an existing holder. So OpenAI’s board, Foundation, Microsoft, major investors, and maybe regulators would need to accept the structure. The cleanest path would be non-voting shares placed in a public wealth fund, so the government gets upside but not control. The messiest path would be voting shares, because then Washington becomes both regulator and part-owner.
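
The dilution point in the post can be made concrete with a little arithmetic. The following is a minimal sketch, assuming (hypothetically, since no deal structure has been reported) that the 5% stake is created by issuing new shares rather than transferred from an existing holder. The ownership percentages are the ones quoted above; the function and dictionary names are illustrative only.

```python
# Ownership percentages as quoted in the post (they sum to 100).
CURRENT_HOLDERS = {
    "OpenAI Foundation": 26.0,
    "Microsoft": 27.0,
    "Employees and other investors": 47.0,
}


def dilute(holders: dict[str, float], new_stake_pct: float) -> dict[str, float]:
    """Scale every existing holder down so a new holder ends up with new_stake_pct.

    Assumption: the stake is funded by newly issued shares, so each existing
    holder keeps the same share count but a smaller slice of a larger total.
    """
    keep = (100.0 - new_stake_pct) / 100.0
    diluted = {name: pct * keep for name, pct in holders.items()}
    diluted["US government (hypothetical)"] = new_stake_pct
    return diluted


if __name__ == "__main__":
    for name, pct in dilute(CURRENT_HOLDERS, 5.0).items():
        print(f"{name}: {pct:.2f}%")
    # Value of a 5% stake at the reported $852B valuation, in billions of USD.
    print(f"Stake value: ${852 * 0.05:.1f}B")
```

Under this assumption each existing holder keeps 95% of its current slice, so Microsoft's roughly 27% would become about 25.65%. If the shares instead came from a single existing holder, only that holder's slice would shrink.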

DogeDesigner@cb_doge · 19 hours ago · 55

BREAKING: OpenAI has reportedly discussed giving the U.S. government a 5% ownership stake. Sam Altman says this would help ensure Americans share in the success of AI. He has also proposed that other leading U.S. AI companies consider doing the same, as per The Financial Times.

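Greg Brockman@gdb · 23 hours ago · 47

Codex for making a personalized daily digest:

Summary: Codex now creates a "newspaper" for me every morning, with unread messages, calendar, surf report, and news. Anything that keeps me off my phone until later in the day is a priority. Greg Brockman presents this as a way to use Codex to build a personalized daily digest.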

Rohan Paul@rohanpaul_ai · 1 day ago · 53

Fable 5 absolutely crushed the HTML5 physics contest, but cost 6x more than Opus 4.8 and 39x more than GLM 5.2 in that test. The test was run on atomic[.]chat, a desktop app that runs LLMs locally. It asked 4 models to generate self-contained canvas demos with believable motion and collisions. The scenes were not simple animations because every crash needed gravity, force, timing, and contact handling.

Outputs:
- Fable 5: 62,158 tokens, $3.12
- GPT 5.5: 37,753 tokens, $1.14
- Opus 4.8: 22,280 tokens, $0.56
- GLM 5.2: 36,246 tokens, $0.08

Summary: In the HTML5 physics contest on atomic.chat, Fable 5 completed all three scenes (train derailment, mid-air car collision, monster truck crush) with an A+ grade. GPT 5.5 narrowly beat Fable in the monster truck scene, while GLM 5.2 did not win any scene. Fable 5 delivered the best quality at the highest cost.
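
The cost multiples quoted in the post can be checked directly from the reported figures. The following is a minimal sketch, assuming the token counts and dollar amounts are per-model totals for the whole test; the names `RESULTS`, `cost_ratio`, and `usd_per_1k_tokens` are illustrative and do not come from atomic.chat.

```python
# Reported totals from the post: (tokens, total cost in USD).
# Assumption: each figure covers the whole three-scene test for that model.
RESULTS = {
    "Fable 5": (62_158, 3.12),
    "GPT 5.5": (37_753, 1.14),
    "Opus 4.8": (22_280, 0.56),
    "GLM 5.2": (36_246, 0.08),
}


def cost_ratio(model: str, baseline: str) -> float:
    """How many times more `model` cost than `baseline` in this test."""
    return RESULTS[model][1] / RESULTS[baseline][1]


def usd_per_1k_tokens(model: str) -> float:
    """Effective blended price implied by the reported totals."""
    tokens, cost = RESULTS[model]
    return cost / tokens * 1000


if __name__ == "__main__":
    for name in RESULTS:
        print(
            f"Fable 5 cost {cost_ratio('Fable 5', name):.1f}x {name}; "
            f"{name} paid ${usd_per_1k_tokens(name):.4f} per 1k tokens"
        )
```

On these figures the "6x" versus Opus 4.8 is about 5.6x rounded up, and the 39x versus GLM 5.2 holds as stated. The gap comes from both a higher effective price per token and a larger token count.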

Greg Brockman@gdb · 1 day ago · 26

you can just reset rate limits on things

Summary: All Codex Go/Plus/Pro subscribers worldwide received a one-time rate limit reset on their accounts. Greg Brockman's comment: you can just reset rate limits.

elvis@omarsar0 · 1 day ago · 33

I really wish GPT-5.5 had a bit more "taste" in design and planning. For everything else related to code, it's the best model. I hope GPT-5.6 closes the gap. It would feel more complete then. For now, I switch to Opus 4.8/GLM-5.2 to fix design issues or when I plan.

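Ethan Mollick@emollick · 1 day ago · 47

Yes! Pre-classifying routers are going to result in a lot of bad work because routing is hard and routers tend to underestimate the value of intelligence on many problems. OpenAI learned this with GPT-5, now it seems routers are hot again.

Summary: Ethan Mollick points out that pre-classifying routing (judging a task's difficulty first, then assigning a model) looks like it saves cost and latency, but routing is hard in practice and tends to underestimate the value of intelligence on many problems. OpenAI learned this lesson with GPT-5, and the idea is now popular again. @MParakhin adds that running a pre-classifier reliably requires solving the task itself first, and that the only right way is the advisory model approach.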

elvis@omarsar0 · 1 day ago · 38

There is no if. You can just combine the latest OpenAI model (even GPT-5.5) with other models like Opus-4.8 / GLM-5.2, and you are good. GPT-5.6 or the next frontier model will only elevate things further. Direct model comparison is just the wrong way to think going forward.

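Ethan Mollick@emollick · 1 day ago · 47

Yes! Pre-classifying routers are going to result in a lot of bad work because routing is hard and routers tend to underestimate the value of intelligence on many problems. OpenAI learned this with GPT-5, now it seems routers are hot again.

Summary: Ethan Mollick says pre-classifying routers lead to bad results because routing itself is hard and often underestimates the value of intelligence. OpenAI was burned by this with GPT-5, and the idea is heating up again. The quoted @MParakhin agrees: using a pre-classifier to decide whether a task is simple before calling a small model looks like it saves money and latency, but doing it reliably requires solving the task itself first, so the only workable option is the advisory model approach.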

elvis@omarsar0 · 1 day ago · 50

My prediction: the excitement for Fable 5 will wear off really fast. Reposting this to help those who will be extremely disappointed after they play with Fable 5 and run out of tokens or can't do much with it. Just a bit of advice on how to leverage a combination of AI models to get the same or better results. The best part is that there are many ways to do this now, including mixing with frontier open-weight models.

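Summary: The author predicts the excitement around Fable 5 will fade quickly and warns users about its token limits and functional constraints. He recommends combining several AI models (for example, Opus 4.8 for planning and GPT-5.5 for execution) to get the same or better results, optionally mixing in frontier open-weight models. He adds that breaking tasks into smaller sub-steps to raise quality is underrated, which is why dynamic workflows matter.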

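Tibo@thsottiaux · 1 day ago · 24

It's happening

Summary: If you're at the @aiDotEngineer conference, head to the OpenAI booth now! At 1 pm you'll get to see the Codex reset button in action. Rumor has it that after today it goes back into a top-secret underground vault. It's happening.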

Rohan Paul@rohanpaul_ai · 1 day ago · 62

UBS says about 60% of big companies are slowing AI spending. CFOs and CTOs are very focused on rising bills, while ROI still looks uneven. So executives are adding guardrails, cutting tools, and forcing tighter usage rules. That is, enterprise AI is leaving its trial phase and becoming an engineering budget problem. The new discipline is about routing tasks to cheaper models without hurting output quality. That shift could pressure OpenAI and Anthropic first, because usage-based revenue depends on volume. Open-source and Chinese models could gain share when tasks need cost control over peak reasoning. Last week JP Morgan research published a report saying Chinese AI models are up to 50 times cheaper than their American counterparts on a per-token basis. The report said Chinese firms accounted for over 45% of all traffic on the AI aggregation platform OpenRouter by April 2026, up from under 2% in late 2024. Google is already pushing Gemini 3.5 Flash as a faster, efficiency-focused model. Anthropic’s Claude Sonnet 5 also arrives as buyers ask for capable, cheaper autonomy. --- businessinsider .com/ubs-enterprises-ai-spending-tokens-2026-7

Summary: A UBS report says about 60% of large companies are slowing AI spending, with CFOs and CTOs focused on rising bills and uneven ROI; enterprise AI is moving into budget control, and tasks are being routed to cheaper models. The pressure falls first on OpenAI and Anthropic. JP Morgan research shows Chinese AI models cost up to 50 times less per token than US ones, and Chinese firms' share of traffic on OpenRouter rose from under 2% (late 2024) to over 45% (April 2026). An arXiv study finds that US chip export controls accelerated China's open-source AI ecosystem. Google and Anthropic have released the efficiency-focused Gemini 3.5 Flash and Claude Sonnet 5, respectively.

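Ethan Mollick@emollick · 1 day ago · 61

You really need to benchmark models for your use case. As soon as judgements & decisions stack on top of each other, the differences between models amplify, and no standard benchmark will tell you that Gemini 3.1 is less worried about financial losses at a cafe than GPT-5.5

Summary: The main post stresses benchmarking against your actual use case, because model differences are amplified as decisions stack up, and standard benchmarks won't show that Gemini 3.1 cares less about a café's financial losses than GPT-5.5 does. The quoted case: Andon Labs' AI agent ran a café in Stockholm on Gemini 3.1 Pro, over-ordered, and was easily tricked; it spent $15k against only $9k in revenue, a $6k loss, and has now been switched to GPT-5.5.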

Chubby♨️@kimmonismus · 1 day ago · 71

OpenAI’s chief economist says AI may complement workers, but the labor-market data is already getting less comfortable. At the ECB’s Sintra retreat, Ronnie Chatterji (OpenAI) argued AI does not have to substitute jobs, comparing it to the PC making economists more productive. Bloomberg shows something different: in US financial activities and information, where AI adoption has been fastest, payrolls are now falling by 28,000 a month on average in 2026. Challenger, Gray & Christmas says almost 102,000 announced job cuts have been attributed to AI so far this year. John Challenger: “It’s certainly making an impact as we speak in a way that no technology has before.” Tough times ahead, especially in the labor market.

Chubby♨️@kimmonismus · 1 day ago · 41

If true, this would be much bigger than just another model release. Memory efficiency is one of the core bottlenecks for long-context models, agents, and inference economics. A real architecture-level breakthrough here could make longer-horizon AI systems dramatically cheaper and more practical. Andrew is one of the most reliable sources. Therefore, I'm taking this very seriously. We could truly be at a turning point.

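Summary: @AndrewCurran_ predicts that a major architecture breakthrough, focused on memory efficiency, will be announced soon by a team spun out of OpenAI (not SSI). The main post by @kimmonismus says that, if true, this matters far more than an ordinary model release: memory efficiency is a core bottleneck for long-context models, AI agents, and inference costs, so an architecture-level breakthrough could make long-horizon AI systems far cheaper and more practical. He regards Andrew as one of the most reliable sources and thinks we may be at a turning point.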

Greg Brockman@gdb · 1 day ago · 56

Introducing GeneBench-Pro: testing whether models can handle the kind of judgment-heavy analysis that real-world computational biology requires. Problems would take a human expert around 20-40 hours to complete. GPT-5.6 Sol is a big step forward.

Summary: OpenAI introduced GeneBench-Pro, a research-level benchmark that tests how well AI agents handle the complex, judgment-heavy analysis required in real-world computational biology. Each problem would take a human expert about 20-40 hours. Greg Brockman says GPT-5.6 Sol is a big step forward on it.

Greg Brockman@gdb · 1 day ago · 13

Codex has gotten very good

Summary: QuinnyPig admits he had underestimated Codex and now finds it excellent.

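jason@jxnlco · 1 day ago · 32

This is the future

Summary: Right now Codex is using Computer Use to organize the 1,500 PDFs I have in GoodNotes while I watch the World Cup. This is my "AI does the laundry, I make art" moment. Thanks to @jxnlco and team. This is the future.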

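Peter Steinberger 🦞@steipete · 1 day ago · 33

Price per token != cost per task

Summary: The quoted post by @scaling01 argues that Sonnet 5 is overpriced: 1.2x more expensive than Opus 4.8 Max, 2x more than GPT-5.5-xhigh, 5x more than GLM-5.2, 7x more than Kimi-K2.6, and 57x more than DeepSeek-V4-Pro. The main post counters that price per token is not cost per task.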
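
The distinction between token price and task cost comes down to one multiplication. The following is a minimal sketch using made-up prices and token counts purely for illustration; none of the numbers or model names below come from the posts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    """One model's attempt at a task. All values here are hypothetical."""

    model: str
    usd_per_million_tokens: float  # list price
    tokens_used: int  # tokens the model needed to finish the task

    @property
    def cost_per_task(self) -> float:
        return self.usd_per_million_tokens * self.tokens_used / 1_000_000


# Hypothetical example: the model with the higher token price finishes in
# far fewer tokens, so it ends up cheaper per completed task.
pricey_but_terse = Run("model-a", usd_per_million_tokens=30.0, tokens_used=10_000)
cheap_but_verbose = Run("model-b", usd_per_million_tokens=10.0, tokens_used=50_000)

if __name__ == "__main__":
    for run in (pricey_but_terse, cheap_but_verbose):
        print(f"{run.model}: ${run.cost_per_task:.2f} per task")
```

In this illustration the model that costs three times as much per token is still cheaper per task, because it uses one fifth of the tokens. A price comparison therefore needs token usage measured on the actual workload.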

Rohan Paul@rohanpaul_ai · 1 day ago · 58

atomic[.]chat, a desktop app that runs LLMs locally, ran a very revealing comparison for Claude Sonnet 5, Claude Opus 4.8, Claude Sonnet 4.6, and GPT 5.5. Claude Sonnet 5 just matched GPT 5.5 on 3 physics coding demos at 6x lower cost. It also spent the fewest tokens.

- Sonnet 5: 15,047 tokens, $0.15
- Opus 4.8: 23,063 tokens, $0.58
- Sonnet 4.6: 25,824 tokens, $0.39
- GPT 5.5: 31,152 tokens, $0.94

Summary: atomic.chat gave each model the same prompt to build three HTML5 physics collision demos (car hitting a wall, wrecking ball demolishing a house, catapult striking a castle). Sonnet 5 was on par with GPT 5.5 and Opus 4.8 across all tests, beating Opus 4.8 in the wrecking ball scene and GPT 5.5 in the catapult scene. Sonnet 5 used the fewest tokens, though its graphical detail still has room to improve.

Greg Brockman@gdb · 1 day ago · 62

Personal finance now available for ChatGPT Plus in the U.S.

ChatGPT@ChatGPTapp · 2 days ago · 61

Questions about dollars. Answers that just make sense. Personal finance in ChatGPT is now available to Plus users in the U.S.

Rohan Paul@rohanpaul_ai · 2 days ago · 61

The Information reports that OpenAI has cut inference costs by more than half on some existing models, while logged-out ChatGPT traffic ran on only a couple hundred Nvidia GPUs. The obvious guesses include quantization, KV-cache changes, batching, speculative decoding, and routing easy queries more cheaply. If true, it will be a huge competitive lever: lower cost can raise margins, expand usage limits, or reduce pressure on API pricing. For some context, OpenAI’s adjusted gross margin fell to 33% in 2025 from 40% in 2024, after inference costs quadrupled. Some reporting now puts Q1-2026 at 39%, with a 52% target by year-end. Anthropic looks similar at roughly 44%, so frontier labs remain far below mature software economics. --- theinformation .com/newsletters/ai-agenda/openai-discovers-new-way-cut-inference-costs-half

Chubby♨️@kimmonismus · 2 days ago · 56

OpenAI achieved a much more significant breakthrough today. Sonnet 5 is an average release. But the fact that OpenAI, according to The Information, has managed to more than halve the inference costs of its current models through a new approach to inference optimization is absolutely groundbreaking. And when you also consider that they recently introduced their own inference chip with Broadcom, which is said to be faster and more efficient than the competition, I increasingly see OpenAI in an outstanding position. Today, at least, OpenAI emerges as the winner of the day.

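Summary: The author argues that OpenAI's breakthrough today is the bigger story: a new inference-optimization approach that cuts inference costs by more than half, plus a faster, more efficient inference chip built with Broadcom, puts OpenAI in a strong position. Sonnet 5, by contrast, is an ordinary release. The quoted post adds that Sonnet 5 is better than Sonnet 4.6 but weaker than Opus 4.8 at unchanged pricing, that the jump in version number from 4 to 5 is not justified, and that it may be an interim release meant to keep attention, making it a disappointment overall.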

OpenAI Developers@OpenAIDevs · 2 days ago · 26

As agents take on longer-running work, engineering shifts to setting direction, reviewing work, and designing better systems around the models. @steipete at @aiDotEngineer

All AI Updates
Full feed of AI-related news
July 3
05:04
jason@jxnlco
16
Romain Huet: Last chance to come hang out with us today at the OpenAI booth at @aiDotEngineer! At 2pm, we're excited to host the one ...

Tags: OpenAI, Industry news
04:38
OpenAI Developers@OpenAIDevs
45
OpenAI Developers: June update

Tags: OpenAI, Product updates, Coding
04:04
jason@jxnlco
54
Vignesh Mohankumar: i've got codex... - reading all my emails to figure out proposals to write, directly in google drive - auto-drafting con...

Tags: Agents, OpenAI, Expert opinions, Coding
02:34
Epoch AI@EpochAIResearch
44
Tags: OpenAI, Evals/Benchmarks
01:04
jason@jxnlco
15
Tags: Agents, OpenAI, Other, Coding
00:33
Emad@EMostaque
23
Tags: Anthropic, OpenAI, Expert opinions
July 2
23:59
Chubby♨️@kimmonismus
25
Chubby♨️: The only question remaining now is: will GPT-5.6 also have guardrails as strict as Fable 5's, or does OpenAI have better...

Tags: Google, OpenAI, Expert opinions
22:29
Chubby♨️@kimmonismus
23
Tags: OpenAI, Safety/Alignment
19:29
Chubby♨️@kimmonismus
60
The real motives behind OpenAI's proposed 5% stake for the US government

Chubby♨️: OpenAI proposes handing Trump administration 5% stake. Heres why: According to FT, Sam Altman has discussed giving the U...

Tags: OpenAI, Trends
18:09
小互@xiaohu
32
Tags: OpenAI, Industry news
17:35
Tibo@thsottiaux
26
Tags: OpenAI, Expert opinions
16:02
🚨 AI News | TestingCatalog@testingcatalog
Featured · 75
Andrew Curran: OpenAI is proposing handing over a 5% stake to the Trump administration according to the Financial Times.

Tags: OpenAI, Industry news
Related discussion (2): The Verge: AI (RSS), The Decoder: AI News (RSS)
Why it's recommended: When an AI giant valued at $852 billion volunteers to hand the government a 5% stake, this is no longer ordinary lobbying but a potentially landmark step that could redefine the public-private relationship. I think the long-term impact of this will run deeper than any model release.
15:52
Chubby♨️@kimmonismus
67
OpenAI proposes giving the Trump administration a 5% stake

Tags: OpenAI, Industry news
15:52
Chubby♨️@kimmonismus
33
Sam Altman predicts AI transformation on par with electricity; GPT-6 targeted for August

Chris: Sam Altman in the financial times: "In another year or two, we expect to have built systems with astonishing power, capa...

Tags: OpenAI, Expert opinions
13:36
宝玉@dotey
60
OpenAI proposes giving the US government a 5% stake to share the AI dividend

Andrew Curran: OpenAI is proposing handing over a 5% stake to the Trump administration according to the Financial Times.

Tags: OpenAI, Industry news
13:05
Rohan Paul@rohanpaul_ai
59
OpenAI proposes giving the US government 5% of its $852 billion business to ease AI regulatory pressure

Tags: OpenAI, Policy/Regulation
12:36
DogeDesigner@cb_doge
55
Tags: OpenAI, Policy/Regulation, Industry news
08:02
Greg Brockman@gdb
47
Ryan Doyle: surprised more people aren't doing something like this Codex now creates a "newspaper" for me every morning Unread messa...

Tags: Agents, OpenAI, Trends
07:34
Rohan Paul@rohanpaul_ai
53
Fable 5 excels in HTML5 physics contest but costs 6x Opus 4.8 and 39x GLM 5.2

atomic.chat: Fable 5 totally crushed our new contest, but it cost 6x more than Opus 4.8! We gave 4 models the same prompt: build thre...

Tags: Anthropic, OpenAI, Inference, Coding
06:32
Greg Brockman@gdb
26
dominik kundel @aiDotEngineer: 🖲️ We love our community! To celebrate getting together with many of you we brought @thsottiaux's reset button to AIE W...

Tags: OpenAI, Product updates
06:07
elvis@omarsar0
33
Tags: Anthropic, OpenAI, Expert opinions, Coding
05:29
Ethan Mollick@emollick
47
Mikhail Parakhin: I have this struggle with my own teams, too: many think it is a great idea to save money/latency/sanity by running a pre...

Tags: OpenAI, Expert opinions, Inference
05:07
elvis@omarsar0
38
Tyler: If GPT-5.6 matches Fable 5 performance, but without the 50% limit + 7 days restriction, it's over for Anthropic

Tags: Anthropic, OpenAI, Expert opinions
04:59
Ethan Mollick@emollick
47
Mikhail Parakhin: I have this struggle with my own teams, too: many think it is a great idea to save money/latency/sanity by running a pre...

Tags: OpenAI, Expert opinions, Inference
04:37
elvis@omarsar0
50
elvis: Same here. Happy with Opus 4.8 (planning) and GPT-5.5 (execution). Also, breaking steps into smaller ones for increasing...

Tags: Anthropic, OpenAI, Expert opinions, Inference
04:03
Tibo@thsottiaux
24
Romain Huet: Make your way to the OpenAI booth now if you're at @aiDotEngineer! 🚨 At 1pm, you'll get to see the Codex reset button i...

Tags: OpenAI, Industry news
03:33
Rohan Paul@rohanpaul_ai
62
UBS: about 60% of large companies are slowing AI spending; Chinese models hold a clear cost advantage

Rohan Paul: U.S. chip restrictions helped push China to build and spread open AI models. The authors tested this by looking at polic...

Tags: Anthropic, OpenAI, Open-source ecosystem, Trends
01:59
Ethan Mollick@emollick
61
Andon Labs: Gemini 3.1 Pro lost $6k running Andon Café. 2 months ago, our AI agent opened a café in Stockholm. It over-ordered and w...

Tags: Agents, Google, OpenAI, Trends
July 1
20:21
Chubby♨️@kimmonismus
71
OpenAI chief economist: AI complements jobs? The data already looks grim

Tags: OpenAI, Industry news
18:51
Chubby♨️@kimmonismus
41
Andrew Curran: I'm posting this prediction now so I can quote it later. There has been a significant breakthrough in architecture - spe...

Tags: OpenAI, Expert opinions, Inference
14:00
Greg Brockman@gdb
56
OpenAI: We're introducing GeneBench-Pro, a research-level benchmark for a harder kind of AI progress: how well agents can naviga...

Tags: Agents, OpenAI, Papers/Research
13:30
Greg Brockman@gdb
13
Corey Quinn: Okay I owe my @OpenAI friends an apology for sleeping on Codex. I was not aware how strong your game was. This is... rea...

Tags: OpenAI, Expert opinions, Coding
11:56
jason@jxnlco
32
Chris Albon: Right now Codex is using Computer Use to organize the 1500 PDFs I have in GoodNotes while I watch the world cup. This is...

Tags: Agents, OpenAI, Expert opinions
10:53
Peter Steinberger 🦞@steipete
33
Lisan al Gaib: Sonnet 5 goes straight into the garbage bin > 1.2x more expensive than Opus 4.8 Max > 2x more expensive than GPT-5.5-xhi...

Tags: Anthropic, OpenAI, Trends
08:32
Rohan Paul@rohanpaul_ai
58
atomic.chat: New Claude Sonnet 5 performs at GPT 5.5 level 6x cheaper! We gave 4 models the same prompt: build three self-contained H...

Tags: Anthropic, OpenAI, Coding, Evals/Benchmarks
08:29
Greg Brockman@gdb
62
ChatGPT: Questions about dollars. Answers that just make sense. Personal finance in ChatGPT is now available to Plus users in the...

Tags: OpenAI, Product updates
05:58
ChatGPT@ChatGPTapp
61
ChatGPT: A preview for Pro users: a new personal finance experience in ChatGPT. Pro users in the U.S. can securely connect financ...

Tags: OpenAI, Product updates
05:31
Rohan Paul@rohanpaul_ai
61
OpenAI cuts inference costs by more than half on some models; logged-out ChatGPT runs on just a few hundred GPUs

Tags: OpenAI, Inference, Industry news
04:50
Chubby♨️@kimmonismus
56
OpenAI halves inference costs and builds its own chip; Sonnet 5 launch is underwhelming

Chubby♨️: Here is my first assessment of Sonnet 5: Sonnet 5 is better than Sonnet 4.6. Who would have thought? But jokes aside: Un...

Tags: Anthropic, OpenAI, Expert opinions, Inference
03:31
OpenAI Developers@OpenAIDevs
26
Tags: Agents, OpenAI, Trends