Ten years ago, AlphaGo's legendary match in Seoul heralded the start of the modern era in AI. Its famous 'Move 37' signa...
Ten years ago, AlphaGo's legendary match in Seoul heralded the start of the modern era in AI. Its famous 'Move 37' signa...
What if your video generator could refine itself-at inference time? ❌No new models. ❌No retraining. ❌No external verifie...
Yann is just plain incorrect here, he's confusing general intelligence with universal intelligence. Brains are the most ...
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL wi...
Today we describe how we leverage AlphaEvolve, a @GoogleDeepMind system for iteratively evolving code, to morph snippets...
未来几周将推出新的计算密集型产品,部分功能仅限 Pro 订阅者,部分新产品需额外付费。尽管长期目标仍是降低智能成本并普及服务,但当前希望探索在高计算投入下能实现哪些新可能性。
1/n I'm really excited to share that our @OpenAI reasoning system got a perfect score of 12/12 during the 2025 ICPC Worl...
1/n I'm really excited to share that our @OpenAI reasoning system got a perfect score of 12/12 during the 2025 ICPC Worl...
关联讨论 1 条Google DeepMind:Blog(RSS)Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is "Defeating Nondetermin...
Claim: gpt-5-pro can prove new interesting mathematics. Proof: I took a convex optimization paper with a clean open prob...
Today I got the best peer review of my career (I'm a physicist with 150+ publications, Science, PRL). It was from GPT-5 ...
I'm actually really surprised GPT 5 thinking doesn't hallucinate this. Most AI models will hallucinate a benchmark if th...
GPT-5 is the first series of models that actually doesn't hallucinate basically at all, especially when given mildly bus...
It is mind-blowing to me that only 7% of Plus users were using o3.
today we are significantly increasing rate limits for reasoning for chatgpt plus users, and all model-class limits will ...
What a show! The Kaggle Game Arena AI Chess Exhibition Tournament is complete, and the winner is O3 🏆! A huge thank you...
🚀 Introducing Qwen3-4B-Instruct-2507 & Qwen3-4B-Thinking-2507 - smarter, sharper, and 256K-ready! 🔹 Instruct: Boosted ...
the openai IMO news hit me pretty heavy this weekend i'm still in the acute phase of the impact, i think i consider myse...
My bar for AGI is an AI winning a Nobel Prize for a new theory it originated.
We are seeing much faster AI progress than **Paul Christiano** and **Yudkowsky** predicted, who had gold in 2025 at 8% a...
So, all the models underperform humans on the new International Mathematical Olympiad questions, and Grok-4 is especiall...
i mostly use my visual intelligence when trying to solve this sota approaches to arc agi are mostly symbolic, vision doe...