Anthropic联合创始人描绘递归式AI改进如何超越人类监督者
阅读原文· the-decoder.comAnthropic联合创始人Jack Clark在长文中指出,AI系统训练其自身后继者所需的基础构件已基本就位。他预测到2028年底,AI实现递归式自我改进的可能性高达60%。这一进程可能使AI的进化速度超越负责监督的人类能力,引发对AI发展自主性的关键讨论。
Anthropic co-founder maps out how recursive AI improvement could outpace the humans meant to supervise it
Jack Clark argues in a long essay that the building blocks for AI systems training their own successors are largely in place. He puts the odds at 60 percent by the end of 2028.
In his newsletter Import AI, Anthropic co-founder Jack Clark says public data points to an imminent automation of AI research. What he means specifically is a system that can train a more powerful successor on its own, "no-human-involved." He pegs the odds at roughly 60 percent by the end of 2028, and 30 percent by 2027.
Clark builds his case mainly on benchmark trends. On SWE-Bench, which tests how well AI systems handle real-world GitHub issues, success rates jumped from about two percent (Claude 2, late 2023) to 93.9 percent, essentially saturating the benchmark. The METR time horizons measure, which tracks how complex a task an AI can complete at 50 percent reliability based on how many hours a skilled human would need, climbed from about 30 seconds with GPT-3.5 to roughly twelve hours with today's frontier models. METR researcher Ajeya Cotra thinks 100 hours by the end of 2026 is plausible.