The Second Scaling Law remains undefeated. If you want better hacking (or math, or science, or crossword puzzle solving) out of an LLM, just add thinking tokens. There doesn't seem to be any plateau so far.
Very important update from UK AISI. This is a meaningful change from the previous report. Here's what the new data would look like for "Mythos Preview (new)" wi...