归功于 GLM-5.2 Max,这个新的开放权重模型,成功完成了这个任务。 ...但你能看出它和 Fable 之间的区别,这种区别是基准测试无法体现的。GLM-5.2 给出了一首正确的诗(威尔士语很有趣),但 Fable 将消失的字母融入了诗歌主题。
Credit to GLM-5.2 Max, the new open weights model, for pulling this off.
…but you can see the difference between it and Fable in a way benchmarks don't show. GLM-5.2 gives a correct poem (&; the Welsh is fun) but Fable weaves the disappearing letters into the theme of the poem.