亚马逊员工"刷榜"内部AI排名,公司推出自动提示优化功能
阅读原文· the-decoder.com据报道,部分亚马逊员工为提升内部AI排行榜排名,正通过自动化手段执行非必要任务。与此同时,亚马逊为其Bedrock AI服务推出了自动提示词优化功能,旨在简化耗时的手动提示工程流程,根据任务不同可将模型性能最高提升22%。该功能已在Bedrock平台的Claude-3、LLaMA-3等多个模型上提供。尽管Anthropic和OpenAI等公司也提供了类似工具,但整个行业在准确评估这些自动优化结果的有效性方面仍面临普遍挑战。
"Tokenmaxxing" spreads at Amazon as employees game internal AI leaderboards
Amazon employees are automating unnecessary tasks just to climb internal AI leaderboards.
The in-house tool "MeshClaw" lets employees create AI agents that can trigger code deployments, triage emails, or interact with apps like Slack. But according to the Financial Times, staff are deliberately using the software to artificially inflate their token consumption.
"There is just so much pressure to use these tools," one Amazon employee said to the FT. "Some people are just using MeshClaw to maximise their token usage." The background: Amazon has set targets for more than 80 percent of developers to use AI each week, and earlier this year began tracking token consumption on internal leaderboards.