测试评估了AI金融建模智能体在构建苹果公司历史与预测财务报表这一真实分析师任务中的表现。其中,工具Primer表现突出,关键在于其生成了可审计的关联财务系统,而非逐单元格拼接的表格。Primer将Excel视为最终输出格式,先构建完整的三表模型,再将其转化为结构化记录(如收入、成本、假设、公式链接等),使AI能直接查询和验证财务逻辑。这指出专业AI智能体的价值将更多取决于其产出物能否通过审计。
WallStreetPrep did a very practical AI benchmarking exercise for real-world finance.
It tested financial modeling agents on a real analyst assignment, not a toy prompt with a neat answer key.
The task was a serious analyst job: build Apple's historical and forecast financial statements, cite sources, link assumptions, add schedules, and make the workbook auditable.
Primer, an AI financial modeling tool, came out ahead in this test, but the more useful point is why: its output looked less like a spreadsheet patched together cell by cell and more like a connected financial system that could be audited.
Primer treats Excel as the final output format, not the agent's working language, so the AI can build a stronger 3-statement financial model first and then convert it into an auditable spreadsheet.