Meta Harnesses是由斯坦福与DSPy作者提出的自动化框架生成技术,通过自动生成单文件Python程序(harness)来优化特定任务的提示词、检索与编排逻辑,实现无需人工干预的持续迭代。相比Autoresearch,其抽象层级更高,适用于结果可验证的特定领域任务(如数学推理、编程),能自动将问题分类并制定差异化策略,但在需要统一方法论的任务上存在局限。
Meta Harnesses is Autoresearch on steroids.
Something I've been exploring recently is to get long running agents to hill climb on a verifiable task to continuously improve without my intervention. Karpathy's Autoresearch did this pretty well on specific tasks, but this weekend I tried Meta Harnesses which moves one level of abstraction up.
What does Meta Harness do? Autoresearch can be used in harness like Claude Code / Codex to generate experiments to try, evaluate results, and continue looping. Meta Harness generates a harness itself that optimizes on a task or a set of task. Here, we define a harness as "a single-file Python program that modifies task-specific prompting, retrieval, memory, and orchestration logic". The idea is that LLMs are very powerful today, but to harness 【pun intended】 their power, you need to give it the right prompts and context. Meta Harnesses automates coming up with the right prompts and the right way to retrieve context to solve a problem.