Rohan Paul@rohanpaul_ai

2026-05-17 20:49·46天前

AI 摘要

研究指出，在编码智能体需精确定位证据（如符号、函数名、错误信息）的任务中，基于grep的精确字符串搜索比向量检索更具优势。关键在于，检索性能高度依赖智能体的设计框架——结果呈现方式（内联、文件或CLI）会极大影响搜索效果。论文挑战了“智能体栈必须始于嵌入”的默认假设，强调应区分任务类型：是语义发现问题，还是证据定位问题。对于后者，为模型提供原始工具、清晰上下文和精确搜索的框架，往往比构建复杂索引更有效。向量数据库在模糊语义搜索和大规模场景中仍有价值。

Is Grep All You Need？

The surprising result is not that grep is powerful， but that agent design makes it powerful.

The paper says not that grep beats vectors， but that agents fail or win through their harness.

That sounds like a small distinction until you look at what was actually tested.

The authors compare grep-style search and vector retrieval across LongMemEval tasks， where agents must recover facts from long conversation histories full of distractors. Inline grep beats inline vector across every harness-model pair in their main experiment， sometimes by wide margins.

The tempting headline is that vector databases are overbuilt for coding agents.

The better reading is sharper： when the answer is anchored in literal evidence， names， dates， file paths， function names， error strings， user preferences， grep gives the model a clean mechanical advantage.

Embeddings are built to tolerate paraphrase， but tolerance has a cost. They can pull in semantically nearby clutter， especially when a short agent query is vague.

Grep has the opposite failure mode. It is dumb， cheap， and narrow， but when the agent knows the right string to hunt for， dumb becomes a feature.

The deeper finding is that retrieval is not a component you can benchmark in isolation. The same search method behaves differently depending on whether results are injected inline， written to files， routed through a CLI， or wrapped in a custom agent loop.

So the question is not "Do we still need vector databases？"

The question is whether your agent is solving a semantic discovery problem or an evidence-location problem.

For coding agents， a surprising amount of work is evidence-location： find the symbol， trace the call， inspect the diff， read the failing test， recover the exact line.

Rohan Paul@rohanpaul_ai · X

63导出 Markdown