# The Grid：按任务难度定价的LLM推理平台

- 来源：Rohan Paul (@rohanpaul_ai)
- 发布时间：2026-05-28 05:46
- AIHOT 分数：67
- AIHOT 链接：https://aihot.virxact.com/items/cmpom7lt407a9slv4o2k0j51d
- 原文链接：https://x.com/rohanpaul_ai/status/2059753111925957092

## AI 摘要

The Grid推出新的LLM推理平台，用实时供需市场定价取代传统的固定费率。它按任务难度分层：简单任务（如分类）用“standard”，常规生产任务（如RAG、智能体步骤）用“prime”，高难度任务（如长上下文推理）用“max”。应用将请求发送至The Grid，平台会自动匹配该层级当前最便宜的可用供应商。开发者仍使用单一API，但后端模型可动态切换。新账户享受前200 million tokens免费额度。文中以Hermes Agent集成为例，展示了如何通过“agent-prime”层级处理工单。

## 正文

Most teams are overpaying for inference without realising it. Fixed rate cards have no competitive pressure.

The Grid replaces them with live supply and demand， prices track the market， not a vendor's margin.

The Grid sits in the middle and basically says， "Don't pick the model， pick the level of work you need."

A boring task like classifying support tickets does not need the smartest model， so it can run on standard.

A normal production task like RAG， drafting， support replies， or agent steps can run on prime.

A hard task with long context， high error cost， or difficult reasoning can run on max.

Your app sends the request to The Grid， not directly to OpenAI， Anthropic， or one hosting company.

The Grid then checks which suppliers currently qualify for that tier and sends the request to the cheapest one available at that moment.

You still use one API key and mostly the same code， but the model behind the request can change as prices and quality change.

So you stop paying premium prices for easy work， and also you are not trapped inside one vendor's model names， pricing， outages， or deprecations.

New accounts get the first 200 million tokens covered.

Here， I integrated Hermes Agent with The Grid in minutes， kept the agent running locally on my Ubuntu machine， and used "agent-prime" to read support tickets， apply a policy file， and write a triage report through The Grid's API.

You just need to
- install Hermes Agent
- select The Grid as a custom AI provider.
- No local model download. No GPU setup. The request goes through the grid.
- The Hermes Agent ran locally， but the AI calls went through The Grid.

🧵 1.
