尽管使用频率不高，Pro Max 5x 的流量配额仍在 1.5 小时内用尽

2026-04-12 21:55·81天前·cmaster11

AI 摘要

Claude Code Pro Max 5x 用户反馈，在 moderate usage（中等使用强度）下，流量配额仅 1.5 小时即耗尽。该问题已提交至 GitHub issue，引发对配额限制合理性的质疑。

原文 · 未翻译

Notifications You must be signed in to change notification settings

Fork 21.4k

Star 132k

[BUG] Pro Max 5x Quota Exhausted in 1.5 Hours Despite Moderate Usage #45756

Description

Preflight Checklist

I have searched existing issues and this hasn't been reported yet

This is a single bug report (please file separate reports for different bugs)

I am using the latest version of Claude Code

What's Wrong?

Pro Max 5x Quota Exhausted in 1.5 Hours Despite Moderate Usage

Summary

On a Pro Max 5x (Opus) plan, quota resets at a fixed interval. After reset, with moderate usage (mostly Q&A, light development), quota was exhausted within 1.5 hours. Prior to reset, 5 hours of heavy development (multi-file implementation, graphify pipeline, multi-agent spawns) consumed the previous quota window — but that was expected given the workload. The post-reset exhaustion was not.

Investigation reveals the likely root cause: cache_read tokens appear to count at full rate against the rate limit, negating the cost benefit of prompt caching for quota purposes.

Environment

Plan: Pro Max 5x

Model: claude-opus-4-6 (1M context)

Platform: Claude Code CLI on WSL2

Session: Single continued session with 2 auto-compacts

Data Collection Method

All data extracted from ~/.claude/projects/*//*.jsonl session files, specifically the usage object on each API response:

~/.claude/projects/*//*.jsonl

usage

{ "cache_read_input_tokens": ..., "cache_creation_input_tokens": ..., "input_tokens": ..., "output_tokens": ... }

Measured Token Consumption

Window 1: 15:00-20:00 (5 hours, heavy development)

Metric Value API calls 2,715 Cache read 1,044M tokens Cache create 16.8M tokens Input tokens 8.9k tokens Output tokens 1.15M tokens Peak context 966,078 tokens Effective input (cache_read at 1/10) 121.8M tokens

Workload: Full feature implementation (Express server + iOS app), graphify knowledge graph pipeline, SPEC-driven multi-agent coordination. 2 auto-compacts as context hit ~960k.

Window 2: 20:00-21:30 (1.5 hours, moderate usage)

Main session (vibehq):

Metric Value API calls 222 Cache read 23.2M tokens Cache create 1.4M tokens Input tokens 304 tokens Output tokens 91k tokens Peak context 182,302 tokens Effective input (cache_read at 1/10) 2.8M tokens

Hacker News 热门（buzzing.cc 中文翻译）

导出 Markdown