
Anthropic Doubled the Rate Limits: What Changed and What It Means for You


What Actually Changed

Anthropic doubled the rate limits on its Claude API across all model tiers. This is not a subtle backend adjustment; it is a meaningful change to the constraints that have shaped how developers build on the API.

The practical effect: if you have been working around rate limits in your Claude Code usage or API integrations, the new limits loosen those constraints considerably. A job that was bound by rate limits and used to take an hour can now finish in roughly half the time, provided you raise your request rate to match. Or you can run at the same pace with more headroom before hitting limits.
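The halving claim is easy to check with a back-of-the-envelope estimate. The sketch below assumes a batch that is limited only by rate limits, not by model latency, and every number in it is hypothetical rather than an actual tier value.

```python
def minutes_to_finish(num_requests, tokens_per_request, rpm_limit, tpm_limit):
    """Estimate minutes for a batch that is purely rate-limit bound."""
    # How many requests per minute the token budget allows.
    rpm_from_tokens = tpm_limit / tokens_per_request
    # Whichever ceiling is lower sets the effective rate.
    effective_rpm = min(rpm_limit, rpm_from_tokens)
    return num_requests / effective_rpm


# Hypothetical limits, for illustration only (not real tier numbers).
old_minutes = minutes_to_finish(3000, 2000, rpm_limit=50, tpm_limit=100_000)
new_minutes = minutes_to_finish(3000, 2000, rpm_limit=100, tpm_limit=200_000)
print(old_minutes, new_minutes)
```

With these made-up inputs the batch drops from 60 minutes to 30. If only one of the two limits had doubled, the other would become the binding ceiling and the speedup would be smaller.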

What the Numbers Look Like

The exact numbers vary by tier and plan, but the direction is consistent: twice the requests per minute, twice the tokens per minute, and significantly higher burst tolerances. Opus, Sonnet, and Haiku all see meaningful increases.

Because requests per minute and tokens per minute rise together, the relief applies whether your workload is bound by many small calls or by a few large ones. If you were constrained before, the new limits give you room to use Claude Code the way you wanted to rather than the way the limits forced you to.

Who This Actually Helps

The developers who benefit most are the ones who have been rationing Claude Code usage due to rate limits. If you have been throttling API calls, limiting concurrent sessions, or choosing smaller models to stay under caps, the new limits change those calculations.

Auto Mode users benefit significantly. Claude Code running in Auto Mode can burn through rate limits faster since it executes without pausing for confirmation. The doubled limits give Auto Mode more room to operate before hitting constraints that would interrupt a long-running task.

High-volume API consumers also benefit. If your application makes many sequential API calls (processing many documents, running continuous agents, handling long conversational threads), the higher per-minute ceiling means fewer interruptions and fewer retries.
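Higher ceilings make rate-limit errors rarer, not impossible, so a small retry wrapper is still worth keeping. Here is a minimal sketch of exponential backoff with jitter. `RateLimited` and `send` are hypothetical stand-ins for whatever error and request function your client exposes, not names from any real SDK.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical stand-in for the error your client raises on HTTP 429."""


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call send(), retrying with exponential backoff when it is rate limited."""
    for attempt in range(max_attempts):
        try:
            return send()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the error to the caller
            # Cap doubles each attempt: 1s, 2s, 4s, ... up to max_delay.
            cap = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: wait a random amount up to the cap so that
            # parallel workers do not all retry at the same moment.
            sleep(random.uniform(0, cap))
```

The `sleep` parameter exists only so the wrapper can be exercised in tests without real waiting. If your client library already retries on its own, prefer its built-in setting over a wrapper like this.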

What to Revisit

If you built Claude Code workflows that worked around rate limits (batch processing to stay under caps, model downgrades to avoid hitting limits, explicit throttling in your integration), it is worth revisiting those choices. The constraints that shaped those decisions have changed.
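Explicit throttling is the easiest workaround to revisit when the cap is a parameter rather than a hard-coded sleep. The sketch below is one possible client-side pacer, not a description of how any official tool throttles. The `RequestPacer` name and the limit passed to it are assumptions for illustration.

```python
import time


class RequestPacer:
    """Spaces calls evenly so at most `requests_per_minute` start per minute."""

    def __init__(self, requests_per_minute, clock=time.monotonic,
                 sleep=time.sleep):
        self.interval = 60.0 / requests_per_minute  # seconds between starts
        self._clock = clock
        self._sleep = sleep
        self._next_slot = clock()  # earliest time the next call may start

    def wait(self):
        """Block until the next call is allowed, then reserve the next slot."""
        now = self._clock()
        start = max(now, self._next_slot)
        if start > now:
            self._sleep(start - now)
        self._next_slot = start + self.interval


# Hypothetical limit: adopting a higher cap is a one-line change here.
pacer = RequestPacer(requests_per_minute=100)
```

Call `pacer.wait()` before each request. Doubling the argument halves the spacing between calls, and removing the pacer entirely only makes sense if your peak rate stays under the new cap.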

This does not mean throwing out everything you built around the old limits. It means evaluating whether the workarounds are still necessary. Many of them are not.

The Practical Summary

The rate limit increase is real and meaningful. If you have been hitting limits or building around them, you now have more room. Revisit the workflows where you made trade-offs to stay under the old caps. Some of those trade-offs do not need to exist anymore.
