Claude Code and Claude API Limits Rise Sharply: What Anthropic’s SpaceX Compute Deal Changes

On May 6, 2026, Anthropic announced changes many developers have been waiting for: Claude Code usage limits doubled, Claude API rate limits increased materially, and all updates took effect immediately.

The direct driver is a large-scale compute agreement between Anthropic and SpaceX.

What changed?

Anthropic published three changes that apply right away:

1. Claude Code 5-hour rolling limit doubled

For subscribers on Pro, Max, Team, and Enterprise (per-seat billing), the 5-hour rolling rate limit for Claude Code doubled.

Heavy coding workflows that used to hit the cap quickly—analyzing large repos, running Agent-style loops, or editing many files in parallel—now have more headroom.

2. Peak-hour throttling removed

Previously, Pro and Max users could see extra rate reductions during peak hours (often described as US business hours). Anthropic removed that policy.

For developers in China, this is especially practical: your daytime and evening work often overlaps US peak hours. You may have noticed slower responses or limits triggering sooner; that peak-time penalty is gone.

3. Claude Opus API rate limits raised

For flagship models such as claude-opus-4-6 and claude-opus-4-7, Anthropic raised API rate limits (official wording: “considerably”).

Teams running Opus in production for hard reasoning, long documents, or high-quality generation face less pressure when scaling throughput.

Why now? SpaceX Colossus 1 datacenter

The capacity behind these changes comes from Anthropic’s compute deal with SpaceX:

Site: SpaceX Colossus 1 datacenter
New capacity: more than 300 megawatts (MW)
GPUs: more than 220,000 NVIDIA GPUs
Timeline: brought online within about one month

This is not Anthropic’s first large infrastructure bet. Context from earlier partnerships:

Partner	Scale (as reported)
Amazon	Up to 5 GW; nearly 1 GW targeted online by end of 2026
Google + Broadcom	5 GW, rolling out from 2027
Microsoft + NVIDIA	$30B Azure capacity
Fluidstack	$50B US AI infrastructure investment
SpaceX (new)	300+ MW, 220K+ GPUs, ~one month to integrate

The step-up from MW to GW signals Anthropic is pre-building infrastructure for large-scale Agent workloads—not only future promises.

What it means for developers

Scenario 1: Long Claude Code sessions interrupt less often

Typical heavy use: multi-file refactors, or tight test → fix → verify loops. When you hit the 5-hour cap, you used to wait; doubling the window makes it easier to finish end-to-end workflows in one stretch.

Scenario 2: Agent pipelines get more breathing room

If you automate with the Claude API—scheduled jobs, report generation, user-facing agents—high concurrency used to mean more 429 rate limits and brittle production behavior. Higher Opus limits improve headroom for those systems (still monitor quotas and errors in your environment).

Scenario 3: Code review and doc generation throughput

Batch tasks (PR review, docstrings, comment generation) are rate-sensitive. Higher ceilings raise the throughput cap for those pipelines.

Scenario 4: No more “wait until US night” for big jobs

For China-based schedules aligned with US peak hours, removing peak throttling means you are less likely to defer large Claude Code jobs to off-peak US time.

Call Claude via ClaudeAPI.com: quick setup

If direct access to the Anthropic API is unreliable from your network, you can route through ClaudeAPI.com—a third-party API gateway, not an Anthropic product. Use the Anthropic SDK with a different base_url and your ClaudeAPI key.

Python (Anthropic SDK)

import anthropic

client = anthropic.Anthropic(
    api_key="your-api-key",
    base_url="https://gw.claudeapi.com"
)

message = client.messages.create(
    model="claude-opus-4-7",   # flagship; Anthropic raised Opus rate limits
    max_tokens=4096,
    messages=[
        {"role": "user", "content": "Analyze performance bottlenecks in this code"}
    ]
)
print(message.content[0].text)

import anthropic

client = anthropic.Anthropic(
    api_key="your-api-key",
    base_url="https://gw.claudeapi.com"
)

message = client.messages.create(
    model="claude-opus-4-7",   # flagship; Anthropic raised Opus rate limits
    max_tokens=4096,
    messages=[
        {"role": "user", "content": "Analyze performance bottlenecks in this code"}
    ]
)
print(message.content[0].text)

Node.js

import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic({
  apiKey: "your-api-key",
  baseURL: "https://gw.claudeapi.com",
});

const message = await client.messages.create({
  model: "claude-opus-4-7",
  max_tokens: 4096,
  messages: [{ role: "user", content: "Review boundary cases for this function" }],
});

console.log(message.content[0].text);

import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic({
  apiKey: "your-api-key",
  baseURL: "https://gw.claudeapi.com",
});

const message = await client.messages.create({
  model: "claude-opus-4-7",
  max_tokens: 4096,
  messages: [{ role: "user", content: "Review boundary cases for this function" }],
});

console.log(message.content[0].text);

Claude Code configuration

In your Claude Code config:

{
  "env": {
    "ANTHROPIC_BASE_URL": "https://gw.claudeapi.com",
    "ANTHROPIC_API_KEY": "your-api-key"
  }
}

{
  "env": {
    "ANTHROPIC_BASE_URL": "https://gw.claudeapi.com",
    "ANTHROPIC_API_KEY": "your-api-key"
  }
}

Note: The May 6 limit changes apply to Anthropic subscription / official API products as announced. Gateway rate limits on ClaudeAPI.com follow platform capacity and your plan—check the console for current quotas.

Model picks after the limit bump

Use case	Model	Why
Daily coding in Claude Code	`claude-sonnet-4-6`	Fast, lower cost, good for high-frequency turns
Architecture / large refactors	`claude-opus-4-7`	Strongest reasoning; Opus API limits raised
Doc generation	`claude-haiku-4-5-20251001`	High throughput, low cost
Production agents	`claude-sonnet-4-6`	Strong cost/performance with more rate headroom

Full pricing: ClaudeAPI.com pricing

FAQ

Do doubled limits apply to API keys on ClaudeAPI.com?

Anthropic’s May 6 announcement targets Claude Code subscriptions (Pro/Max/Team/Enterprise) and Anthropic API rate limits for Opus. ClaudeAPI.com is a separate gateway—confirm your quotas and model availability in the console.

Which Claude Code plans get the 2× 5-hour limit?

Pro, Max, Team, and Enterprise (per-seat) as stated in Anthropic’s announcement. Verify eligibility in Anthropic’s current product docs.

Did peak-hour throttling affect API or only Claude Code?

The announcement emphasizes peak-hour limits for Pro/Max Claude Code users. API behavior may still have its own rate tiers—monitor 429 responses and headers in production.

Why does SpaceX matter for developers?

The SpaceX deal adds 300+ MW and 220K+ GPUs at Colossus 1 on a ~one-month integration timeline—capacity Anthropic cites as enabling higher limits now, not only future roadmap slides.

Summary

The signal is clear: Anthropic’s compute build-out is starting to show up in everyday developer limits, not only in long-term press releases.

If you run long Claude Code engineering sessions or depend on Opus in production, it is a good time to re-test workflows you previously avoided because you expected to hit rate limits.

CTA

Get an API key and route the Anthropic SDK to the gateway:

ClaudeAPI.com

Based on Anthropic’s public announcement; technical figures are from vendor statements. ClaudeAPI.com is an independent gateway—not affiliated with Anthropic. For API access, visit claudeapi.com.

Claude Code and Claude API Limits Rise Sharply: What Anthropic’s SpaceX Compute Deal Changes