Amazon Q · Rate Limits

Amazon Q Rate Limits

Amazon Q APIs (qbusiness, qdeveloper) follow standard AWS API throttling per account/region. Q Business agentic-request quotas and Q Developer agentic-request limits are tracked per user/subscription and surface as plan-level quotas rather than per-second throttles. AWS recommends exponential backoff with jitter on ThrottlingException.

5 Limits Throttle: 400 Quota: 400
Rate LimitingGenAIAmazon Q

Limits

Q Business control-plane API account/region
varies
see Service Quotas console for Amazon Q Business
Standard AWS API throttling envelope; raisable via Service Quotas.
Q Business chat / ChatSync application
requests_per_second
see Service Quotas console for Amazon Q Business
Per-application concurrency and TPS limits documented in Service Quotas.
Q Developer agentic requests (Free tier) user
requests_per_month · month
50
Plan-level monthly quota; not a rate limit per second.
Q Developer agentic requests (Pro tier) user
requests_per_month
see Q Developer pricing page
Elevated quota - exact value increases over time; check console.
Code-transformation lines (Pro) account
lines_per_month · month
4000
Pooled across all Pro users in the account; overage at $0.003/LOC.

Policies

Backoff with jitter
AWS SDKs default to standard retry mode (truncated exponential backoff with jitter, max 20s, 3 attempts).
Plan vs throttle quotas
Per-user agentic-request limits behave as monthly quotas; exceeding triggers a quota error rather than throttling.
Quota increases
Account-level API throttle limits are raisable via Service Quotas or AWS Support.

Sources