Amazon Q Rate Limits
Amazon Q APIs (qbusiness, qdeveloper) follow standard AWS API throttling per account/region. Q Business agentic-request quotas and Q Developer agentic-request limits are tracked per user/subscription and surface as plan-level quotas rather than per-second throttles. AWS recommends exponential backoff with jitter on ThrottlingException.
Throttle: 400
Quota: 400
Limits
| Limit | Scope | Value | Notes |
| --- | --- | --- | --- |
| Q Business control-plane API | account/region | see Service Quotas console for Amazon Q Business | Standard AWS API throttling envelope; raisable via Service Quotas. |
| Q Business chat / ChatSync | application | see Service Quotas console for Amazon Q Business | Per-application concurrency and TPS limits documented in Service Quotas. |
| Q Developer agentic requests (Free tier) | user | 50 | Plan-level monthly quota, not a per-second rate limit. |
| Q Developer agentic requests (Pro tier) | user | see Q Developer pricing page | Elevated quota; the exact value increases over time, so check the console. |
| Code-transformation lines (Pro) | account | 4,000 | Pooled across all Pro users in the account; overage billed at $0.003 per line of code (LOC). |
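The overage arithmetic for code transformation can be sketched as follows. This is a minimal illustration, assuming the 4,000-line figure is the pooled allowance exactly as listed above and that only lines beyond it are billed; the function and constant names are hypothetical and not part of any AWS SDK.

```python
from decimal import Decimal

# Figures taken from the limits listed above; names are illustrative only.
INCLUDED_LINES = 4000            # pooled allowance for the account
OVERAGE_RATE = Decimal("0.003")  # USD per line of code beyond the allowance


def overage_cost(lines_transformed, included=INCLUDED_LINES, rate=OVERAGE_RATE):
    """Return the overage charge in USD for a pooled line count."""
    extra_lines = max(0, lines_transformed - included)
    return extra_lines * rate


if __name__ == "__main__":
    # 5,000 pooled lines is 1,000 lines over the allowance.
    print(overage_cost(5000))
```

Decimal is used instead of float so that small per-line rates do not pick up rounding noise when multiplied by large line counts.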
Policies
Backoff with jitter
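Standard retry mode in the AWS SDKs uses truncated exponential backoff with jitter, with backoff capped at 20 s and up to 3 attempts including the initial call. Most current SDKs use this mode by default; boto3 defaults to legacy mode unless standard mode is configured explicitly.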
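For callers that retry outside the SDK, the backoff policy described above can be sketched as follows. This is a minimal illustration of capped exponential backoff with full jitter, not the SDK's internal implementation: `ThrottledError` is a hypothetical stand-in for a ThrottlingException, and the 0.5 s base delay is an assumption; only the 20 s cap and 3 attempts come from the text.

```python
import random
import time


class ThrottledError(Exception):
    """Hypothetical stand-in for a ThrottlingException raised by a client."""


def backoff_delay(attempt, base=0.5, cap=20.0, rng=random.random):
    """Full-jitter delay in seconds for a zero-based retry attempt."""
    # Exponential growth, truncated at `cap`, then scaled by a random factor.
    ceiling = min(cap, base * (2 ** attempt))
    return rng() * ceiling


def call_with_backoff(fn, max_attempts=3, sleep=time.sleep):
    """Call fn(), retrying on ThrottledError up to max_attempts total tries."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except ThrottledError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the throttle to the caller
            sleep(backoff_delay(attempt))
```

The `sleep` and `rng` parameters are injectable so the policy can be exercised in tests without real waiting or randomness.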
Plan vs throttle quotas
Per-user agentic-request limits behave as monthly quotas; exceeding one returns a quota error rather than a throttling error.
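Because the two failure types call for different handling, a client can branch on the error code before deciding whether to retry. The sketch below is illustrative only: `ThrottlingException` is the code named in this document, while treating `ServiceQuotaExceededException` as the quota error code is an assumption that should be checked against the actual service response; the function name and return values are hypothetical.

```python
# Error code named in this document for throttled calls.
THROTTLE_CODES = {"ThrottlingException"}

# Assumption: the quota error arrives under this code; verify before relying on it.
QUOTA_CODES = {"ServiceQuotaExceededException"}


def classify_error(error_code):
    """Map an error code to a handling strategy.

    "retry": transient throttle, back off and try again.
    "stop":  plan-level quota exhausted, retrying immediately will not help.
    "raise": anything else, let the caller decide.
    """
    if error_code in THROTTLE_CODES:
        return "retry"
    if error_code in QUOTA_CODES:
        return "stop"
    return "raise"
```

Keeping the code sets as module-level constants makes it easy to adjust them once the real error codes have been confirmed.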
Quota increases
Account-level API throttle limits are raisable via Service Quotas or AWS Support.