Tetrate Rate Limits
Tetrate does not publish a rate-limits reference for its Agent Router (LLM Gateway / MCP Gateway / AI Guardrails) or its enterprise platform. The free Developer tier is bounded by free inference credits; enterprise throughput and concurrency are negotiated per engagement.
Tags: Rate Limiting, AI Gateway, Service Mesh
Limits
Developer Tier (Credit-Bounded)
Scope: account
Limit: bounded by free inference credits
Enterprise (Contract-Defined)
Scope: contract
Limit: defined per Tetrate enterprise engagement
Policies
Credit Exhaustion
Developer-tier traffic is paused or throttled once free inference credits are exhausted.
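Because no status codes or headers are published for this behavior, a client can only react defensively. The sketch below shows one way to do that: retry with exponential backoff and jitter, then give up with a clear error. It assumes throttling surfaces as HTTP 429, which is a common convention but not something Tetrate documents; `send`, `call_with_backoff`, and `CreditsLikelyExhausted` are hypothetical names introduced for illustration.

```python
import random
import time
from typing import Callable, Tuple

# Assumption: throttling is signalled with HTTP 429. This is not documented
# by Tetrate; adjust the set to whatever your account actually returns.
THROTTLED_STATUSES = frozenset({429})


class CreditsLikelyExhausted(RuntimeError):
    """Raised when every attempt was throttled."""


def call_with_backoff(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Return the body of the first response that is not throttled.

    `send` is a caller-supplied function that performs one request and
    returns (status_code, body). Non-throttled error statuses are returned
    as-is; handling them is left to the caller.
    """
    for attempt in range(max_attempts):
        status, body = send()
        if status not in THROTTLED_STATUSES:
            return body
        if attempt < max_attempts - 1:
            # Full jitter: wait a random time in [0, base_delay * 2^attempt].
            sleep(random.uniform(0, base_delay * (2 ** attempt)))
    raise CreditsLikelyExhausted(
        f"still throttled after {max_attempts} attempts; check remaining credits"
    )
```

If traffic is paused rather than throttled, retrying will not help, so the attempt count is kept small and the final error points the operator at the credit balance.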
Enterprise Capacity
Throughput, model routing fallbacks, and concurrency for the enterprise tier are sized with Tetrate during onboarding.
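Once a concurrency figure has been agreed, it can be useful to enforce it on the client side as well, so that bursts never exceed what was sized. A minimal sketch, assuming a thread-based client: `CONTRACT_MAX_CONCURRENCY` is a hypothetical placeholder (the real value comes from the contract), and `run_capped` is an illustrative helper, not part of any Tetrate SDK.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")

# Hypothetical placeholder: replace with the concurrency agreed during onboarding.
CONTRACT_MAX_CONCURRENCY = 8


def run_capped(
    task: Callable[[T], R],
    items: Iterable[T],
    max_concurrency: int = CONTRACT_MAX_CONCURRENCY,
) -> List[R]:
    """Run task(item) for every item with at most max_concurrency in flight.

    The worker pool size is the cap: no more than max_concurrency calls
    execute at once. Results are returned in input order.
    """
    with ThreadPoolExecutor(max_workers=max_concurrency) as pool:
        return list(pool.map(task, items))
```

A fixed-size pool is the simplest cap; it does not limit requests per second, only how many are in flight at once.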