Azure Databricks · Rate Limits
Microsoft Azure Databricks Rate Limits
Azure Databricks REST APIs enforce rate limits per workspace and per API. Most APIs target a few hundred requests per second per workspace; the Jobs API and the Files API have lower per-second targets. Some APIs (Statement Execution, Model Serving) use endpoint-specific limits. SCIM and account-level APIs are throttled separately.
5 Limits
Throttle: HTTP 429 (Too Many Requests)
Rate Limiting · Analytics · Microsoft Azure
Limits
Workspace API requests (scope: workspace)
See the Databricks limits page (typically hundreds of requests per second per workspace).
Jobs API runs (scope: workspace)
Workspace-configurable maximum number of concurrent runs.
Statement Execution API (scope: workspace)
See the Statement Execution API limits.
Model Serving endpoint QPS (scope: endpoint)
Configurable per endpoint, with a concurrency budget.
SCIM (scope: account)
See the SCIM provisioning limits.
Policies
Honor Retry-After
When the API returns 429, wait the duration specified in Retry-After before retrying.
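A minimal sketch of turning a Retry-After value into a wait time, assuming the header follows the standard HTTP forms (a number of seconds or an HTTP-date). The function name `retry_after_seconds` and the 1-second fallback are illustrative choices, not part of any Databricks SDK.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from typing import Optional


def retry_after_seconds(header_value: Optional[str],
                        now: Optional[datetime] = None,
                        default: float = 1.0) -> float:
    """Convert a Retry-After header value into a non-negative wait in seconds.

    `default` is an assumed fallback used when the header is missing or
    cannot be parsed.
    """
    if header_value is None or not header_value.strip():
        return default
    value = header_value.strip()

    # Form 1: a delay in whole seconds, e.g. "30".
    if value.isdigit():
        return float(value)

    # Form 2: an HTTP-date, e.g. "Wed, 21 Oct 2015 07:28:00 GMT".
    try:
        when = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return default
    if when.tzinfo is None:
        when = when.replace(tzinfo=timezone.utc)

    current = now or datetime.now(timezone.utc)
    # A date in the past means the caller may retry immediately.
    return max(0.0, (when - current).total_seconds())
```

A caller would sleep for the returned number of seconds before reissuing the request that received the 429.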
Backoff with jitter
Apply exponential backoff with jitter to avoid thundering herds against the workspace control plane.
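One way this could look is the "full jitter" variant, where each delay is drawn uniformly between zero and an exponentially growing ceiling. The base delay, cap, attempt count, and the `send` callable's return shape below are all assumptions made for this sketch, not documented Databricks values.

```python
import random
import time
from typing import Any, Callable, Optional, Tuple

# Hypothetical shape: send() returns (status_code, retry_after_seconds or None, body).
SendFn = Callable[[], Tuple[int, Optional[float], Any]]


def backoff_delay(attempt: int, base: float = 0.5, cap: float = 30.0,
                  rng: Callable[[], float] = random.random) -> float:
    """Full-jitter delay: a random fraction of min(cap, base * 2**attempt)."""
    ceiling = min(cap, base * (2 ** attempt))
    return rng() * ceiling


def call_with_backoff(send: SendFn, max_attempts: int = 5,
                      sleep: Callable[[float], None] = time.sleep,
                      rng: Callable[[], float] = random.random) -> Tuple[int, Any]:
    """Retry while the response is 429, up to max_attempts calls in total."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, retry_after, body = send()
        # Return on any non-throttled response, or when attempts are exhausted.
        if status != 429 or attempt == max_attempts - 1:
            return status, body
        delay = backoff_delay(attempt, rng=rng)
        # Never wait less than the server asked for via Retry-After.
        if retry_after is not None:
            delay = max(delay, retry_after)
        sleep(delay)
    raise AssertionError("unreachable")
```

Injecting `sleep` and `rng` keeps the retry loop testable without real waiting. The jitter spreads out clients that were throttled at the same moment so they do not all retry together.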
Bulk endpoints over loops
Prefer batch endpoints (e.g. the Jobs runs list endpoint with filters) and Delta Live Tables over polling individual resources in a loop.
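As an illustration of listing instead of polling, the sketch below pages through job runs with one filtered list call per page rather than one request per run. The path `/api/2.1/jobs/runs/list`, the parameter names (`job_id`, `active_only`, `limit`, `page_token`), and the `next_page_token` response field are written from memory of the Jobs API and should be checked against the current API reference. `get_json` is a hypothetical caller-supplied function that performs the authenticated GET.

```python
from typing import Any, Callable, Dict, Iterator, Optional

# Assumed endpoint path; verify the API version available in your workspace.
RUNS_LIST_PATH = "/api/2.1/jobs/runs/list"

# Hypothetical transport: get_json(path, params) -> decoded JSON response body.
GetJson = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def iter_runs(get_json: GetJson, job_id: Optional[int] = None,
              active_only: bool = False, page_size: int = 25) -> Iterator[Dict[str, Any]]:
    """Yield runs page by page using server-side filters."""
    params: Dict[str, Any] = {
        "limit": page_size,  # assumed page-size parameter
        "active_only": str(active_only).lower(),
    }
    if job_id is not None:
        params["job_id"] = job_id

    while True:
        # Pass a copy so the transport cannot mutate the paging state.
        page = get_json(RUNS_LIST_PATH, dict(params))
        yield from page.get("runs", [])
        token = page.get("next_page_token")
        if not token:
            return
        params["page_token"] = token
```

With a filter such as `active_only=True`, a monitoring loop issues a handful of list calls per cycle instead of one status request per run, which keeps request volume well below per-run polling.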
Workspace partitioning
Heavy multi-tenant workloads should be partitioned across multiple workspaces to avoid control-plane throttling.
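A small sketch of one possible partitioning scheme: each tenant is mapped to a workspace by a stable hash, so the same tenant always lands on the same workspace and load spreads roughly evenly. The function name and the workspace identifiers are placeholders invented for this example.

```python
import hashlib
from typing import Sequence


def workspace_for_tenant(tenant_id: str, workspaces: Sequence[str]) -> str:
    """Pick a workspace for a tenant using a stable hash of the tenant ID."""
    if not workspaces:
        raise ValueError("at least one workspace is required")
    # sha256 is stable across processes, unlike Python's built-in hash().
    digest = hashlib.sha256(tenant_id.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(workspaces)
    return workspaces[index]


# Placeholder workspace names, not real hostnames.
WORKSPACES = ["workspace-a", "workspace-b", "workspace-c"]
assigned = workspace_for_tenant("tenant-42", WORKSPACES)
```

Simple modulo hashing reassigns many tenants whenever the workspace list changes. A lookup table or consistent hashing may suit deployments where workspaces are added or removed often.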