Salesforce Einstein · Rate Limits

Salesforce Einstein Rate Limits

Salesforce Einstein consumption is limited at two layers. Calls into the core platform APIs inherit the per-org 24-hour API request allowance scaled by edition + license count. Generative Einstein features (Copilot, Prompt Builder, Trust Layer) additionally meter against per-org / per-user generative request budgets defined per add-on SKU. Exact published numbers were not retrievable in this run; this artifact captures the model and points to Salesforce's published limits.

3 Limits Throttle: 429

CRMAISalesforceRate Limiting

Limits

Per-org 24-hour API request allowance org

requests_per_day

see Salesforce App Limits Cheatsheet (varies by edition + license count)

Einstein API calls into the core platform consume the same 24-hour bucket as REST / Bulk / Tooling.

Einstein generative request quota org

requests_per_window

see Einstein add-on documentation (per-org / per-user budgets)

Copilot, Prompt Builder, and Trust Layer features have additional generative request budgets defined per add-on SKU.

Einstein Vision / Language predictions account

predictions_per_month

see Einstein Platform Services documentation

Legacy Einstein Vision / Language APIs document monthly prediction allowances per plan.

Policies

Edition + license scaling

Core API allowance scales with edition and license count.

Add-on quotas

Generative Einstein features add their own quotas on top of the core allowance; exhaustion affects only generative endpoints.

Trust Layer governance

The Einstein Trust Layer enforces masking, audit, and zero-retention policies that affect throughput characteristics of generative calls.

Sources

https://developer.salesforce.com/docs/atlas.en-us.salesforce_app_limits_cheatsheet.meta/salesforce_app_limits_cheatsheet/salesforce_app_limits_platform_api.htm