Volcano · Rate Limits
Volcano Rate Limits
Volcano does not expose a vendor-side request-rate limit. As a Kubernetes-native scheduler running in the customer's cluster, "limits" in Volcano are workload-level resource quotas - CPU, memory, GPU, and queue capacity - enforced via the Queue CRD and standard Kubernetes ResourceQuota objects. Consumers of the Volcano API interact with the Kubernetes API server, whose rate-limit and admission-controller behavior governs request throttling.
2 Limits
Open SourceKubernetesRate LimitingQuotas
Limits
Kubernetes API Server (cluster-side) cluster
see kube-apiserver --max-requests-inflight / --max-mutating-requests-inflight
Volcano is consumed via the Kubernetes API; rate enforcement is whatever the cluster's API server is configured with.
Volcano Queue Capacity queue
see Queue CRD spec.capability
Multi-dimensional resource cap (CPU, Memory, GPU) per Queue; enforced by Volcano scheduler, not as a request-rate limit.
Policies
Kubernetes-Native Throttling
All client interaction is via kubectl / client-go against the Kubernetes API server. Use the standard client-go QPS / burst tunables and respect 429 responses from the API server.
Queue-Based Fairness
Volcano enforces fairness through Queues with hierarchical resource inheritance, weight, and capability caps rather than per-second request limits.
Gang Scheduling Admission
Jobs are admitted only when the full PodGroup can be scheduled atomically; partial submissions are gated by the scheduler rather than throttled at the API surface.