Rate limits
When to use this page
Use this page to understand capacity protection and customer package behavior.
Limit layers
QuotaFlow can enforce limits at multiple layers:
- Global Codex capacity
- Pool capacity
- API key capacity
- Upstream account pressure
- Upstream provider usage limits
Dynamic pool capacity
QuotaFlow estimates capacity from accounts that are currently usable. Accounts in cooldown or usage-limit reset windows do not count toward active capacity until they recover.
Queue behavior
When capacity is exhausted, eligible requests may wait in a bounded queue. If the queue is full or the wait exceeds the configured safe window, the request fails with a capacity error.
Retry guidance
- Retry
429with exponential backoff and jitter. - Retry
503and timeouts. - Do not retry authentication errors until the key is fixed.
- Keep session ids stable across retries.
AI agents: start at
/llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.