QuotaFlow Docs

Rate limits

When to use this page

Use this page to understand how QuotaFlow protects capacity and how customer packages behave under those limits.

Limit layers

QuotaFlow can enforce limits at multiple layers:

Dynamic pool capacity

QuotaFlow estimates capacity from accounts that are currently usable. Accounts in cooldown or usage-limit reset windows do not count toward active capacity until they recover.
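As a rough illustration of the rule above, the sketch below counts capacity only from accounts that are currently usable. The names AccountState, Account, slots, and active_capacity are hypothetical and are not part of QuotaFlow's API. The per-account slots figure is also an assumption made for the example.

```python
from dataclasses import dataclass
from enum import Enum


class AccountState(Enum):
    """Hypothetical account states, named after the conditions described above."""
    USABLE = "usable"
    COOLDOWN = "cooldown"
    USAGE_LIMIT_RESET = "usage_limit_reset"


@dataclass
class Account:
    account_id: str
    state: AccountState
    slots: int  # assumed per-account capacity; the real unit is not documented here


def active_capacity(accounts):
    """Sum capacity over usable accounts only.

    Accounts in cooldown or in a usage-limit reset window contribute nothing
    until their state returns to USABLE.
    """
    return sum(a.slots for a in accounts if a.state is AccountState.USABLE)


pool = [
    Account("acct-1", AccountState.USABLE, slots=4),
    Account("acct-2", AccountState.COOLDOWN, slots=4),
    Account("acct-3", AccountState.USAGE_LIMIT_RESET, slots=2),
    Account("acct-4", AccountState.USABLE, slots=3),
]
print(active_capacity(pool))  # 7: only acct-1 and acct-4 count
```

Because the estimate is recomputed from current account states, capacity rises again when an account recovers, with no manual adjustment.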

Queue behavior

When capacity is exhausted, eligible requests may wait in a bounded queue. If the queue is full or the wait exceeds the configured safe window, the request fails with a capacity error.
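The queue behavior described above can be sketched as a bounded queue with a maximum wait. This is a minimal model under stated assumptions, not QuotaFlow's implementation. BoundedWaitQueue, CapacityError, max_size, and max_wait_seconds are hypothetical names, and the actual error type and configuration keys may differ.

```python
import time
from collections import deque


class CapacityError(Exception):
    """Hypothetical stand-in for the capacity error returned to the caller."""


class BoundedWaitQueue:
    def __init__(self, max_size, max_wait_seconds, clock=time.monotonic):
        self.max_size = max_size                  # assumed queue bound
        self.max_wait_seconds = max_wait_seconds  # assumed "safe window"
        self._clock = clock
        self._waiting = deque()                   # (request_id, enqueued_at)

    def enqueue(self, request_id):
        """Admit a request, or fail immediately if the queue is full."""
        if len(self._waiting) >= self.max_size:
            raise CapacityError("queue is full")
        self._waiting.append((request_id, self._clock()))

    def next_ready(self):
        """Return (request_id, expired_ids) once capacity frees up.

        Requests that waited longer than the safe window are dropped and
        reported in expired_ids; the caller fails them with a capacity error.
        request_id is None when no request is still within the window.
        """
        expired = []
        while self._waiting:
            request_id, enqueued_at = self._waiting.popleft()
            if self._clock() - enqueued_at > self.max_wait_seconds:
                expired.append(request_id)
                continue
            return request_id, expired
        return None, expired
```

A caller would invoke enqueue when no capacity is available and next_ready whenever an account becomes usable. The clock is injectable so the timeout path can be exercised without sleeping.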

Retry guidance

AI agents: start at /llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.