QuotaFlow Docs
QuotaFlow provides an OpenAI-compatible API gateway for Codex-style agents and model clients. Use it when you want one managed endpoint, API key control, usage tracking, quota protection, and upstream account failover.
When to use this page
Use this page to understand the product surface before configuring an SDK or agent.
Core endpoints
- API base URL:
https://api.quotaflow.ai - OpenAI-compatible Responses:
https://api.quotaflow.ai/openai/v1/responses - OpenAI-compatible Chat Completions:
https://api.quotaflow.ai/openai/v1/chat/completions - Model list:
https://api.quotaflow.ai/openai/v1/models
What QuotaFlow handles
- API key authentication
- OpenAI-compatible request shape
- Codex payload adaptation when enabled for your key
- Upstream account scheduling
- Sticky session routing for cache-friendly traffic
- Capacity guard, queueing, cooldown, and failover
- Usage and cost accounting
What you need
1. A QuotaFlow API key.
2. The API base URL.
3. A compatible client, SDK, Codex configuration, or direct HTTP call.
AI agent note
If you are an AI coding agent, start with /llms.txt, then fetch /llms-full.txt for full context, and use /openapi.yaml for endpoint schemas.
AI agents: start at
/llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.