QuotaFlow Docs
QuotaFlow provides managed, official-style AI API endpoints for OpenAI-compatible, Anthropic-compatible, and Gemini-compatible clients. Use it when you want one QuotaFlow API key, familiar SDK request shapes, usage tracking, quota protection, and production-ready capacity controls.
When to use this page
Use this page to understand the product surface before configuring an SDK or agent.
Core endpoints
- API base URL:
https://api.quotaflow.ai - OpenAI-compatible base URL:
https://api.quotaflow.ai/openai/v1 - All-in-one OpenAI-compatible base URL:
https://app.quotaflow.ai/openai/v1 - Anthropic-compatible Messages:
https://app.quotaflow.ai/v1/messages - Gemini-compatible generateContent:
https://app.quotaflow.ai/v1beta/models/{model}:generateContent
What QuotaFlow handles
- API key authentication
- Official-style request and response shapes
- Model discovery for your key
- Session continuity for cache-friendly agent traffic
- Capacity protection, bounded queueing, and retry-safe errors
- Usage and cost accounting
What you need
1. A QuotaFlow API key.
2. The API base URL for the protocol you are using.
3. A compatible client, SDK, Codex configuration, Claude-style configuration, Gemini-style configuration, or direct HTTP call.
AI agent note
If you are an AI coding agent, start with /llms.txt, then fetch /llms-full.txt for full context, and use /openapi.yaml for endpoint schemas.
AI agents: start at
/llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.