Quickstart
When to use this page
Use this page when you already have a QuotaFlow API key and want the fastest working request.
1. Set environment variables
export QUOTAFLOW_API_KEY="qf_your_key_here"
export OPENAI_BASE_URL="https://api.quotaflow.ai/openai/v1"
2. List models
curl https://api.quotaflow.ai/openai/v1/models \
-H "Authorization: Bearer $QUOTAFLOW_API_KEY"
Expected result shape. Use the ids returned by your key; current prod verified ids are gpt-5.2, gpt-5.3-codex, gpt-5.4, gpt-5.5, and gpt-4o-mini-transcribe.
{
"object": "list",
"data": [
{ "id": "gpt-5.2", "object": "model" },
{ "id": "gpt-5.3-codex", "object": "model" },
{ "id": "gpt-5.4", "object": "model" },
{ "id": "gpt-5.5", "object": "model" },
{ "id": "gpt-4o-mini-transcribe", "object": "model" }
]
}
3. Create a response
curl https://api.quotaflow.ai/openai/v1/responses \
-H "Authorization: Bearer $QUOTAFLOW_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5.5",
"input": "Reply with only: QuotaFlow is connected.",
"stream": false
}'
4. Add a sticky session id
For agent workflows, include a stable session id so QuotaFlow can keep related calls on the same upstream account when possible.
curl https://api.quotaflow.ai/openai/v1/responses \
-H "Authorization: Bearer $QUOTAFLOW_API_KEY" \
-H "Content-Type: application/json" \
-H "x-session-id: customer-or-agent-session-123" \
-d '{
"model": "gpt-5.5",
"input": "Continue this coding task.",
"stream": true
}'
Production checklist
- Store the key in a secret manager or environment variable.
- Do not expose the key in browser code.
- Use a stable session id for long-running agents.
- Implement retry for
429,503, and network timeouts. - Monitor your usage in the QuotaFlow dashboard.
AI agents: start at
/llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.