Quickstart

When to use this page

Use this page when you already have a QuotaFlow API key and want the fastest working request.

1. Set environment variables

export QUOTAFLOW_API_KEY="qf_your_key_here"
export OPENAI_BASE_URL="https://api.quotaflow.ai/openai/v1"

2. List models

curl https://api.quotaflow.ai/openai/v1/models \
  -H "Authorization: Bearer $QUOTAFLOW_API_KEY"

Expected result shape. Use the ids returned by your key; current prod verified ids are gpt-5.2, gpt-5.3-codex, gpt-5.4, gpt-5.5, and gpt-4o-mini-transcribe.

{
  "object": "list",
  "data": [
    { "id": "gpt-5.2", "object": "model" },
    { "id": "gpt-5.3-codex", "object": "model" },
    { "id": "gpt-5.4", "object": "model" },
    { "id": "gpt-5.5", "object": "model" },
    { "id": "gpt-4o-mini-transcribe", "object": "model" }
  ]
}

3. Create a response

curl https://api.quotaflow.ai/openai/v1/responses \
  -H "Authorization: Bearer $QUOTAFLOW_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "input": "Reply with only: QuotaFlow is connected.",
    "stream": false
  }'

4. Add a sticky session id

For agent workflows, include a stable session id so QuotaFlow can keep related calls on the same upstream account when possible.

curl https://api.quotaflow.ai/openai/v1/responses \
  -H "Authorization: Bearer $QUOTAFLOW_API_KEY" \
  -H "Content-Type: application/json" \
  -H "x-session-id: customer-or-agent-session-123" \
  -d '{
    "model": "gpt-5.5",
    "input": "Continue this coding task.",
    "stream": true
  }'

Production checklist

Store the key in a secret manager or environment variable.
Do not expose the key in browser code.
Use a stable session id for long-running agents.
Implement retry for 429, 503, and network timeouts.
Monitor your usage in the QuotaFlow dashboard.

AI agents: start at /llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.