Responses API
When to use this page
Use this page for Codex-style agent calls, tool-capable workloads, and modern OpenAI-compatible clients.
Endpoint
POST https://api.quotaflow.ai/openai/v1/responses
Minimal request
{
  "model": "gpt-5.5",
  "input": "Write a concise deployment checklist.",
  "stream": false
}
Streaming request
{
  "model": "gpt-5.5",
  "input": "Explain this repository structure.",
  "stream": true
}
Curl example
curl https://api.quotaflow.ai/openai/v1/responses \
  -H "Authorization: Bearer $QUOTAFLOW_API_KEY" \
  -H "Content-Type: application/json" \
  -H "x-session-id: repo-agent-123" \
  -d '{
    "model": "gpt-5.5",
    "input": "Summarize the change in one paragraph.",
    "stream": false
  }'
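The same call can be sketched in Python with only the standard library. The helper names `build_responses_request` and `send_request` are hypothetical; the endpoint, headers, and body fields mirror the curl example. The request is built separately from sending so it can be inspected or logged first.

```python
import json
import urllib.request

# Endpoint as documented on this page.
RESPONSES_URL = "https://api.quotaflow.ai/openai/v1/responses"


def build_responses_request(prompt, api_key, session_id=None, stream=False):
    """Build (but do not send) a POST request for the Responses endpoint.

    Hypothetical helper: the name and signature are illustrative only.
    """
    body = {"model": "gpt-5.5", "input": prompt, "stream": stream}
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    if session_id is not None:
        # Same header the curl example sends.
        headers["x-session-id"] = session_id
    return urllib.request.Request(
        RESPONSES_URL,
        data=json.dumps(body).encode("utf-8"),
        headers=headers,
        method="POST",
    )


def send_request(request):
    """Send a non-streaming request and return the decoded JSON body.

    Requires network access and a valid API key.
    """
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

To run it, read the key from the `QUOTAFLOW_API_KEY` environment variable, pass it to `build_responses_request` together with a prompt, and hand the result to `send_request`.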
Codex adaptation
If Codex adaptation is enabled for your API key, QuotaFlow normalizes compatible requests for Codex-style upstream execution. You can usually keep the OpenAI-compatible request shape.
Response shape
QuotaFlow returns OpenAI-compatible JSON for non-streaming calls and server-sent events for streaming calls.
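For streaming calls, a client has to split the server-sent event stream into individual payloads. The sketch below follows the general SSE wire format (`data:` lines, with a blank line ending each event). The function name `iter_sse_data` is hypothetical. It also assumes that each payload is JSON and that a `[DONE]` sentinel may end the stream; both follow common OpenAI-compatible conventions and are not confirmed by this page.

```python
import json


def iter_sse_data(lines):
    """Yield the decoded JSON payload of each server-sent event.

    `lines` is any iterable of text lines, for example a decoded
    HTTP response body read line by line.
    """
    data_parts = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith(":"):
            continue  # SSE comment line, often used as a keep-alive
        if line == "":
            # A blank line dispatches the event collected so far.
            if data_parts:
                payload = "\n".join(data_parts)
                data_parts = []
                if payload == "[DONE]":
                    return  # assumed end-of-stream sentinel
                yield json.loads(payload)
            continue
        field, _, value = line.partition(":")
        if field == "data":
            # A single leading space after the colon is not part of the value.
            data_parts.append(value[1:] if value.startswith(" ") else value)
        # Other fields (event, id, retry) are ignored in this sketch.
```

An event that is not followed by a blank line before the stream ends is dropped, which matches how SSE treats incomplete events.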
AI agents: start at /llms.txt, fetch /llms-full.txt for full context, and parse /openapi.yaml for endpoint schemas.