OpenAI
OpenAI provides developer APIs for GPT models, and Codex is also available as a ChatGPT-plan coding agent through OpenAI's Codex clients. OpenClaw keeps those surfaces separate so config stays predictable.
OpenClaw supports three OpenAI-family routes. Most ChatGPT/Codex subscribers who want Codex behavior should use the native Codex app-server runtime. The model prefix selects the provider/model name; a separate runtime setting selects who executes the embedded agent loop:
- API key - direct OpenAI Platform access with usage-based billing (
openai/*models) - Codex subscription with native Codex runtime - ChatGPT/Codex sign-in plus Codex app-server execution (
openai/*models plusagents.defaults.agentRuntime.id: "codex") - Codex subscription through PI - ChatGPT/Codex sign-in with the normal OpenClaw PI runner (
openai-codex/*models)
OpenAI explicitly supports subscription OAuth usage in external tools and workflows like OpenClaw.
Provider, model, runtime, and channel are separate layers. If those labels are getting mixed together, read Agent runtimes before changing config.
Quick choice
| Goal | Use | Notes |
|---|---|---|
| ChatGPT/Codex subscription with native Codex runtime | openai/gpt-5.5 plus agentRuntime.id: "codex" | Recommended Codex setup for most users. Sign in with openai-codex auth. |
| Direct API-key billing | openai/gpt-5.5 | Set OPENAI_API_KEY or run OpenAI API-key onboarding. |
| ChatGPT/Codex subscription auth through PI | openai-codex/gpt-5.5 | Use only when you intentionally want the normal PI runner. |
| Image generation or editing | openai/gpt-image-2 | Works with either OPENAI_API_KEY or OpenAI Codex OAuth. |
| Transparent-background images | openai/gpt-image-1.5 | Use outputFormat=png or webp and openai.background=transparent. |
Naming map
The names are similar but not interchangeable:
| Name you see | Layer | Meaning |
|---|---|---|
openai | Provider prefix | Direct OpenAI Platform API route. |
openai-codex | Provider prefix | OpenAI Codex OAuth/subscription route through the normal OpenClaw PI runner. |
codex plugin | Plugin | Bundled OpenClaw plugin that provides native Codex app-server runtime and /codex chat controls. |
agentRuntime.id: codex | Agent runtime | Force the native Codex app-server harness for embedded turns. |
/codex ... | Chat command set | Bind/control Codex app-server threads from a conversation. |
runtime: "acp", agentId: "codex" | ACP session route | Explicit fallback path that runs Codex through ACP/acpx. |
This means a config can intentionally contain both openai-codex/* and the
codex plugin. That is valid when you want Codex OAuth through PI and also want
native /codex chat controls available. openclaw doctor warns about that
combination so you can confirm it is intentional; it does not rewrite it.
GPT-5.5 is available through both direct OpenAI Platform API-key access and
subscription/OAuth routes. For ChatGPT/Codex subscription plus native Codex
execution, use openai/gpt-5.5 with agentRuntime.id: "codex". Use
openai-codex/gpt-5.5 only for Codex OAuth through PI, or openai/gpt-5.5
without a Codex runtime override for direct OPENAI_API_KEY traffic.
Enabling the OpenAI plugin, or selecting an openai-codex/* model, does not
enable the bundled Codex app-server plugin. OpenClaw enables that plugin only
when you explicitly select the native Codex harness with
agentRuntime.id: "codex" or use a legacy codex/* model ref.
If the bundled codex plugin is enabled but openai-codex/* still resolves
through PI, openclaw doctor warns and leaves the route unchanged.
OpenClaw feature coverage
| OpenAI capability | OpenClaw surface | Status |
|---|---|---|
| Chat / Responses | openai/<model> model provider | Yes |
| Codex subscription models | openai-codex/<model> with openai-codex OAuth | Yes |
| Codex app-server harness | openai/<model> with agentRuntime.id: codex | Yes |
| Server-side web search | Native OpenAI Responses tool | Yes, when web search is enabled and no provider pinned |
| Images | image_generate | Yes |
| Videos | video_generate | Yes |
| Text-to-speech | messages.tts.provider: "openai" / tts | Yes |
| Batch speech-to-text | tools.media.audio / media understanding | Yes |
| Streaming speech-to-text | Voice Call streaming.provider: "openai" | Yes |
| Realtime voice | Voice Call realtime.provider: "openai" / Control UI Talk | Yes |
| Embeddings | memory embedding provider | Yes |
Memory embeddings
OpenClaw can use OpenAI, or an OpenAI-compatible embedding endpoint, for
memory_search indexing and query embeddings:
{
agents: {
defaults: {
memorySearch: {
provider: "openai",
model: "text-embedding-3-small",
},
},
},
}
For OpenAI-compatible endpoints that require asymmetric embedding labels, set
queryInputType and documentInputType under memorySearch. OpenClaw forwards
those as provider-specific input_type request fields: query embeddings use
queryInputType; indexed memory chunks and batch indexing use
documentInputType. See the Memory configuration reference for the full example.
Getting started
Choose your preferred auth method and follow the setup steps.
Best for: direct API access and usage-based billing.
- Get your API key
Create or copy an API key from the OpenAI Platform dashboard.
- Run onboarding
openclaw onboard --auth-choice openai-api-keyOr pass the key directly:
openclaw onboard --openai-api-key "$OPENAI_API_KEY" - Verify the model is available
openclaw models list --provider openai
Route summary
| Model ref | Runtime config | Route | Auth |
|---|---|---|---|
openai/gpt-5.5 | omitted / agentRuntime.id: "pi" | Direct OpenAI Platform API | OPENAI_API_KEY |
openai/gpt-5.4-mini | omitted / agentRuntime.id: "pi" | Direct OpenAI Platform API | OPENAI_API_KEY |
openai/gpt-5.5 | agentRuntime.id: "codex" | Codex app-server harness | Codex app-server |
openai/* is the direct OpenAI API-key route unless you explicitly force
the Codex app-server harness. Use openai-codex/* for Codex OAuth through
the default PI runner, or use openai/gpt-5.5 with
agentRuntime.id: "codex" for native Codex app-server execution.
Config example
{
env: { OPENAI_API_KEY: "sk-..." },
agents: { defaults: { model: { primary: "openai/gpt-5.5" } } },
}
OpenClaw does not expose openai/gpt-5.3-codex-spark. Live OpenAI API requests reject that model, and the current Codex catalog does not expose it either.
Best for: using your ChatGPT/Codex subscription with native Codex app-server execution instead of a separate API key. Codex cloud requires ChatGPT sign-in.
- Run Codex OAuth
openclaw onboard --auth-choice openai-codexOr run OAuth directly:
openclaw models auth login --provider openai-codexFor headless or callback-hostile setups, add
--device-codeto sign in with a ChatGPT device-code flow instead of the localhost browser callback:openclaw models auth login --provider openai-codex --device-code - Use the native Codex runtime
openclaw config set plugins.entries.codex '{"enabled":true}' --strict-json --merge openclaw config set agents.defaults.model.primary openai/gpt-5.5 openclaw config set agents.defaults.agentRuntime '{"id":"codex"}' --strict-json - Verify Codex auth is available
openclaw models list --provider openai-codexAfter the gateway is running, send
/codex statusor/codex modelsin chat to verify the native app-server runtime.
Route summary
| Model ref | Runtime config | Route | Auth |
|---|---|---|---|
openai/gpt-5.5 | agentRuntime.id: "codex" | Native Codex app-server harness | Codex sign-in or selected openai-codex profile |
openai-codex/gpt-5.5 | omitted / runtime: "pi" | ChatGPT/Codex OAuth through PI | Codex sign-in |
openai-codex/gpt-5.4-mini | omitted / runtime: "pi" | ChatGPT/Codex OAuth through PI | Codex sign-in |
openai-codex/gpt-5.5 | runtime: "auto" | Still PI unless a plugin explicitly claims openai-codex | Codex sign-in |
Do not configure older openai-codex/gpt-5.1*, openai-codex/gpt-5.2*, or
openai-codex/gpt-5.3* model refs. ChatGPT/Codex OAuth accounts now reject
those models. Use openai-codex/gpt-5.5 for the PI OAuth route, or
openai/gpt-5.5 with agentRuntime.id: "codex" for native Codex runtime
execution.
Keep using the openai-codex provider id for auth/profile commands. The
openai-codex/* model prefix is also the explicit PI route for Codex OAuth.
It does not select or auto-enable the bundled Codex app-server harness. For
the common subscription plus native runtime setup, sign in with
openai-codex but keep the model ref as openai/gpt-5.5 and set
agentRuntime.id: "codex".
Config example
{
plugins: { entries: { codex: { enabled: true } } },
agents: {
defaults: {
model: { primary: "openai/gpt-5.5" },
agentRuntime: { id: "codex" },
},
},
}
To keep Codex OAuth on the normal PI runner instead, use
openai-codex/gpt-5.5 and omit the Codex runtime override.
Onboarding no longer imports OAuth material from ~/.codex. Sign in with browser OAuth (default) or the device-code flow above — OpenClaw manages the resulting credentials in its own agent auth store.
Check and recover Codex OAuth routing
Use these commands to see which model, runtime, and auth route your default agent is using:
openclaw models status
openclaw models auth list --provider openai-codex
openclaw config get agents.defaults.model --json
openclaw config get agents.defaults.agentRuntime --json
For a specific agent, add --agent <id>:
openclaw models status --agent <id>
openclaw models auth list --agent <id> --provider openai-codex
If a 2026.5.5 doctor --fix run changed a GPT-5.5 subscription setup from
openai-codex/gpt-5.5 to openai/gpt-5.5, switch the default agent back
to the Codex OAuth PI route:
openclaw models set openai-codex/gpt-5.5
openclaw config validate
If models auth list --provider openai-codex shows no usable profile, sign
in again:
openclaw models auth login --provider openai-codex
openclaw models status --probe --probe-provider openai-codex
openai-codex/* means ChatGPT/Codex OAuth through PI. openai/* with
agentRuntime.id: "codex" means native Codex app-server execution.
Status indicator
Chat /status shows which model runtime is active for the current session.
The default PI harness appears as Runtime: OpenClaw Pi Default. When the
bundled Codex app-server harness is selected, /status shows
Runtime: OpenAI Codex. Existing sessions keep their recorded harness id, so use
/new or /reset after changing agentRuntime if you want /status to
reflect a new PI/Codex choice.
Doctor warning
If the bundled codex plugin is enabled while an openai-codex/* route is
selected, openclaw doctor warns that the model still resolves through PI.
Keep the config unchanged only when that PI subscription-auth route is
intentional. Switch to openai/<model> plus agentRuntime.id: "codex" when
you want native Codex app-server execution.
Context window cap
OpenClaw treats model metadata and the runtime context cap as separate values.
For openai-codex/gpt-5.5 through Codex OAuth:
- Native
contextWindow:1000000 - Default runtime
contextTokenscap:272000
The smaller default cap has better latency and quality characteristics in practice. Override it with contextTokens:
{
models: {
providers: {
"openai-codex": {
models: [{ id: "gpt-5.5", contextTokens: 160000 }],
},
},
},
}
Use contextWindow to declare native model metadata. Use contextTokens to limit the runtime context budget.
Catalog recovery
OpenClaw uses upstream Codex catalog metadata for gpt-5.5 when it is
present. If live Codex discovery omits the openai-codex/gpt-5.5 row while
the account is authenticated, OpenClaw synthesizes that OAuth model row so
cron, sub-agent, and configured default-model runs do not fail with
Unknown model.
Native Codex app-server auth
The native Codex app-server harness uses openai/* model refs plus
agentRuntime.id: "codex", but its auth is still account-based. OpenClaw
selects auth in this order:
- An explicit OpenClaw
openai-codexauth profile bound to the agent. - The app-server's existing account, such as a local Codex CLI ChatGPT sign-in.
- For local stdio app-server launches only,
CODEX_API_KEY, thenOPENAI_API_KEY, when the app-server reports no account and still requires OpenAI auth.
That means a local ChatGPT/Codex subscription sign-in is not replaced just
because the gateway process also has OPENAI_API_KEY for direct OpenAI models
or embeddings. Env API-key fallback is only the local stdio no-account path; it
is not sent to WebSocket app-server connections. When a subscription-style Codex
profile is selected, OpenClaw also keeps CODEX_API_KEY and OPENAI_API_KEY
out of the spawned stdio app-server child and sends the selected credentials
through the app-server login RPC.
Image generation
The bundled openai plugin registers image generation through the image_generate tool.
It supports both OpenAI API-key image generation and Codex OAuth image
generation through the same openai/gpt-image-2 model ref.
| Capability | OpenAI API key | Codex OAuth |
|---|---|---|
| Model ref | openai/gpt-image-2 | openai/gpt-image-2 |
| Auth | OPENAI_API_KEY | OpenAI Codex OAuth sign-in |
| Transport | OpenAI Images API | Codex Responses backend |
| Max images per request | 4 | 4 |
| Edit mode | Enabled (up to 5 reference images) | Enabled (up to 5 reference images) |
| Size overrides | Supported, including 2K/4K sizes | Supported, including 2K/4K sizes |
| Aspect ratio / resolution | Not forwarded to OpenAI Images API | Mapped to a supported size when safe |
{
agents: {
defaults: {
imageGenerationModel: { primary: "openai/gpt-image-2" },
},
},
}
See Image Generation for shared tool parameters, provider selection, and failover behavior.
gpt-image-2 is the default for both OpenAI text-to-image generation and image
editing. gpt-image-1.5, gpt-image-1, and gpt-image-1-mini remain usable as
explicit model overrides. Use openai/gpt-image-1.5 for transparent-background
PNG/WebP output; the current gpt-image-2 API rejects
background: "transparent".
For a transparent-background request, agents should call image_generate with
model: "openai/gpt-image-1.5", outputFormat: "png" or "webp", and
background: "transparent"; the older openai.background provider option is
still accepted. OpenClaw also protects the public OpenAI and
OpenAI Codex OAuth routes by rewriting default openai/gpt-image-2 transparent
requests to gpt-image-1.5; Azure and custom OpenAI-compatible endpoints keep
their configured deployment/model names.
The same setting is exposed for headless CLI runs:
openclaw infer image generate \
--model openai/gpt-image-1.5 \
--output-format png \
--background transparent \
--prompt "A simple red circle sticker on a transparent background" \
--json
Use the same --output-format and --background flags with
openclaw infer image edit when starting from an input file.
--openai-background remains available as an OpenAI-specific alias.
For Codex OAuth installs, keep the same openai/gpt-image-2 ref. When an
openai-codex OAuth profile is configured, OpenClaw resolves that stored OAuth
access token and sends image requests through the Codex Responses backend. It
does not first try OPENAI_API_KEY or silently fall back to an API key for that
request. Configure models.providers.openai explicitly with an API key,
custom base URL, or Azure endpoint when you want the direct OpenAI Images API
route instead.
If that custom image endpoint is on a trusted LAN/private address, also set
browser.ssrfPolicy.dangerouslyAllowPrivateNetwork: true; OpenClaw keeps
private/internal OpenAI-compatible image endpoints blocked unless this opt-in is
present.
Generate:
/tool image_generate model=openai/gpt-image-2 prompt="A polished launch poster for OpenClaw on macOS" size=3840x2160 count=1
Generate a transparent PNG:
/tool image_generate model=openai/gpt-image-1.5 prompt="A simple red circle sticker on a transparent background" outputFormat=png background=transparent
Edit:
/tool image_generate model=openai/gpt-image-2 prompt="Preserve the object shape, change the material to translucent glass" image=/path/to/reference.png size=1024x1536
Video generation
The bundled openai plugin registers video generation through the video_generate tool.
| Capability | Value |
|---|---|
| Default model | openai/sora-2 |
| Modes | Text-to-video, image-to-video, single-video edit |
| Reference inputs | 1 image or 1 video |
| Size overrides | Supported |
| Other overrides | aspectRatio, resolution, audio, watermark are ignored with a tool warning |
{
agents: {
defaults: {
videoGenerationModel: { primary: "openai/sora-2" },
},
},
}
See Video Generation for shared tool parameters, provider selection, and failover behavior.
GPT-5 prompt contribution
OpenClaw adds a shared GPT-5 prompt contribution for GPT-5-family runs across providers. It applies by model id, so openai-codex/gpt-5.5, openai/gpt-5.5, openrouter/openai/gpt-5.5, opencode/gpt-5.5, and other compatible GPT-5 refs receive the same overlay. Older GPT-4.x models do not.
The bundled native Codex harness uses the same GPT-5 behavior and heartbeat overlay through Codex app-server developer instructions, so openai/gpt-5.x sessions forced through agentRuntime.id: "codex" keep the same follow-through and proactive heartbeat guidance even though Codex owns the rest of the harness prompt.
The GPT-5 contribution adds a tagged behavior contract for persona persistence, execution safety, tool discipline, output shape, completion checks, and verification. Channel-specific reply and silent-message behavior stays in the shared OpenClaw system prompt and outbound delivery policy. The GPT-5 guidance is always enabled for matching models. The friendly interaction-style layer is separate and configurable.
| Value | Effect |
|---|---|
"friendly" (default) | Enable the friendly interaction-style layer |
"on" | Alias for "friendly" |
"off" | Disable only the friendly style layer |
{
agents: {
defaults: {
promptOverlays: {
gpt5: { personality: "friendly" },
},
},
},
}
openclaw config set agents.defaults.promptOverlays.gpt5.personality off
Values are case-insensitive at runtime, so "Off" and "off" both disable the friendly style layer.
Legacy plugins.entries.openai.config.personality is still read as a compatibility fallback when the shared agents.defaults.promptOverlays.gpt5.personality setting is not set.
Voice and speech
Speech synthesis (TTS)
The bundled openai plugin registers speech synthesis for the messages.tts surface.
| Setting | Config path | Default |
|---|---|---|
| Model | messages.tts.providers.openai.model | gpt-4o-mini-tts |
| Voice | messages.tts.providers.openai.voice | coral |
| Speed | messages.tts.providers.openai.speed | (unset) |
| Instructions | messages.tts.providers.openai.instructions | (unset, gpt-4o-mini-tts only) |
| Format | messages.tts.providers.openai.responseFormat | opus for voice notes, mp3 for files |
| API key | messages.tts.providers.openai.apiKey | Falls back to OPENAI_API_KEY |
| Base URL | messages.tts.providers.openai.baseUrl | https://api.openai.com/v1 |
| Extra body | messages.tts.providers.openai.extraBody / extra_body | (unset) |
Available models: gpt-4o-mini-tts, tts-1, tts-1-hd. Available voices: alloy, ash, ballad, cedar, coral, echo, fable, juniper, marin, onyx, nova, sage, shimmer, verse.
extraBody is merged into /audio/speech request JSON after OpenClaw's generated fields, so use it for OpenAI-compatible endpoints that require additional keys such as lang. Prototype keys are ignored.
{
messages: {
tts: {
providers: {
openai: { model: "gpt-4o-mini-tts", voice: "coral" },
},
},
},
}
Set OPENAI_TTS_BASE_URL to override the TTS base URL without affecting the chat API endpoint.
Speech-to-text
The bundled openai plugin registers batch speech-to-text through
OpenClaw's media-understanding transcription surface.
- Default model:
gpt-4o-transcribe - Endpoint: OpenAI REST
/v1/audio/transcriptions - Input path: multipart audio file upload
- Supported by OpenClaw wherever inbound audio transcription uses
tools.media.audio, including Discord voice-channel segments and channel audio attachments
To force OpenAI for inbound audio transcription:
{
tools: {
media: {
audio: {
models: [
{
type: "provider",
provider: "openai",
model: "gpt-4o-transcribe",
},
],
},
},
},
}
Language and prompt hints are forwarded to OpenAI when supplied by the shared audio media config or per-call transcription request.
Realtime transcription
The bundled openai plugin registers realtime transcription for the Voice Call plugin.
| Setting | Config path | Default |
|---|---|---|
| Model | plugins.entries.voice-call.config.streaming.providers.openai.model | gpt-4o-transcribe |
| Language | ...openai.language | (unset) |
| Prompt | ...openai.prompt | (unset) |
| Silence duration | ...openai.silenceDurationMs | 800 |
| VAD threshold | ...openai.vadThreshold | 0.5 |
| API key | ...openai.apiKey | Falls back to OPENAI_API_KEY |
Uses a WebSocket connection to wss://api.openai.com/v1/realtime with G.711 u-law (g711_ulaw / audio/pcmu) audio. This streaming provider is for Voice Call's realtime transcription path; Discord voice currently records short segments and uses the batch tools.media.audio transcription path instead.
Realtime voice
The bundled openai plugin registers realtime voice for the Voice Call plugin.
| Setting | Config path | Default |
|---|---|---|
| Model | plugins.entries.voice-call.config.realtime.providers.openai.model | gpt-realtime-1.5 |
| Voice | ...openai.voice | alloy |
| Temperature | ...openai.temperature | 0.8 |
| VAD threshold | ...openai.vadThreshold | 0.5 |
| Silence duration | ...openai.silenceDurationMs | 500 |
| API key | ...openai.apiKey | Falls back to OPENAI_API_KEY |
Supports Azure OpenAI via azureEndpoint and azureDeployment config keys for backend realtime bridges. Supports bidirectional tool calling. Uses G.711 u-law audio format.
Control UI Talk uses OpenAI browser realtime sessions with a Gateway-minted
ephemeral client secret and a direct browser WebRTC SDP exchange against the
OpenAI Realtime API. Maintainer live verification is available with
OPENAI_API_KEY=... GEMINI_API_KEY=... node --import tsx scripts/dev/realtime-talk-live-smoke.ts;
the OpenAI leg mints a client secret in Node, generates a browser SDP offer
with fake microphone media, posts it to OpenAI, and applies the SDP answer
without logging secrets.
Azure OpenAI endpoints
The bundled openai provider can target an Azure OpenAI resource for image
generation by overriding the base URL. On the image-generation path, OpenClaw
detects Azure hostnames on models.providers.openai.baseUrl and switches to
Azure's request shape automatically.
Realtime voice uses a separate configuration path
(plugins.entries.voice-call.config.realtime.providers.openai.azureEndpoint)
and is not affected by models.providers.openai.baseUrl. See the Realtime
voice accordion under Voice and speech for its Azure
settings.
Use Azure OpenAI when:
- You already have an Azure OpenAI subscription, quota, or enterprise agreement
- You need regional data residency or compliance controls Azure provides
- You want to keep traffic inside an existing Azure tenancy
Configuration
For Azure image generation through the bundled openai provider, point
models.providers.openai.baseUrl at your Azure resource and set apiKey to
the Azure OpenAI key (not an OpenAI Platform key):
{
models: {
providers: {
openai: {
baseUrl: "https://<your-resource>.openai.azure.com",
apiKey: "<azure-openai-api-key>",
},
},
},
}
OpenClaw recognizes these Azure host suffixes for the Azure image-generation route:
*.openai.azure.com*.services.ai.azure.com*.cognitiveservices.azure.com
For image-generation requests on a recognized Azure host, OpenClaw:
- Sends the
api-keyheader instead ofAuthorization: Bearer - Uses deployment-scoped paths (
/openai/deployments/{deployment}/...) - Appends
?api-version=...to each request - Uses a 600s default request timeout for Azure image-generation calls.
Per-call
timeoutMsvalues still override this default.
Other base URLs (public OpenAI, OpenAI-compatible proxies) keep the standard OpenAI image request shape.
Azure routing for the openai provider's image-generation path requires
OpenClaw 2026.4.22 or later. Earlier versions treat any custom
openai.baseUrl like the public OpenAI endpoint and will fail against Azure
image deployments.
API version
Set AZURE_OPENAI_API_VERSION to pin a specific Azure preview or GA version
for the Azure image-generation path:
export AZURE_OPENAI_API_VERSION="2024-12-01-preview"
The default is 2024-12-01-preview when the variable is unset.
Model names are deployment names
Azure OpenAI binds models to deployments. For Azure image-generation requests
routed through the bundled openai provider, the model field in OpenClaw
must be the Azure deployment name you configured in the Azure portal, not
the public OpenAI model id.
If you create a deployment called gpt-image-2-prod that serves gpt-image-2:
/tool image_generate model=openai/gpt-image-2-prod prompt="A clean poster" size=1024x1024 count=1
The same deployment-name rule applies to image-generation calls routed through
the bundled openai provider.
Regional availability
Azure image generation is currently available only in a subset of regions
(for example eastus2, swedencentral, polandcentral, westus3,
uaenorth). Check Microsoft's current region list before creating a
deployment, and confirm the specific model is offered in your region.
Parameter differences
Azure OpenAI and public OpenAI do not always accept the same image parameters.
Azure may reject options that public OpenAI allows (for example certain
background values on gpt-image-2) or expose them only on specific model
versions. These differences come from Azure and the underlying model, not
OpenClaw. If an Azure request fails with a validation error, check the
parameter set supported by your specific deployment and API version in the
Azure portal.
Azure OpenAI uses native transport and compat behavior but does not receive OpenClaw's hidden attribution headers — see the Native vs OpenAI-compatible routes accordion under Advanced configuration.
For chat or Responses traffic on Azure (beyond image generation), use the
onboarding flow or a dedicated Azure provider config — openai.baseUrl alone
does not pick up the Azure API/auth shape. A separate
azure-openai-responses/* provider exists; see
the Server-side compaction accordion below.
Advanced configuration
Transport (WebSocket vs SSE)
OpenClaw uses WebSocket-first with SSE fallback ("auto") for both openai/* and openai-codex/*.
In "auto" mode, OpenClaw:
- Retries one early WebSocket failure before falling back to SSE
- After a failure, marks WebSocket as degraded for ~60 seconds and uses SSE during cool-down
- Attaches stable session and turn identity headers for retries and reconnects
- Normalizes usage counters (
input_tokens/prompt_tokens) across transport variants
| Value | Behavior |
|---|---|
"auto" (default) | WebSocket first, SSE fallback |
"sse" | Force SSE only |
"websocket" | Force WebSocket only |
{
agents: {
defaults: {
models: {
"openai/gpt-5.5": {
params: { transport: "auto" },
},
"openai-codex/gpt-5.5": {
params: { transport: "auto" },
},
},
},
},
}
Related OpenAI docs:
WebSocket warm-up
OpenClaw enables WebSocket warm-up by default for openai/* and openai-codex/* to reduce first-turn latency.
// Disable warm-up
{
agents: {
defaults: {
models: {
"openai/gpt-5.5": {
params: { openaiWsWarmup: false },
},
},
},
},
}
Fast mode
OpenClaw exposes a shared fast-mode toggle for openai/* and openai-codex/*:
- Chat/UI:
/fast status|on|off - Config:
agents.defaults.models["<provider>/<model>"].params.fastMode
When enabled, OpenClaw maps fast mode to OpenAI priority processing (service_tier = "priority"). Existing service_tier values are preserved, and fast mode does not rewrite reasoning or text.verbosity.
{
agents: {
defaults: {
models: {
"openai/gpt-5.5": { params: { fastMode: true } },
},
},
},
}
Session overrides win over config. Clearing the session override in the Sessions UI returns the session to the configured default.
Priority processing (service_tier)
OpenAI's API exposes priority processing via service_tier. Set it per model in OpenClaw:
{
agents: {
defaults: {
models: {
"openai/gpt-5.5": { params: { serviceTier: "priority" } },
},
},
},
}
Supported values: auto, default, flex, priority.
serviceTier is only forwarded to native OpenAI endpoints (api.openai.com) and native Codex endpoints (chatgpt.com/backend-api). If you route either provider through a proxy, OpenClaw leaves service_tier untouched.
Server-side compaction (Responses API)
For direct OpenAI Responses models (openai/* on api.openai.com), the OpenAI plugin's Pi-harness stream wrapper auto-enables server-side compaction:
- Forces
store: true(unless model compat setssupportsStore: false) - Injects
context_management: [{ type: "compaction", compact_threshold: ... }] - Default
compact_threshold: 70% ofcontextWindow(or80000when unavailable)
This applies to the built-in Pi harness path and to OpenAI provider hooks used by embedded runs. The native Codex app-server harness manages its own context through Codex and is configured separately with agents.defaults.agentRuntime.id.
Useful for compatible endpoints like Azure OpenAI Responses:
{
agents: {
defaults: {
models: {
"azure-openai-responses/gpt-5.5": {
params: { responsesServerCompaction: true },
},
},
},
},
}
{
agents: {
defaults: {
models: {
"openai/gpt-5.5": {
params: {
responsesServerCompaction: true,
responsesCompactThreshold: 120000,
},
},
},
},
},
}
{
agents: {
defaults: {
models: {
"openai/gpt-5.5": {
params: { responsesServerCompaction: false },
},
},
},
},
}
responsesServerCompaction only controls context_management injection. Direct OpenAI Responses models still force store: true unless compat sets supportsStore: false.
Strict-agentic GPT mode
For GPT-5-family runs on openai/*, OpenClaw can use a stricter embedded execution contract:
{
agents: {
defaults: {
embeddedPi: { executionContract: "strict-agentic" },
},
},
}
With strict-agentic, OpenClaw:
- No longer treats a plan-only turn as successful progress when a tool action is available
- Retries the turn with an act-now steer
- Auto-enables
update_planfor substantial work - Surfaces an explicit blocked state if the model keeps planning without acting
Scoped to OpenAI and Codex GPT-5-family runs only. Other providers and older model families keep default behavior.
Native vs OpenAI-compatible routes
OpenClaw treats direct OpenAI, Codex, and Azure OpenAI endpoints differently from generic OpenAI-compatible /v1 proxies:
Native routes (openai/*, Azure OpenAI):
- Keep
reasoning: { effort: "none" }only for models that support the OpenAInoneeffort - Omit disabled reasoning for models or proxies that reject
reasoning.effort: "none" - Default tool schemas to strict mode
- Attach hidden attribution headers on verified native hosts only
- Keep OpenAI-only request shaping (
service_tier,store, reasoning-compat, prompt-cache hints)
Proxy/compatible routes:
- Use looser compat behavior
- Strip Completions
storefrom non-nativeopenai-completionspayloads - Accept advanced
params.extra_body/params.extraBodypass-through JSON for OpenAI-compatible Completions proxies - Accept
params.chat_template_kwargsfor OpenAI-compatible Completions proxies such as vLLM - Do not force strict tool schemas or native-only headers
Azure OpenAI uses native transport and compat behavior but does not receive the hidden attribution headers.