openclaw-lighthouse

OpenAI Codex provider cooldown: all models failed (no available auth profile)

Summary

In some sessions, OpenClaw fails before replying when every configured model in the fallback chain depends on openai-codex, and openai-codex profiles are unavailable (cooldown/rate-limit state).

Observed codex-only variants include:

This behaves like a provider/profile availability failure, not a content failure.

Environment

Reproduction

  1. Configure primary + fallback models under the same provider (openai-codex).
  2. Trigger provider rate limiting or profile cooldown state (high request burst can cause this).
  3. Send a normal user prompt.

Expected vs actual

Exact reported errors (redacted variants):

Variant A

⚠️ Agent failed before reply: All models failed (2):
openai-codex/gpt-5.3-codex: No available auth profile for openai-codex (all in cooldown or unavailable). (rate_limit) |
openai-codex/gpt-5.3-codex-spark: Provider openai-codex is in cooldown (all profiles unavailable) (rate_limit).
Logs: openclaw logs --follow

Variant B

⚠️ Agent failed before reply: All models failed (2):
openai-codex/gpt-5.3-codex: LLM request timed out. (unknown) |
openai-codex/gpt-5.3-codex-spark: No available auth profile for openai-codex (all in cooldown or unavailable). (rate_limit).
Logs: openclaw logs --follow

Findings

  1. Core pattern: codex-only fallback can fully fail when profile availability collapses (cooldown/rate-limit).
  2. A first-model timeout can appear before/alongside cooldown errors on downstream candidates.
  3. Fallback is ineffective if all candidates share the same provider and that provider is unhealthy.
  4. This is likely a resilience/config gap (fallback diversity), not a prompt-specific content issue.

Mitigation / Workaround

  1. Add cross-provider fallback (for example one non-openai-codex model).
  2. Reduce bursty traffic and add spacing between heavy runs.
  3. Add/verify additional auth profiles for the provider if available.
  4. Retry after cooldown window.
  5. Capture supporting logs during failure:
    • openclaw logs --follow
    • openclaw status --all

Risk / Impact

Next actions

References