OpenAI API error 429 rate limit

OpenAI API Error 429: Rate Limit or Quota Exceeded

TL;DR: OpenAI API error 429 means the API rejected the request because the account, project, model, or organization is over a rate or quota limit.

What this error means

OpenAI API 429 usually means your request is being rate limited, your quota is exhausted, or your project has hit a per-minute token/request cap.

rate_limit mixed

Is it code, key, rate limit, or server-side?

Signal Diagnosis
Provider OpenAI
Error 429
Likely category rate_limit
Likely fault area mixed

Immediate checklist

  1. Check whether the response body says rate_limit_exceeded, insufficient_quota, or billing_hard_limit_reached.
  2. Verify the project and organization attached to the API key.
  3. Look at requests per minute, tokens per minute, and daily/monthly spend limits separately.
  4. Retry only with exponential backoff and jitter; do not loop immediately.

cURL test command

curl https://api.openai.com/v1/models \
  -H "Authorization: Bearer $OPENAI_API_KEY"

Run this from your server or terminal. llmping does not collect API keys and does not send this command for you.

Run diagnosis

Common fixes

  • Reduce concurrency and batch size first; most 429 loops are caused by parallel workers.
  • Use exponential backoff with jitter and respect Retry-After when the header is present.
  • Switch low-priority traffic to a cheaper or smaller model during spikes.
  • Increase project limits or add billing if the error body says quota rather than rate limit.

Related errors

Markdown version