Skip to content

Auto AI Router

Troubleshooting

MiXaiLL76/auto_ai_router

Troubleshooting

Rate Limit Behavior

The router uses two-level rate limiting:

Credential level — RPM (requests per minute) and TPM (tokens per minute) per API key
Model level — additional limits for specific (credential + model) pairs

When a limit is reached:

Router tries another credential for the same model (round-robin)
If no credentials are available, returns 429 Too Many Requests
If fallback proxies are configured, routes to them automatically

Check Current Usage

curl http://localhost:8080/health | jq '.credentials'

Common HTTP Errors

503 Service Unavailable

All credentials have exhausted their rate limits
All fallback proxies are unavailable
Fix: increase RPM/TPM limits, add more credentials, or wait for the next minute reset

429 Too Many Requests

Current credential hit its TPM limit
No alternative credentials available for the model
Fix: add additional credentials for the same model, or increase TPM limits

401 / 403 Unauthorized

Invalid API key in the request
Invalid master key configuration
API key revoked by the provider
Fix: check your config, update the API key

Fallback Behavior

Fallback proxies (is_fallback: true) activate when:

Primary credentials exhaust their RPM/TPM limits
Primary providers return errors (401, 403, 429, 500, 502, 503, 504)
Network errors or timeouts occur

Fallback Chain

Request sent to primary credential
Primary fails → try fallback proxy
Fallback proxy handles the request with its own credential pool
If fallback is also unavailable → 503 Service Unavailable

Debug Logging

Enable debug logging to see detailed request routing:

server:
  logging_level: debug

./auto_ai_router -config config.yaml