Rate Limiting

Edge and application-level limits that protect the runtime and your LLM budget.

DragonClaw uses two layers of rate limiting: IP-based limits at Nginx and user-aware limits in the gateway.

Nginx can apply IP-based request limits before traffic reaches Node.js.

This is the first line of defense against scraping, brute-force traffic, and accidental bursts.

The gateway applies user-aware limits.
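To make "user-aware" concrete, here is a minimal sketch of a per-user fixed-window limiter. It is not DragonClaw's actual implementation: the class name `UserRateLimiter`, the fixed-window strategy, and the limit and window values are all assumptions chosen for illustration. Only the shape of the rejection body follows the response documented on this page.

```typescript
// Hypothetical sketch: per-user fixed-window rate limiting.
// Names, strategy, and numbers are illustrative, not DragonClaw's defaults.

type LimitResult =
  | { allowed: true }
  | { allowed: false; status: 429; body: { error: string; retryAfterMs: number } };

interface WindowState {
  windowStart: number; // ms timestamp at which the current window opened
  count: number;       // requests counted in the current window
}

class UserRateLimiter {
  private readonly windows = new Map<string, WindowState>();
  private readonly maxRequests: number;
  private readonly windowMs: number;

  constructor(maxRequests: number, windowMs: number) {
    this.maxRequests = maxRequests;
    this.windowMs = windowMs;
  }

  // `now` is injectable so the behavior can be checked without a real clock.
  check(userId: string, now: number = Date.now()): LimitResult {
    const state = this.windows.get(userId);

    // No window yet, or the previous one has expired: open a new window.
    if (state === undefined || now - state.windowStart >= this.windowMs) {
      this.windows.set(userId, { windowStart: now, count: 1 });
      return { allowed: true };
    }

    // Still inside the window and under the limit: count the request.
    if (state.count < this.maxRequests) {
      state.count += 1;
      return { allowed: true };
    }

    // Over the limit: report how long until the window resets.
    return {
      allowed: false,
      status: 429,
      body: {
        error: "Rate limited",
        retryAfterMs: state.windowStart + this.windowMs - now,
      },
    };
  }
}
```

Keying the map by user ID rather than IP address is what separates this layer from the Nginx one: several users behind one shared IP each get their own budget. A production limiter would also need to evict idle entries and, with multiple gateway instances, share state through an external store.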

Default settings:

When a client is limited, DragonClaw returns:

{
  "error": "Rate limited",
  "retryAfterMs": 45000
}

The response is sent with HTTP status 429 Too Many Requests.
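On the client side, the 429 response above can drive a simple wait-and-retry loop. The sketch below assumes that `retryAfterMs` is the number of milliseconds to wait before retrying, which the field name suggests but this page does not state. The helper names (`retryDelayMs`, `requestWithRetry`), the `SimpleResponse` shape, and the fallback delay are hypothetical.

```typescript
// Hypothetical client-side handling of the 429 body shown above.

interface RateLimitBody {
  error: string;
  retryAfterMs: number;
}

// Minimal response shape so the helpers work with any HTTP client.
interface SimpleResponse {
  status: number;
  json: unknown;
}

function isRateLimitBody(value: unknown): value is RateLimitBody {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  return typeof v.error === "string" && typeof v.retryAfterMs === "number";
}

// Returns how long to wait before retrying, or null if the response
// is not a rate-limit response. Falls back to `fallbackMs` when the
// body is missing or malformed.
function retryDelayMs(res: SimpleResponse, fallbackMs: number = 1000): number | null {
  if (res.status !== 429) return null;
  const body = res.json;
  if (isRateLimitBody(body) && body.retryAfterMs >= 0) {
    return body.retryAfterMs;
  }
  return fallbackMs;
}

// Sends a request up to `maxAttempts` times, sleeping between attempts
// for as long as the server asked. `send` is supplied by the caller.
async function requestWithRetry(
  send: () => Promise<SimpleResponse>,
  maxAttempts: number = 3,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<SimpleResponse> {
  let res = await send();
  for (let attempt = 1; attempt < maxAttempts; attempt++) {
    const delay = retryDelayMs(res);
    if (delay === null) return res; // not rate limited: done
    await sleep(delay);
    res = await send();
  }
  return res; // may still be a 429 after the final attempt
}
```

Capping the number of attempts keeps a client from retrying indefinitely against a limit that never clears. The caller still has to handle a final 429.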
