Overview
The Yala API applies rate limits so that usage stays fair and stable for all customers. If you exceed a limit, the API returns HTTP 429 Too Many Requests, and you should back off before retrying.
Limits
| Limit type | Value | Description |
|---|---|---|
| Rate | 5 requests per second | Maximum sustained request rate. |
| Quota | 10,000 requests per month | Total number of requests allowed per calendar month for your API key. |
Limits are applied per API key (and may be per tenant in production). Different environments (e.g. sandbox vs production) may have separate limits.
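The two limits interact: a client running continuously at the full rate would use up the monthly quota quickly. The short calculation below illustrates this with the values from the table. The 30-day month and the function names are assumptions made for the example, not part of the API.

```python
RATE_LIMIT_PER_SECOND = 5   # from the Limits table
MONTHLY_QUOTA = 10_000      # from the Limits table
DAYS_IN_MONTH = 30          # assumption: a 30-day month, for illustration only


def seconds_to_exhaust_quota(rate_per_second=RATE_LIMIT_PER_SECOND,
                             quota=MONTHLY_QUOTA):
    """Seconds until the monthly quota is used up at a constant request rate."""
    return quota / rate_per_second


def average_daily_budget(quota=MONTHLY_QUOTA, days=DAYS_IN_MONTH):
    """Requests per day if the quota is spread evenly across the month."""
    return quota / days


print(seconds_to_exhaust_quota())     # 2000.0 seconds, roughly 33 minutes
print(round(average_daily_budget()))  # 333 requests per day
```

In other words, the rate limit caps short bursts, while the quota is what constrains steady, month-long usage.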
When you are rate limited
- The API responds with HTTP 429 and a body that may include a Retry-After hint or a message about rate limiting.
- Recommended behavior: Use exponential backoff (e.g. 1s, 2s, 4s) and optional jitter before retrying. Do not retry immediately in a tight loop.
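The recommended retry behavior could be sketched roughly as follows. This is a minimal illustration, not an official client: `send` is a hypothetical callable standing in for your own HTTP request, assumed to return the status code and the Retry-After hint in seconds (or `None` if absent). The 30-second cap, the jitter range, and the default of five retries are also assumptions.

```python
import random
import time


def backoff_delay(attempt, base=1.0, cap=30.0, jitter=0.5):
    """Delay before retry number `attempt` (0-based): 1s, 2s, 4s, ... plus jitter."""
    delay = min(cap, base * (2 ** attempt))
    # Random jitter keeps many clients from retrying at the same instant.
    return delay + random.uniform(0, jitter)


def call_with_backoff(send, max_retries=5, sleep=time.sleep):
    """Call `send()` until it returns a non-429 status or retries run out.

    `send` is a hypothetical callable returning a tuple
    (status_code, retry_after_seconds_or_None).
    """
    for attempt in range(max_retries + 1):
        status, retry_after = send()
        if status != 429:
            return status
        if attempt == max_retries:
            break
        # Prefer the server's hint when present; otherwise back off exponentially.
        delay = retry_after if retry_after is not None else backoff_delay(attempt)
        sleep(delay)
    raise RuntimeError("still rate limited after retries")
```

Passing `sleep` as a parameter makes the function easy to test without real waiting, and the bounded retry count avoids looping forever if the limit does not clear.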
Best practices
- Throttle client-side: Keep your request rate at or below 5 requests per second in normal operation.
- Monitor quota: Track your monthly request count so you stay under 10,000 requests per month, or contact developers@useyala.com to discuss higher limits.
- Cache when possible: Where the API allows it, reuse cached data (e.g. pairs, methods) to reduce unnecessary calls.
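As one possible way to apply the throttling and caching practices above, the sketch below spaces calls at least 0.2 seconds apart (5 per second) and reuses fetched values for a fixed time. The class names, the 300-second time-to-live, and the `fetch` callable are illustrative assumptions; how long data such as pairs or methods can safely be cached depends on the API.

```python
import time


class Throttle:
    """Spaces calls at least 1/rate seconds apart (5 per second -> 0.2 s)."""

    def __init__(self, rate_per_second=5, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / rate_per_second
        self.clock = clock
        self.sleep = sleep
        self.next_allowed = 0.0

    def wait(self):
        """Block until the next request is allowed, then reserve the slot."""
        now = self.clock()
        if now < self.next_allowed:
            self.sleep(self.next_allowed - now)
            now = self.next_allowed
        self.next_allowed = now + self.min_interval


class TTLCache:
    """Keeps fetched values for `ttl_seconds` to avoid repeated API calls."""

    def __init__(self, ttl_seconds=300, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._entries = {}

    def get_or_fetch(self, key, fetch):
        """Return the cached value for `key`, calling `fetch()` only when stale."""
        now = self.clock()
        hit = self._entries.get(key)
        if hit is not None and now - hit[0] < self.ttl:
            return hit[1]
        value = fetch()
        self._entries[key] = (now, value)
        return value
```

A client would call `throttle.wait()` immediately before each request and wrap rarely changing lookups in `cache.get_or_fetch(...)`, so that cached reads do not count against the rate limit or the monthly quota.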