To ensure the stability and performance of our ultra-realistic TTS engine, Sonna.ai enforces rate limits on all API requests.

Standard Limits

Rate limits are applied per API Key and per IP address.
PlanRequests per Minute (RPM)Concurrent Generations
Pro60 RPM5
MaxCustomCustom

Rate Limit Headers

Every response includes headers to help you track your current usage:
  • X-RateLimit-Limit: The maximum number of requests allowed in a window.
  • X-RateLimit-Remaining: The number of requests left in the current window.
  • X-RateLimit-Reset: The time at which the current rate limit window resets (in UTC epoch seconds).

Handling Rate Limits

If you exceed the rate limit, the API will return a 429 Too Many Requests status code.
json
{
  "error": "Too many requests. Please try again in 15 seconds."
}
Recommendation: We recommend implementing a retry mechanism with Exponential Backoff to handle rate limits gracefully.