API Documentation


To ensure the stability and performance of our ultra-realistic TTS engine, Sonna.ai enforces rate limits on all API requests.

Standard Limits

Rate limits are applied per API Key and per IP address.

Plan	Requests per Minute (RPM)	Concurrent Generations
Pro	60	5
Max	Custom	Custom
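
One way to stay under the concurrent-generation limit is to cap in-flight requests on the client side. The sketch below is illustrative only: `run_limited` and the `jobs` callables are hypothetical names, not part of any Sonna.ai SDK, and the default of 5 assumes the Pro plan value from the table above.

```python
import asyncio
from typing import Awaitable, Callable, List, TypeVar

T = TypeVar("T")


async def run_limited(
    jobs: List[Callable[[], Awaitable[T]]],
    max_concurrent: int = 5,  # assumed: Pro plan concurrent generations
) -> List[T]:
    """Run all jobs, keeping at most `max_concurrent` in flight at once."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(job: Callable[[], Awaitable[T]]) -> T:
        # Each job waits here until one of the slots is free.
        async with semaphore:
            return await job()

    # gather preserves the order of the input jobs in its results.
    return await asyncio.gather(*(guarded(job) for job in jobs))
```

Each entry in `jobs` would wrap one generation request made with your own HTTP client. This only limits concurrency within a single process; several workers sharing one API key would need to coordinate separately.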

Every response includes headers to help you track your current usage:

X-RateLimit-Limit: The maximum number of requests allowed in a window.
X-RateLimit-Remaining: The number of requests left in the current window.
X-RateLimit-Reset: The time at which the current rate limit window resets (in UTC epoch seconds).
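
As a rough illustration of how these headers might be used, the following sketch computes how long a client should pause before sending its next request. The function name `seconds_until_reset` is hypothetical, and the code assumes the header values arrive as plain numeric strings.

```python
import time
from typing import Mapping, Optional


def seconds_until_reset(
    headers: Mapping[str, str], now: Optional[float] = None
) -> float:
    """Return how long to wait before sending again, or 0.0 if requests remain."""
    now = time.time() if now is None else now
    # HTTP header names are case-insensitive, so normalize before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    # Missing headers are treated as "not limited" (an assumption).
    remaining = int(lowered.get("x-ratelimit-remaining", "1"))
    reset_at = float(lowered.get("x-ratelimit-reset", "0"))  # UTC epoch seconds
    if remaining > 0:
        return 0.0
    # Never return a negative wait if the window has already reset.
    return max(0.0, reset_at - now)
```

A client could call this after every response and sleep for the returned number of seconds, so it does not spend a request that would only come back as rejected.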

If you exceed the rate limit, the API returns a 429 Too Many Requests status code with a JSON error body:

{
  "error": "Too many requests. Please try again in 15 seconds."
}

Recommendation: Implement a retry mechanism with exponential backoff to handle rate limits gracefully.
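
A minimal sketch of such a retry loop is shown below. It uses exponential backoff with full jitter (a random delay between zero and the exponential ceiling), which is one common variant rather than anything Sonna.ai prescribes. The `send` callable, the `Response` tuple shape, and the default `base`, `cap`, and `max_retries` values are all assumptions made for illustration.

```python
import random
import time
from typing import Callable, Mapping, Tuple

# Assumed response shape: (status code, headers, body).
Response = Tuple[int, Mapping[str, str], str]


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 60.0) -> float:
    """Delay ceiling for a 0-based attempt: base * 2**attempt, capped at `cap`."""
    return min(cap, base * (2 ** attempt))


def call_with_backoff(
    send: Callable[[], Response],
    max_retries: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call `send`, retrying on HTTP 429 up to `max_retries` times."""
    attempt = 0
    while True:
        response = send()
        # Return on any non-429 status, or once the retries are used up.
        if response[0] != 429 or attempt >= max_retries:
            return response
        # Full jitter: wait a random time up to the exponential ceiling,
        # so many clients do not all retry at the same instant.
        sleep(random.uniform(0, backoff_delay(attempt)))
        attempt += 1
```

Here `send` would wrap a single API request made with whatever HTTP client you use. If the final attempt is still rate limited, the 429 response is returned to the caller instead of raising, so the calling code decides how to surface the failure.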