X-RateLimit-Limit
The short-window request limit for the current key and endpoint.
Understand YouTube API quota, transcript API rate limits, monthly request limits, and how to handle quota_exceeded and rate_limited errors.
Quota usually means a monthly or account-level usage allowance. Rate limit means a short-window protection against bursts. Scority exposes both concepts through distinct error codes.
The official YouTube Data API uses Google's own quota model. If you are building against official Google endpoints, check the current Google documentation for quota units and billing behavior before publishing exact numbers.
Scority API keys can have a monthly quota. When the key has no remaining monthly allowance, the API returns quota_exceeded.
Short-window rate limiting protects service reliability. It is designed to slow bursts, not to replace monthly quota.
Use these headers to slow down before a burst turns into user-facing errors.
The short-window request limit for the current key and endpoint.
How many requests remain in the current short-window interval.
When the current short-window interval resets.
How many seconds to wait before retrying after a 429 rate_limited response.
quota_exceeded means the API key has reached its monthly product quota. Retrying immediately usually does not help because the monthly allowance has not changed.
rate_limited means the integration crossed a short-window request limit. This is usually fixed by slowing down, queuing work or retrying after the indicated delay.
Most transcript workflows can reduce quota and rate-limit pressure by avoiding repeated fresh fetches for the same video and language.
See the canonical rate-limit header reference.
Open →Troubleshoot monthly quota exhaustion and usage planning.
Open →Troubleshoot rate_limited, quota_exceeded and retry behavior.
Open →Review quota_exceeded, rate_limited and retry guidance.
Open →Understand cost drivers for transcript workflows.
Open →