Quotas & Limits
Quotas are numeric limits that cap resource usage within a billing period. Unlike feature flags (which are boolean), quotas define how much of a resource a deployment can consume.
Quota Types
AI Monthly Token Limit
Controls the maximum number of AI tokens (input + output) a deployment can use per billing period.
| Tier | Monthly Token Limit |
|---|---|
| Sandbox | 0 (AI disabled) |
| Trial | 1,000 |
| Launch | 10,000 |
| Growth | 50,000 |
| Enterprise | Unlimited |
API Rate Limit
Controls the maximum number of API requests per minute across all endpoints.
| Tier | Requests/Minute |
|---|---|
| Sandbox | 100 |
| Trial | 500 |
| Launch | 2,000 |
| Growth | 10,000 |
| Enterprise | Unlimited |
How Quotas Reset
Monthly Reset
AI token quotas reset at the start of each billing period. The reset is tied to the subscription's billing cycle, not the calendar month.
- Unused tokens do not roll over to the next period
- The reset happens automatically at the billing period boundary
- Purchased credits (via credit packs) are separate and do not reset
Rate Limit Window
API rate limits use a sliding 1-minute window. Once the limit is hit, subsequent requests receive a 429 Too Many Requests response until the window advances.
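To stay under the limit proactively, a client can keep its own sliding-window count of recent requests. The sketch below is a client-side approximation only: `SlidingWindowCounter` and its methods are hypothetical names, not part of the HiveForge SDK, and the server's exact window accounting is not documented here and may differ.

```typescript
// Client-side sliding-window counter (illustrative; not a HiveForge API).
class SlidingWindowCounter {
  private readonly limit: number;
  private readonly windowMs: number;
  private timestamps: number[] = [];

  constructor(limit: number, windowMs: number = 60_000) {
    this.limit = limit;       // e.g. 2000 for the Launch tier
    this.windowMs = windowMs; // 1-minute window by default
  }

  // Drop recorded requests that have aged out of the window.
  private prune(now: number): void {
    while (true) {
      const oldest = this.timestamps[0];
      if (oldest === undefined || now - oldest < this.windowMs) break;
      this.timestamps.shift();
    }
  }

  // Records the request and returns true if it fits in the current window.
  tryAcquire(now: number = Date.now()): boolean {
    this.prune(now);
    if (this.timestamps.length >= this.limit) return false;
    this.timestamps.push(now);
    return true;
  }

  // Milliseconds until the oldest request leaves the window (0 if a slot is free).
  msUntilNextSlot(now: number = Date.now()): number {
    this.prune(now);
    const oldest = this.timestamps[0];
    if (oldest === undefined || this.timestamps.length < this.limit) return 0;
    return oldest + this.windowMs - now;
  }
}
```

Calling `tryAcquire()` before each request and waiting `msUntilNextSlot()` when it returns `false` keeps a single client under the limit. It does not account for other clients sharing the same deployment, so 429 responses still need handling.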
Exceeded Behavior
AI Token Limit Exceeded
When the AI monthly limit is reached:
{
  "error": "quota_exceeded",
  "code": "AI_QUOTA_EXCEEDED",
  "message": "AI token quota exceeded for this billing period",
  "quota_exceeded": true,
  "upgrade_url": "https://app.hiveforge.dev/billing"
}

The deployment can still use AI if it has purchased credits available. The deduction order is:
1. Monthly allocation (period balance)
2. Purchased credits
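The deduction order can be illustrated with a small worked sketch. This is not the service's implementation: the `Balances` shape and `deduct` function are hypothetical, both balances are assumed to be in the same unit (the real conversion between tokens and credits is not specified here), and rejecting a request that cannot be fully covered is also an assumption.

```typescript
// Hypothetical balance model; both fields assumed to share one unit.
interface Balances {
  periodRemaining: number;  // left in the monthly allocation (resets each period)
  purchasedCredits: number; // purchased credit balance (does not reset)
}

interface DeductionResult {
  ok: boolean;        // false when the combined balances cannot cover the amount
  fromPeriod: number;
  fromCredits: number;
  balances: Balances; // balances after the deduction (unchanged when ok is false)
}

function deduct(balances: Balances, amount: number): DeductionResult {
  // Draw from the monthly allocation first...
  const fromPeriod = Math.min(balances.periodRemaining, amount);
  // ...then cover any shortfall from purchased credits.
  const fromCredits = amount - fromPeriod;
  if (fromCredits > balances.purchasedCredits) {
    return { ok: false, fromPeriod: 0, fromCredits: 0, balances };
  }
  return {
    ok: true,
    fromPeriod,
    fromCredits,
    balances: {
      periodRemaining: balances.periodRemaining - fromPeriod,
      purchasedCredits: balances.purchasedCredits - fromCredits,
    },
  };
}
```

For example, with 5,800 remaining in the period and 9,500 purchased credits, a deduction of 6,000 takes 5,800 from the period balance and 200 from purchased credits.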
API Rate Limit Exceeded
When the rate limit is hit:
HTTP/1.1 429 Too Many Requests
Retry-After: 12
X-RateLimit-Limit: 2000
X-RateLimit-Remaining: 0
X-RateLimit-Reset: 1711612800

{
  "error": "rate_limit_exceeded",
  "message": "API rate limit exceeded. Retry after 12 seconds."
}

Sustained rate limit violations may trigger additional throttling. Design your application to respect Retry-After headers.
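One way to respect these headers is to derive the wait time from the 429 response before retrying. The sketch below is a minimal illustration under stated assumptions: `retryDelayMs` and `sendWithRetry` are hypothetical helpers, header names are assumed to be lower-cased by the caller, only the delay-in-seconds form of Retry-After is handled, and `X-RateLimit-Reset` is assumed to be a Unix timestamp in seconds (which the example value suggests but the text does not state).

```typescript
type HeaderMap = Record<string, string>;

interface MinimalResponse {
  status: number;
  headers: HeaderMap; // assumed: header names already lower-cased
}

// Derive how long to wait before retrying a rate-limited request.
function retryDelayMs(headers: HeaderMap, nowMs: number = Date.now()): number {
  const retryAfter = headers["retry-after"];
  if (retryAfter !== undefined && retryAfter.trim() !== "") {
    const seconds = Number(retryAfter);
    if (isFinite(seconds) && seconds >= 0) return seconds * 1000;
  }
  // Fallback (assumption): X-RateLimit-Reset is Unix epoch seconds.
  const reset = headers["x-ratelimit-reset"];
  if (reset !== undefined && reset.trim() !== "") {
    const resetSeconds = Number(reset);
    if (isFinite(resetSeconds)) return Math.max(0, resetSeconds * 1000 - nowMs);
  }
  return 1000; // conservative default when no usable header is present
}

// Retry a request while it keeps returning 429, up to maxAttempts total sends.
async function sendWithRetry<T extends MinimalResponse>(
  send: () => Promise<T>,
  maxAttempts: number = 3,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  let response = await send();
  for (let attempt = 1; attempt < maxAttempts && response.status === 429; attempt++) {
    await sleep(retryDelayMs(response.headers));
    response = await send();
  }
  return response;
}
```

The `send` callback wraps whatever HTTP client the application uses, which keeps the retry logic independent of any particular library. Capping the number of attempts avoids adding to sustained violations.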
Monitoring Usage
Quota Status via Entitlement Check
The entitlement response includes quota information:
{
  "tier": "launch",
  "quotas": {
    "ai_tokens": {
      "used": 4200,
      "limit": 10000,
      "remaining": 5800,
      "resets_at": "2026-04-01T00:00:00Z"
    }
  }
}

AI Quota Endpoint
For detailed AI usage tracking:
curl https://api.hiveforge.dev/api/v1/ai/quota \
  -H "X-Deployment-ID: d9f2a1b4-..." \
  -H "X-Deployment-Secret: sk_live_..."

Response:
{
  "used": 4200,
  "limit": 10000,
  "remaining": 5800,
  "resets_at": "2026-04-01T00:00:00Z",
  "tier": "launch",
  "credits_available": 9500,
  "credits_per_standard": 1,
  "credits_per_advanced": 3,
  "credits_per_premium": 10
}

SDK Quota Monitoring
import { HiveForgeClient } from "@producthacker/hiveforge-sdk";

const client = new HiveForgeClient();

// Check AI quota
const quota = await client.ai.getQuota();
console.log(`AI tokens: ${quota.used}/${quota.limit} (${quota.remaining} remaining)`);
console.log(`Resets at: ${quota.resets_at}`);

// Check if near limit
if (quota.remaining !== null && quota.remaining < 1000) {
  console.warn("AI token quota is running low");
}

Quota Overrides
Enterprise deployments and special cases can have quota overrides applied by HiveForge administrators:
- Override AI token limits (higher or lower than tier default)
- Override API rate limits
- Set expiration dates on overrides
- Reset current usage counters
Overrides are applied via the admin API and take priority over tier defaults.
Quota overrides persist across billing periods until they expire or are removed. They do not affect the deployment's tier -- only the specific quota values.
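The precedence rule can be sketched as a lookup that prefers an unexpired override and otherwise falls back to the tier default. This is an illustration only: the `QuotaOverride` shape, the `effectiveAiTokenLimit` function, and the lower-case tier identifiers other than `launch` are assumptions, and `null` is used here to stand for "Unlimited".

```typescript
type Tier = "sandbox" | "trial" | "launch" | "growth" | "enterprise";

// Tier defaults from the AI Monthly Token Limit table; null means unlimited.
const TIER_AI_TOKEN_LIMITS: Record<Tier, number | null> = {
  sandbox: 0,
  trial: 1_000,
  launch: 10_000,
  growth: 50_000,
  enterprise: null,
};

// Hypothetical override record; the admin API's real shape is not documented here.
interface QuotaOverride {
  limit: number | null; // may be higher or lower than the tier default
  expiresAt?: string;   // ISO 8601 timestamp; omitted means no expiration
}

function effectiveAiTokenLimit(
  tier: Tier,
  override: QuotaOverride | undefined,
  now: Date = new Date(),
): number | null {
  const tierDefault = TIER_AI_TOKEN_LIMITS[tier];
  if (override === undefined) return tierDefault;
  // An expired override no longer applies; the tier default takes over.
  if (override.expiresAt !== undefined && Date.parse(override.expiresAt) <= now.getTime()) {
    return tierDefault;
  }
  // An active override takes priority; the tier itself is unchanged.
  return override.limit;
}
```

Because the override is resolved independently of the billing period, the same result holds before and after a monthly reset until the expiration passes or the override is removed.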
Best Practices
- Cache entitlement data -- Respect the next_check_seconds field to avoid unnecessary API calls
- Show usage in your UI -- Display quota consumption to end users so they can manage their usage
- Handle 429 responses -- Implement exponential backoff with jitter for rate-limited requests
- Monitor quota warnings -- Alert when usage approaches 80% of the limit
- Consider credit packs -- For deployments that regularly exceed monthly allocations, purchased credits provide overflow capacity
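Two of these practices lend themselves to short helpers. The sketch below shows one common form of exponential backoff with jitter ("full jitter", where the delay is drawn uniformly between zero and an exponentially growing ceiling) and an 80% usage check. The function names, base delay, and cap are illustrative assumptions, not values prescribed by HiveForge.

```typescript
// Full-jitter exponential backoff: random delay in [0, min(cap, base * 2^attempt)).
function backoffDelayMs(
  attempt: number,                      // 0 for the first retry
  baseMs: number = 500,                 // assumed base delay
  capMs: number = 30_000,               // assumed upper bound
  random: () => number = Math.random,   // injectable for testing
): number {
  const ceiling = Math.min(capMs, baseMs * Math.pow(2, attempt));
  return Math.floor(random() * ceiling);
}

// True when usage has reached the warning threshold (80% by default).
function isNearLimit(used: number, limit: number | null, threshold: number = 0.8): boolean {
  // No meaningful threshold for unlimited quotas (null) or a zero allocation.
  if (limit === null || limit <= 0) return false;
  return used / limit >= threshold;
}
```

When a 429 response carries a Retry-After header, that value should take precedence over a computed backoff delay; the jittered delay is a fallback for responses without one.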