Entitlements & Tiers
HiveForge's entitlement system controls what features, quotas, and capabilities each deployed application has access to. Entitlements are determined by the deployment's subscription tier and enforced at the API layer.
Core Concepts
Tiers
Every deployment is assigned one of five tiers, and that tier determines its capabilities:
| Tier | Target Use Case |
|---|---|
| Sandbox | Free exploration, no AI or billing access |
| Trial | Time-limited evaluation with full feature access |
| Launch | Production-ready for early-stage products |
| Growth | Scaling products with higher limits and branding |
| Enterprise | Unlimited usage with dedicated support |
See Tier Comparison for the full feature matrix.
Feature Flags
Boolean flags that gate access to platform capabilities:
- ai_enabled -- Access to the AI proxy (completions, embeddings)
- billing_enabled -- Access to the Stripe billing proxy
- custom_domain -- Ability to use a custom domain
- white_label -- Remove HiveForge branding
- mcp_enabled -- Access to MCP tool metering
See Feature Flags for details on checking each flag.
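
As a minimal sketch of gating a call on one of these flags, assume the entitlement payload exposes the flags as a plain mapping of flag name to boolean. The payload shape and the `has_feature` helper are hypothetical and not taken from the HiveForge API. Only the flag names come from the list above.

```python
# Hypothetical payload shape: flag name -> bool. Only the flag names are
# taken from this page; the helper and its behavior are assumptions.
KNOWN_FLAGS = {
    "ai_enabled",
    "billing_enabled",
    "custom_domain",
    "white_label",
    "mcp_enabled",
}


def has_feature(features: dict[str, bool], flag: str) -> bool:
    """Return True only if the flag is known and explicitly enabled."""
    if flag not in KNOWN_FLAGS:
        raise ValueError(f"unknown feature flag: {flag}")
    # A flag missing from the payload is treated as disabled (fail closed).
    return features.get(flag, False) is True


# Example: a Sandbox-like deployment with no AI or billing access.
sandbox_features = {"ai_enabled": False, "billing_enabled": False}
print(has_feature(sandbox_features, "ai_enabled"))     # False
print(has_feature(sandbox_features, "custom_domain"))  # False
```

Treating a missing flag as disabled is a design choice in this sketch: it keeps a partial or outdated payload from granting access by accident.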
Quotas
Numeric limits that cap usage within a billing period:
- AI monthly limit -- Maximum AI tokens per month
- API rate limit -- Maximum API requests per minute
See Quotas & Limits for quota behavior and reset rules.
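
One way to picture a numeric limit is a fixed-window counter, sketched below. This is an illustration under assumptions, not HiveForge's enforcement logic: the `FixedWindowQuota` class, the window length in seconds, and the example limit are all hypothetical, and the real reset rules are described on the Quotas & Limits page.

```python
from dataclasses import dataclass


@dataclass
class FixedWindowQuota:
    """Counts usage inside a fixed window and rejects anything over the limit."""

    limit: int            # e.g. requests per minute (hypothetical value below)
    window_seconds: int   # assumed window length; real reset rules may differ
    window_start: float = 0.0
    used: int = 0

    def try_consume(self, amount: int, now: float) -> bool:
        # Start a fresh window once the current one has elapsed.
        if now - self.window_start >= self.window_seconds:
            self.window_start = now
            self.used = 0
        # Reject the request if it would push usage past the limit.
        if self.used + amount > self.limit:
            return False
        self.used += amount
        return True


# Example: a hypothetical limit of 2 requests per 60-second window.
rate_limit = FixedWindowQuota(limit=2, window_seconds=60)
results = [
    rate_limit.try_consume(1, now=0.0),   # first request in the window
    rate_limit.try_consume(1, now=10.0),  # second request, still within limit
    rate_limit.try_consume(1, now=20.0),  # third request, over the limit
    rate_limit.try_consume(1, now=60.0),  # new window, counter has reset
]
print(results)  # [True, True, False, True]
```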
Credits
A fungible unit for metered usage across all proxy services:
- AI completions and embeddings
- Email sending
- Vector search and upsert
- Webhook delivery
- MCP tool calls
See Credit System for the full credit model.
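
The sketch below illustrates what "fungible" means here: one balance is debited by every metered service. The per-operation costs, the operation names, and the `CreditBalance` class are invented for illustration. Actual costs are documented on the Credit System page.

```python
# Hypothetical per-operation costs, chosen only to make the example concrete.
EXAMPLE_COSTS = {
    "ai_completion": 5,
    "email_send": 1,
    "vector_search": 2,
    "webhook_delivery": 1,
    "mcp_tool_call": 3,
}


class InsufficientCredits(Exception):
    """Raised when a charge would take the balance below zero."""


class CreditBalance:
    """A single credit pool shared by all metered operations."""

    def __init__(self, credits: int) -> None:
        self.credits = credits

    def charge(self, operation: str, count: int = 1) -> int:
        """Debit the cost of `count` operations and return the new balance."""
        cost = EXAMPLE_COSTS[operation] * count
        if cost > self.credits:
            raise InsufficientCredits(
                f"{operation} x{count} costs {cost}, balance is {self.credits}"
            )
        self.credits -= cost
        return self.credits


balance = CreditBalance(credits=10)
balance.charge("ai_completion")   # debits 5 from the shared pool
balance.charge("email_send", 2)   # debits 2 from the same pool
print(balance.credits)            # 3
```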
Subscription Lifecycle
A deployment's subscription moves through these statuses:

    ┌──────────┐     ┌──────────┐     ┌──────────┐
    │ trialing │────►│  active  │────►│ canceled │
    └──────────┘     └────┬─────┘     └──────────┘
                          │
                          ▼
                     ┌──────────┐     ┌───────────┐
                     │ past_due │────►│ suspended │
                     └────┬─────┘     └───────────┘
                          │
                          ▼
                   ┌──────────────┐
                   │ grace_period │
                   └──────────────┘

| Status | Meaning | Feature Access |
|---|---|---|
| trialing | Free trial period | Full tier features |
| active | Paid subscription in good standing | Full tier features |
| past_due | Payment failed, awaiting retry | Full tier features (temporary) |
| grace_period | Extended window after payment failure | Reduced features |
| suspended | Access revoked due to non-payment or admin action | No access |
| canceled | Subscription ended by customer | No access |
The transition from past_due to grace_period to suspended is automatic. The grace period gives customers time to update payment methods before losing access.
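
The status table and transitions above can be encoded as two lookup tables, as in the following sketch. The `Access` enum and `can_transition` helper are hypothetical names. The transition set contains only the arrows drawn in the diagram plus the grace_period to suspended step named in the prose. Recovery paths, such as a successful payment retry, are not described on this page and are left out.

```python
from enum import Enum


class Access(Enum):
    FULL = "full"
    REDUCED = "reduced"
    NONE = "none"


# Status -> feature access, copied from the status table.
ACCESS_BY_STATUS = {
    "trialing": Access.FULL,
    "active": Access.FULL,
    "past_due": Access.FULL,        # temporary, while payment is retried
    "grace_period": Access.REDUCED,
    "suspended": Access.NONE,
    "canceled": Access.NONE,
}

# Transitions shown in the diagram, plus grace_period -> suspended from the
# prose. Any other transition is an unknown in this sketch and is rejected.
TRANSITIONS = {
    "trialing": {"active"},
    "active": {"canceled", "past_due"},
    "past_due": {"grace_period", "suspended"},
    "grace_period": {"suspended"},
    "suspended": set(),
    "canceled": set(),
}


def can_transition(current: str, target: str) -> bool:
    """Return True if the documented lifecycle allows current -> target."""
    return target in TRANSITIONS.get(current, set())


print(ACCESS_BY_STATUS["grace_period"])          # Access.REDUCED
print(can_transition("past_due", "grace_period"))  # True
print(can_transition("canceled", "active"))        # False
```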
How Entitlements Are Checked
Every API request from a deployed application passes through the entitlement check, which runs these steps in order:
- Identify deployment -- Extract deployment ID from the request credentials
- Load entitlements -- Fetch tier, features, quotas, and credit balance
- Evaluate access -- Check the relevant feature flag and quota limits
- Allow or deny -- Return the response or a structured error
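
The four steps above can be sketched as a single function. Everything concrete in this example is an assumption made for illustration: the in-memory `API_KEYS` and `ENTITLEMENTS` stores, the `Entitlements` fields, the `check_request` signature, and the error codes. The real error format and lookups are not specified on this page.

```python
from dataclasses import dataclass, field


@dataclass
class Entitlements:
    tier: str
    features: dict[str, bool] = field(default_factory=dict)
    quota_remaining: dict[str, int] = field(default_factory=dict)
    credits: int = 0


# In-memory stand-ins for credential and entitlement lookups (hypothetical).
API_KEYS = {"key-123": "dep-1"}
ENTITLEMENTS = {
    "dep-1": Entitlements(
        tier="Launch",
        features={"ai_enabled": True, "mcp_enabled": False},
        quota_remaining={"ai_tokens": 1000},
        credits=50,
    )
}


def check_request(api_key: str, feature: str, quota: str, amount: int) -> dict:
    # 1. Identify deployment from the request credentials.
    deployment_id = API_KEYS.get(api_key)
    if deployment_id is None:
        return {"allowed": False, "error": "unknown_deployment"}

    # 2. Load tier, features, quotas, and credit balance.
    ent = ENTITLEMENTS[deployment_id]

    # 3. Evaluate the relevant feature flag, then the quota.
    if not ent.features.get(feature, False):
        return {"allowed": False, "error": "feature_disabled"}
    if ent.quota_remaining.get(quota, 0) < amount:
        return {"allowed": False, "error": "quota_exceeded"}

    # 4. Allow the request.
    return {"allowed": True, "error": None}


print(check_request("key-123", "ai_enabled", "ai_tokens", 200))
# {'allowed': True, 'error': None}
print(check_request("key-123", "mcp_enabled", "ai_tokens", 1))
# {'allowed': False, 'error': 'feature_disabled'}
```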
The entitlement response includes a next_check_seconds field (default 300) that tells the client how long to cache the result before re-checking.
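
A client could honor next_check_seconds with a small time-based cache, sketched below. The `EntitlementCache` class and the `fetch` callable are hypothetical. Only the field name and its default of 300 come from this page. The clock is injectable so the expiry logic can be exercised without waiting.

```python
import time
from typing import Callable, Optional


class EntitlementCache:
    """Caches one entitlement response for next_check_seconds."""

    def __init__(
        self,
        fetch: Callable[[], dict],
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch      # hypothetical function that calls the API
        self._clock = clock
        self._cached: Optional[dict] = None
        self._expires_at = 0.0

    def get(self) -> dict:
        now = self._clock()
        # Re-fetch on first use or once the cached result has expired.
        if self._cached is None or now >= self._expires_at:
            self._cached = self._fetch()
            # Fall back to the documented default if the field is absent.
            ttl = self._cached.get("next_check_seconds", 300)
            self._expires_at = now + ttl
        return self._cached


# Example with a fake clock and a fetch function that counts its calls.
fetch_calls = []
fake_now = [0.0]


def fake_fetch() -> dict:
    fetch_calls.append(fake_now[0])
    return {"tier": "Launch", "next_check_seconds": 300}


cache = EntitlementCache(fake_fetch, clock=lambda: fake_now[0])
cache.get()
fake_now[0] = 299.0
cache.get()              # still cached, no new fetch
fake_now[0] = 300.0
cache.get()              # expired, fetches again
print(len(fetch_calls))  # 2
```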
Quick Reference
| Topic | Page |
|---|---|
| Full tier comparison matrix | Tier Comparison |
| Feature flags and how to check them | Feature Flags |
| Quota limits, resets, and exceeded behavior | Quotas & Limits |
| Credit system, costs, and purchasing | Credit System |