FORG works without a gateway — agent observability is fully out-of-band by default. When you need hard enforcement (blocking requests before they reach the model), flip on the gateway. One command. Same rules. Same pricing.
Both modes share the same rules engine, dashboards, and pricing. The only difference is when enforcement fires.
Agent calls LLM directly. FORG receives telemetry in parallel — zero latency added to the hot path.
All API calls route through forg.pro/gateway/v1/. Rules evaluated synchronously before the model sees the request.
Most teams start without it. These are the three scenarios where inline enforcement matters.
You need requests blocked before they reach the model — not just alerted after the fact. Budget overruns, disallowed models, or blocked patterns return HTTP 429 in real time.
Your compliance team requires all AI traffic routed through an audited control point. Gateway provides an immutable log of every request with timestamps, model IDs, token counts, and error codes.
Legal requires proof that AI calls stayed within your approved regions or never touched unapproved endpoints. Gateway's single ingress point makes this attestable and auditable.
Included in your seat. No token-based billing. Gateway billing is always $0 beyond the seat price.
No adapter changes required. Your existing rules apply immediately once the gateway is active.
Payload logging is OFF by default. FORG is a signal layer, not a content layer.
Both modes give you full observability. Gateway adds real-time enforcement on top.
Your rules carry over. Your seat price stays the same. You get hard enforcement instead of advisory alerts.