
Point your base URL at the gateway and bring your own provider key. You pay for the tokens you route through it, and the compression usually pays for itself.
For trying it on a side project.
For a production feature or two.
For teams running multiple workloads.
Azure-native, governed, deployed by Ryshe.
Indicative tiers for launch. Token limits and prices may change before general availability.
By tokens processed through the gateway, the same unit your model provider bills you on. You only route the workloads you choose, and the savings typically more than cover the fee.
No. In the default mode you bring your own provider key on each request and we forward with it. We never persist it.
The default compression is lossless: it normalizes whitespace and removes duplicate instructions and messages. Aggressive, reversible compression is opt-in and gated by evaluation.
Yes. The paid plans exist for the hosted dashboard, team accounts, governance, and support. If you want to run it yourself, the engine layer is built on open standards.