← Back to Model Selector

Pricing

Cut your LLM costs. Keep quality and speed.

Transparent, no-surprise pricing

Pay per token routed. Scale from prototypes to production. 40-60% cost savings vs. direct provider calls.

Starter

$0
100K tokens/month
  • Dynamic model routing
  • Automatic fallback logic
  • Cost dashboard
  • Community Slack

Pro

$49
/month, 10M tokens
  • Everything in Starter
  • Priority routing rules
  • Real-time cost breakdown
  • Detailed analytics API
  • Email support (2h SLA)

Enterprise

Custom
Contact sales
  • Everything in Pro
  • Dedicated routing engineer
  • 99.9% uptime SLA
  • Custom model whitelisting
  • Self-hosted option

Feature Comparison

Feature Starter Pro Enterprise
API access Yes Yes Yes
Model selection rules Basic Advanced Unlimited
Cost alerts Weekly email Real-time webhook Custom alerts
Support response time 48 hours 2 hours 15 minutes
SLA uptime guarantee 99% 99.5% 99.9%

Frequently Asked

How do you count tokens?

Input tokens + output tokens = total tokens used. Billing matches provider rates (GPT-4, Claude, Llama, etc.) plus 8% for routing overhead. Transparent per token breakdowns on your dashboard.

What if I go over my limit?

Requests route successfully. You get real-time dashboard alerts. Overage is $0.0015 per 1K tokens. Upgrade to a higher plan to lock in a predictable budget and avoid surprises.

Can I pause or downgrade?

Yes. Change plans anytime. Billing is prorated daily. No cancellation fees. Your API keys keep working with the new plan instantly.

Do you offer annual discounts?

Yes. Annual commitment gets 20% off. Contact sales for Enterprise annual pricing or custom volume discounts.

Is there a free tier for production?

Starter is free with 100K monthly tokens, suitable for prototypes. Pro is optimized for production workloads. Enterprise gets dedicated infrastructure and SLA.

Ready to shrink your LLM bill?

Get a 14-day Pro trial. No credit card required. See real savings on your stack.