Intelligent Routing for AI Native Apps

AI that picks the right model for every request.

Start for Free
1,100+ Models
16 Providers
1 Unified API
Acme Inc · org_3kf9d2

Overview · Traffic

Customize Save view
Time:
1h 24h 7d 30d
Custom Filters 2
Export
Requests
128,403
Success rate
99.4%
5xx errors
142
Tokens
84.2M
Latency · p95
1,240ms
Latency · p99
3,180ms
Cost
$1,284.50
Customers
312
Requests · Count / time 128,403
Outcome · 5xx / time 142
Latency · p95 / time 1,240ms
Cost · Total / time $1,284.50
Requests 8 loaded+
Columns
TimeRequest IDRouterAliasModelCustomerIn/OutCostLatAttStatus
07:32:14.291 req_8f2a91 rt_chatbot smart claude-sonnet-4-6 tenant_acme 1,204 / 856 $0.0142 842ms 1 200
07:32:13.884 req_7d10c4 rt_agent gpt-5 user_42891 2,860 / 1,420 $0.0388 1,910ms 1 200
07:32:12.530 req_5c93af rt_summarize fast claude-haiku-4-5 tenant_globex 640 / 220 $0.0021 318ms 1 200
07:32:10.119 req_4b77e2 rt_agent smart gpt-5 tenant_initech 3,120 / 980 $0.0402 2,240ms 2 200
07:32:08.742 req_3a18bd rt_chatbot cheap gemini-2.5-pro user_10233 1,540 / 612 $0.0096 1,120ms 1 200
07:32:06.501 req_29ee70 rt_embed text-embedding-3-large tenant_acme 820 / 0 $0.0003 92ms 1 200
07:32:04.330 req_1f0c55 rt_agent smart gpt-5 tenant_hooli 3 429
07:32:01.987 req_0b6d18 rt_chatbot fast claude-sonnet-4-6 user_88120 1,480ms 2 503
8 rows Load more

The Complete Platform for Using AI Models in Production

One platform for routing, prompts, and observability.

10msp50 overhead

Built for performance

A Rust gateway with full streaming and per-request latency tracking.

1,100+models

One API, every model

Drop-in endpoints compatible with the OpenAI and Anthropic APIs, across 16 providers.

16providers

Smart routing & fallbacks

Compose routers with aliases, automatic fallbacks, and weighted load balancing.

Gitversioned

Prompt management

Version prompts inline or sync them from GitHub, with {{variables}} and full revision history.

Livedashboard

Traffic observability

Latency, errors, cost, and every request, filterable and saved as views.

$/reqattribution

Cost & usage tracking

Per-request cost and token accounting, cache-aware, attributed by customer.