Models & aliases
Everything you can pass as `model` — concrete aliases, industry tags, strategy hints, and auto.
The model field is the steering wheel. You can name a model directly, or hand
the choice to the router with a tag, a strategy, or auto. Every form resolves
through the same routing pipeline.
What you can pass as model
Concrete alias
A specific model: gpt-4o, gpt-4o-mini, claude-3-5-sonnet,
claude-3-5-haiku, qwen2.5-coder, llama3.1, or any local model you
configure. Forwarded to the backend that serves it.
Industry tag
industry:legal, industry:healthcare, industry:code, … — the router
picks the curated best model for that domain and falls through a priority
list if your top pick is offline.
Strategy hint
strategy:cost, strategy:quality, strategy:latency, strategy:local —
route by intent rather than name. See Routing.
Auto
auto — a balanced default. The classifier inspects the prompt and routes
to the best model for the task, so you can ship once and let us tune.
Industry tags
| Tag | Tuned for |
|---|---|
industry:legal | Contracts, citations, careful nuance |
industry:healthcare | Clinical Q&A, polite refusals |
industry:finance | Quantitative reasoning, numeric reliability |
industry:code | Code generation, debugging, refactors |
industry:creative | Long-form writing, brand voice |
industry:support | Fast, polite, tool-aware |
industry:sales | Lead enrichment, outreach drafts |
industry:data | SQL, dataframes, chart reasoning |
industry:education | Step-by-step tutoring |
industry:general | Balanced default |
The live catalog
The set of routable models is driven by the gateway's live catalog — synced from the running proxy and enriched for open-weight families (Qwen, DeepSeek, Llama, Mistral, GLM, and more). Each entry carries its provider, capabilities (code / reasoning / vision), price, and observed average latency, so the router can sort candidates by price or responsiveness.
Because the catalog is live, new models become routable without a client
change — name auto or a strategy: directive and you automatically benefit
from anything newly added.
Open-weight & local models
Concrete aliases also cover models you host yourself. Configure an
OLLAMA_BASE_URL or VLLM_BASE_URL on the gateway and those models join the
catalog like any other — including under strategy:local. See
BYOK & local models.