Open-Source AI Gateway
One endpoint.
Every model.
Total control.
Nexus is a drop-in replacement for the OpenAI/Anthropic/Gemini API. BYOK, virtual keys, guardrails, semantic caching, and observability — without giving up your provider accounts.
Apache 2.0 · self-hostable · MIT-licensed UI · production-grade observability
from openai import OpenAI
# Same SDK, drop in Nexus as the base URL.
client = OpenAI(
base_url="https://nexus.ffx.ai/v1",
api_key="nxs_live_...",
)
resp = client.chat.completions.create(
model="gemini-2.5-flash",
messages=[{"role": "user", "content": "Hello, Nexus."}],
) Features
Everything you need between
your app and the LLM.
Built for teams shipping AI features in production — not demos.
BYOK, the right way
Users bring their own OpenAI / Anthropic / Gemini keys. Keys are encrypted at rest, never logged, and never leave your tenant.
Virtual keys, real isolation
Mint per-team or per-app virtual keys with their own budgets, RPM limits, allowed models, and audit trail.
Quality-aware routing
Route by speed, cost, or rolling quality score. Add provider fallbacks that kick in before users see errors.
Guardrails in-line
PII redaction, JSON-schema enforcement, and self-correction. Block bad inputs/outputs before they cost a cent.
Semantic cache
Embeddings-based cache cuts repeat traffic. 10× cheaper re-prompts, 0 code changes in your app.
OpenTelemetry out of the box
Every request is traced with model, tokens, cost, latency, cache hit, and quality score. Export to any OTLP backend.
SSO via OIDC / SAML
Connect Okta, Azure AD, Google Workspace, or self-hosted Keycloak. JIT provisioning + per-org policy.
Self-host in 5 minutes
Single binary, Postgres, Redis, ClickHouse. Docker Compose for dev, Helm for prod. No SaaS lock-in.
How it works
From pip install to $1M/mo spend — same code.
- 01
Self-host or use the cloud
Run our Helm chart in your own Kubernetes, or sign up for a managed tenant on nexus.ffx.ai.
- 02
Plug in your provider keys
Per-user BYOK or org-level platform credentials. Encrypted at rest with KMS-managed keys.
- 03
Mint virtual keys for your apps
Each app or team gets a virtual key with its own budget, allowed models, and rate limit.
- 04
Watch the dashboard
Traces, costs, eval scores, and quality — by user, team, model, or virtual key.
FAQ
Questions, answered.
How is Nexus different from LiteLLM?
Do my users' LLM keys leave my infrastructure?
Which providers are supported?
Can I self-host?
How does pricing work?
What's on the roadmap?
Ready to ship?
The fastest way to put an LLM in production.
Self-host in 5 minutes, or sign up for a managed tenant.