BYOK — Bring your own AI provider key

Jump to section

The chatbot pricing problem

Most platforms in this category sell “messages.” A message is whatever they say it is — sometimes an inbound visitor turn, sometimes a model call, sometimes a token bucket. As your volume grows the per-message price doesn’t fall as fast as the underlying token price you’d be paying yourself. You’re not buying compute. You’re buying access through a meter the platform controls.

This is fine when you’re tiny. A few hundred messages a month is a rounding error. The math gets uncomfortable somewhere around the point where your bot starts working — when your widget is converting, your channels are connected, your help center is fully ingested, and your conversation count starts to climb. Suddenly the meter is the dominant line in your support stack. You’re not paying for the platform anymore. You’re paying for somebody else’s margin on tokens you could have bought directly.

There’s a second thing nobody mentions: the meter rarely reflects what the underlying model actually costs. A long reasoning response on a frontier model consumes real tokens. A short factual reply from a small model is essentially free. Per-message pricing flattens that distinction and charges you the same per “message” either way, so the platform makes more on the cheap conversations and you subsidize the expensive ones for them. With BYOK, the cost you see is the cost you actually generated.

Two pricing shapes

NebulaHex offers two ways to pay for the AI side of your bots. Both ship the same product. The difference is where the AI compute bill comes from.

DEFAULTManaged AI · Default mode

Predictable platform fee

Pay a flat monthly platform fee. Messages count against your plan’s monthly allowance. We pick the model. You get one bill from us covering platform and AI compute — everything in a single line.

Best for

•Low or medium volume
•Simple setup, hands-off model choice
•Teams without a preferred AI provider
•Predictable bill matters more than absolute cost

BYOK MODEWorkspace toggle · Same paid plan

Platform fee + provider’s published rates

Same flat platform fee, plus your own Anthropic or OpenAI bill at the provider’s published per-million-token rates. We don’t meter messages on our side — the provider does, and you see their dashboard, not ours.

Best for

•Real volume (thousands of conversations a month)
•Teams who want to pick their own model
•Existing AI provider contract or credit programme
•Compliance / procurement consolidation

Neither shape is universally better. The default plans are simpler and the right call for most teams starting out. BYOK becomes the better economics once your volume crosses a threshold or your team has reasons to want direct provider billing — cost transparency, compliance, model choice, an existing provider relationship.

Two providers, six models

Across Anthropic and OpenAI, the BYOK lineup covers the speed-versus-quality range most teams care about. Your workspace picks one default model, and you can change it any time without losing conversation history, knowledge sources, or customizations — the only thing that changes is which model produces the next reply.

ANTHROPIC

Claude Haiku

Fast · Lowest token cost

High-volume support bots where most questions are routine FAQ-style lookups and latency matters.

Speed

Quality

ANTHROPIC

Claude Sonnet

Balanced · Speed + reasoning

Bots that follow longer policies, handle nuanced phrasing, or hold conversations beyond simple lookups.

Speed

Quality

ANTHROPIC

Claude Opus

Frontier · Quality-first

Bots where being subtly wrong would cost you a customer. Reaches for complex multi-step reasoning.

Speed

Quality

OPENAI

GPT-4o Mini

Cost-optimized small model

High-volume bots that mostly need to read your knowledge base accurately and reply quickly.

Speed

Quality

OPENAI

GPT-4o

Workhorse · General-purpose

Fast, capable across a wide range of tasks. Comfortable middle ground between cost and quality.

Speed

Quality

OPENAI

GPT-4.1

Long context · Strong instructions

When your bot needs to handle long policy documents in context or follow tight formatting instructions.

Speed

Quality

Speed and Quality on the cards reflect general positioning within each provider’s lineup, not a head-to-head ranking. The right model is the one that fits your bots’ job. Most teams settle on Haiku or GPT-4o Mini for high-volume workspaces, Sonnet or GPT-4o for general purpose, and Opus or GPT-4.1 for workspaces where quality is the constraint.

How to switch your workspace to your own key

The mechanics are short and reversible. Three steps once you’re on any paid plan.

Open AI Configuration in Workspace Settings

Navigate to your workspace’s Settings → AI Configuration. The controls are available on every paid plan; no plan switch needed. Toggle BYOK on, and your bots start using your provider key for every reply.

Paste your key and validate

Open Workspace Settings → AI Configuration. Pick Anthropic or OpenAI. Paste your API key into the field. Click Validate Key — the platform makes a real, low-cost test call against your key to confirm it works and reports back which models you have access to. Validation happens before save, so a broken key never lands in production.

Choose your default model and save

Pick from the validated model list. Save. Every bot in your workspace immediately starts using your key for new replies. No warm-up period. No propagation delay. The next conversation a customer starts is on your account.

Your key is encrypted before it’s stored and never returned to the dashboard in plaintext after you save it. If you need to rotate it, paste a new key and re-validate — the old one is replaced atomically. If you want to leave BYOK, toggle it off in workspace settings. You stay on your current plan; the AI billing flips back to managed. There’s no migration step, no data movement, no waiting period.

What changes the moment you switch

From your customers’ side, the change is invisible — the widget looks the same, the inbox looks the same, the conversation history looks the same. But the underlying economics are now yours.

Messages stop being metered on our side

We don’t count them against a monthly cap. We don’t notify you at 80% or 95% of plan usage. The platform plan is a flat fee, and that fee is the entire bill from NebulaHex regardless of whether your workspace handles a thousand conversations or a hundred thousand.

Your model bill is the provider’s published rate

When any bot in your workspace replies, the API call goes from our backend to your AI provider, signed with your key. The provider charges you directly at their public per-million-token rate. You see exactly what you’re spending in the Anthropic Console or the OpenAI dashboard. Spending caps live in your provider account.

The full feature set unlocks

BYOK isn’t a billing change with strings attached. Every paid plan already ships with all five channels, every CRM and helpdesk integration, live chat handoff, the Google Sheets connector, custom CSS and webhooks, and priority support — flipping BYOK on doesn’t remove or unlock anything else. It only changes where the AI cost shows up.

Most BYOK customers running modest volume see a small monthly AI bill at the provider, paid directly to Anthropic or OpenAI. The platform fee from us stays flat regardless. Both providers let you set spending caps in your account, and we recommend turning that on the moment you connect a key.

Included on every paid plan

BYOK is a feature on every paid plan, not its own tier. On Growth, Pro, Business, or Enterprise, you can flip BYOK on at the workspace level and your bots immediately start signing requests with your provider key. Flip it off, and your bots go back to managed AI using NebulaHex’s models against your plan’s monthly message allowance.

Two billing modes, every paid plan

DEFAULT

Managed mode

NebulaHex picks the model. Messages count against your plan’s monthly allowance. One bill from us covering platform and AI.

YOUR KEY

Optional

BYOK mode

You bring an Anthropic or OpenAI key. Your bots use that key for every reply. Your provider bills you at published rates. Platform fee unchanged.

Every paid plan ships with both options. Switch any time from workspace settings.

The difference between “managed” and “BYOK” isn’t your subscription — it’s where the AI cost shows up. Managed: in your NebulaHex bill. BYOK: in your Anthropic or OpenAI bill. The platform fee stays the same regardless.

If your team needs Claude Opus on every reply, BYOK lets you do that on a $99 plan instead of a $999 plan. If your message volume is predictable and the managed allowance covers you, leave BYOK off. Both choices are available on every paid plan.

When BYOK makes sense

📈

The operator running real volume

You’re past the proof-of-concept phase. Your bots are live, handling thousands to tens of thousands of conversations a month. You’ve looked at the math on per-message pricing and decided you want the variable line item to flow through your AI provider account where you can see and control it — not through a platform meter you can’t audit. You flip on BYOK in workspace settings. Connect a key. The platform fee stays flat regardless of how busy your bots get.

💎

The team that wants frontier models without a markup

You sell a high-touch product. Customers expect support to feel as considered as pre-sales. You want a frontier model handling your conversations — pricing questions, technical scoping, escalations — without the per-message platform premium on top of the frontier model’s underlying cost. With BYOK, the frontier-model premium is just the published Anthropic or OpenAI rate. Quality stops being a luxury and starts being a normal line item.

🏢

Standardizing on one AI provider company-wide

Your engineering team already picked an AI provider for everything else. You have a vendor relationship, spending controls, maybe a startup credit programme. You don’t want a second provider on a separate invoice for one tool. Connecting that same key to your workspace consolidates onto the relationship you already manage. Your CFO sees one provider line item across the whole stack.

🧪

A/B testing models honestly

Same workspace, same knowledge, two different models in sequence. Run a week on Sonnet. Switch to Opus for the next week. Compare your customer satisfaction scores, your handoff rates, and the cost difference in your provider dashboard. On per-message pricing, both arms look the same on your invoice regardless of what’s running underneath. With BYOK, both arms produce a real, attributable bill — so you can decide based on numbers instead of vibes.

And when BYOK doesn’t make sense.If you’re running a couple hundred conversations a month, a managed plan is simpler and the math doesn’t move much. The break-even on per-message pricing only shows up at real volume. Don’t enable BYOK just because you can — enable it when you have a reason.

Key handling & trust

Your API key is the credential that controls your spend with Anthropic or OpenAI. We treat it accordingly. When you save a key, it’s encrypted at rest before it touches our database. The key is never returned to the dashboard in plaintext after you save it — we show you a placeholder confirming one is on file, and you can replace it but you can’t read it back. The validation step on save uses the key for a single low-cost test request and discards the in-memory copy immediately.

When any bot in your workspace generates a reply, the key is decrypted in the request path, used to sign the outbound API call, and dropped — it isn’t logged, isn’t echoed in audit trails, and isn’t included in any export. If a key is ever compromised, you rotate it the same way you’d rotate any cloud credential: revoke the old one in your provider’s dashboard, issue a new one, and paste the new one into NebulaHex. The compromised key stops being able to bill you the moment your provider revokes it.

Workspace permissions matter here. Only owners and admins can change the workspace AI configuration. Editors can update knowledge and bot behavior but can’t see or change which key is in use. Viewers have read-only access. Your data is never used to train AI models on our side or your provider’s side — see our privacy policy and terms for the full posture.

Switch to your own key in minutes

If you’re already a NebulaHex customer, the path is short: open Workspace Settings → AI Configuration, pick your provider, paste your key, validate, choose your default model, save. The next reply any bot in your workspace sends is on your account.

Decouple your AI cost from your support seats.

Build bots on the free plan. Pick any paid plan when you’re ready — they all include BYOK. The version of your support stack where the AI bill stops being a black box is one workspace toggle away.

Build your bot for free See pricing →

Pay AI providers directly. No per-message markup.

The chatbot pricing problem

Two pricing shapes

Predictable platform fee

Platform fee + provider’s published rates

Two providers, six models

Claude Haiku

Claude Sonnet

Claude Opus

GPT-4o Mini

GPT-4o

GPT-4.1

How to switch your workspace to your own key

Open AI Configuration in Workspace Settings

Paste your key and validate

Choose your default model and save

What changes the moment you switch

Messages stop being metered on our side

Your model bill is the provider’s published rate

The full feature set unlocks

Included on every paid plan

Managed mode

BYOK mode

When BYOK makes sense

The operator running real volume

The team that wants frontier models without a markup

Standardizing on one AI provider company-wide

A/B testing models honestly

Key handling & trust

Switch to your own key in minutes

Decouple your AI cost from your support seats.