Jump to section
01
The chatbot pricing problem
Most platforms in this category sell “messages.” A message is whatever they say it is — sometimes an inbound visitor turn, sometimes a model call, sometimes a token bucket. As your volume grows the per-message price doesn’t fall as fast as the underlying token price you’d be paying yourself. You’re not buying compute. You’re buying access through a meter the platform controls.
This is fine when you’re tiny. A few hundred messages a month is a rounding error. The math gets uncomfortable somewhere around the point where your bot starts working — when your widget is converting, your channels are connected, your help center is fully ingested, and your conversation count starts to climb. Suddenly the meter is the dominant line in your support stack. You’re not paying for the platform anymore. You’re paying for somebody else’s margin on tokens you could have bought directly.
There’s a second thing nobody mentions: the meter rarely reflects what the underlying model actually costs. A long reasoning response on a frontier model consumes real tokens. A short factual reply from a small model is essentially free. Per-message pricing flattens that distinction and charges you the same per “message” either way, so the platform makes more on the cheap conversations and you subsidize the expensive ones for them. With BYOK, the cost you see is the cost you actually generated.
02
Two pricing shapes
NebulaHex offers two ways to pay for the AI side of your bots. Both ship the same product. The difference is where the AI compute bill comes from.
Predictable platform fee
Pay a flat monthly platform fee. Messages count against your plan’s monthly allowance. We pick the model. You get one bill from us covering platform and AI compute — everything in a single line.
Best for
- •Low or medium volume
- •Simple setup, hands-off model choice
- •Teams without a preferred AI provider
- •Predictable bill matters more than absolute cost
Platform fee + provider’s published rates
Same flat platform fee, plus your own Anthropic or OpenAI bill at the provider’s published per-million-token rates. We don’t meter messages on our side — the provider does, and you see their dashboard, not ours.
Best for
- •Real volume (thousands of conversations a month)
- •Teams who want to pick their own model
- •Existing AI provider contract or credit programme
- •Compliance / procurement consolidation
Neither shape is universally better. The default plans are simpler and the right call for most teams starting out. BYOK becomes the better economics once your volume crosses a threshold or your team has reasons to want direct provider billing — cost transparency, compliance, model choice, an existing provider relationship.
03
Two providers, six models
Across Anthropic and OpenAI, the BYOK lineup covers the speed-versus-quality range most teams care about. Your workspace picks one default model, and you can change it any time without losing conversation history, knowledge sources, or customizations — the only thing that changes is which model produces the next reply.
Claude Haiku
Fast · Lowest token cost
High-volume support bots where most questions are routine FAQ-style lookups and latency matters.
Speed
Quality
Claude Sonnet
Balanced · Speed + reasoning
Bots that follow longer policies, handle nuanced phrasing, or hold conversations beyond simple lookups.
Speed
Quality
Claude Opus
Frontier · Quality-first
Bots where being subtly wrong would cost you a customer. Reaches for complex multi-step reasoning.
Speed
Quality
GPT-4o Mini
Cost-optimized small model
High-volume bots that mostly need to read your knowledge base accurately and reply quickly.
Speed
Quality
GPT-4o
Workhorse · General-purpose
Fast, capable across a wide range of tasks. Comfortable middle ground between cost and quality.
Speed
Quality
GPT-4.1
Long context · Strong instructions
When your bot needs to handle long policy documents in context or follow tight formatting instructions.
Speed
Quality
Speed and Quality on the cards reflect general positioning within each provider’s lineup, not a head-to-head ranking. The right model is the one that fits your bots’ job. Most teams settle on Haiku or GPT-4o Mini for high-volume workspaces, Sonnet or GPT-4o for general purpose, and Opus or GPT-4.1 for workspaces where quality is the constraint.
04
How to switch your workspace to your own key
The mechanics are short and reversible. Three steps once you’re on any paid plan.
Open AI Configuration in Workspace Settings
Navigate to your workspace’s Settings → AI Configuration. The controls are available on every paid plan; no plan switch needed. Toggle BYOK on, and your bots start using your provider key for every reply.
Paste your key and validate
Open Workspace Settings → AI Configuration. Pick Anthropic or OpenAI. Paste your API key into the field. Click Validate Key — the platform makes a real, low-cost test call against your key to confirm it works and reports back which models you have access to. Validation happens before save, so a broken key never lands in production.
Choose your default model and save
Pick from the validated model list. Save. Every bot in your workspace immediately starts using your key for new replies. No warm-up period. No propagation delay. The next conversation a customer starts is on your account.
Your key is encrypted before it’s stored and never returned to the dashboard in plaintext after you save it. If you need to rotate it, paste a new key and re-validate — the old one is replaced atomically. If you want to leave BYOK, toggle it off in workspace settings. You stay on your current plan; the AI billing flips back to managed. There’s no migration step, no data movement, no waiting period.
05
What changes the moment you switch
From your customers’ side, the change is invisible — the widget looks the same, the inbox looks the same, the conversation history looks the same. But the underlying economics are now yours.
Messages stop being metered on our side
We don’t count them against a monthly cap. We don’t notify you at 80% or 95% of plan usage. The platform plan is a flat fee, and that fee is the entire bill from NebulaHex regardless of whether your workspace handles a thousand conversations or a hundred thousand.
Your model bill is the provider’s published rate
When any bot in your workspace replies, the API call goes from our backend to your AI provider, signed with your key. The provider charges you directly at their public per-million-token rate. You see exactly what you’re spending in the Anthropic Console or the OpenAI dashboard. Spending caps live in your provider account.
The full feature set unlocks
BYOK isn’t a billing change with strings attached. Every paid plan already ships with all five channels, every CRM and helpdesk integration, live chat handoff, the Google Sheets connector, custom CSS and webhooks, and priority support — flipping BYOK on doesn’t remove or unlock anything else. It only changes where the AI cost shows up.
Most BYOK customers running modest volume see a small monthly AI bill at the provider, paid directly to Anthropic or OpenAI. The platform fee from us stays flat regardless. Both providers let you set spending caps in your account, and we recommend turning that on the moment you connect a key.
06
Included on every paid plan
BYOK is a feature on every paid plan, not its own tier. On Growth, Pro, Business, or Enterprise, you can flip BYOK on at the workspace level and your bots immediately start signing requests with your provider key. Flip it off, and your bots go back to managed AI using NebulaHex’s models against your plan’s monthly message allowance.
Two billing modes, every paid plan
Managed mode
NebulaHex picks the model. Messages count against your plan’s monthly allowance. One bill from us covering platform and AI.
Optional
BYOK mode
You bring an Anthropic or OpenAI key. Your bots use that key for every reply. Your provider bills you at published rates. Platform fee unchanged.
Every paid plan ships with both options. Switch any time from workspace settings.
The difference between “managed” and “BYOK” isn’t your subscription — it’s where the AI cost shows up. Managed: in your NebulaHex bill. BYOK: in your Anthropic or OpenAI bill. The platform fee stays the same regardless.
If your team needs Claude Opus on every reply, BYOK lets you do that on a $99 plan instead of a $999 plan. If your message volume is predictable and the managed allowance covers you, leave BYOK off. Both choices are available on every paid plan.
07
When BYOK makes sense
The operator running real volume
You’re past the proof-of-concept phase. Your bots are live, handling thousands to tens of thousands of conversations a month. You’ve looked at the math on per-message pricing and decided you want the variable line item to flow through your AI provider account where you can see and control it — not through a platform meter you can’t audit. You flip on BYOK in workspace settings. Connect a key. The platform fee stays flat regardless of how busy your bots get.
The team that wants frontier models without a markup
You sell a high-touch product. Customers expect support to feel as considered as pre-sales. You want a frontier model handling your conversations — pricing questions, technical scoping, escalations — without the per-message platform premium on top of the frontier model’s underlying cost. With BYOK, the frontier-model premium is just the published Anthropic or OpenAI rate. Quality stops being a luxury and starts being a normal line item.
Standardizing on one AI provider company-wide
Your engineering team already picked an AI provider for everything else. You have a vendor relationship, spending controls, maybe a startup credit programme. You don’t want a second provider on a separate invoice for one tool. Connecting that same key to your workspace consolidates onto the relationship you already manage. Your CFO sees one provider line item across the whole stack.
A/B testing models honestly
Same workspace, same knowledge, two different models in sequence. Run a week on Sonnet. Switch to Opus for the next week. Compare your customer satisfaction scores, your handoff rates, and the cost difference in your provider dashboard. On per-message pricing, both arms look the same on your invoice regardless of what’s running underneath. With BYOK, both arms produce a real, attributable bill — so you can decide based on numbers instead of vibes.
08
Key handling & trust
Your API key is the credential that controls your spend with Anthropic or OpenAI. We treat it accordingly. When you save a key, it’s encrypted at rest before it touches our database. The key is never returned to the dashboard in plaintext after you save it — we show you a placeholder confirming one is on file, and you can replace it but you can’t read it back. The validation step on save uses the key for a single low-cost test request and discards the in-memory copy immediately.
When any bot in your workspace generates a reply, the key is decrypted in the request path, used to sign the outbound API call, and dropped — it isn’t logged, isn’t echoed in audit trails, and isn’t included in any export. If a key is ever compromised, you rotate it the same way you’d rotate any cloud credential: revoke the old one in your provider’s dashboard, issue a new one, and paste the new one into NebulaHex. The compromised key stops being able to bill you the moment your provider revokes it.
Workspace permissions matter here. Only owners and admins can change the workspace AI configuration. Editors can update knowledge and bot behavior but can’t see or change which key is in use. Viewers have read-only access. Your data is never used to train AI models on our side or your provider’s side — see our privacy policy and terms for the full posture.
09
Switch to your own key in minutes
If you’re already a NebulaHex customer, the path is short: open Workspace Settings → AI Configuration, pick your provider, paste your key, validate, choose your default model, save. The next reply any bot in your workspace sends is on your account.
Decouple your AI cost from your support seats.
Build bots on the free plan. Pick any paid plan when you’re ready — they all include BYOK. The version of your support stack where the AI bill stops being a black box is one workspace toggle away.