Knowledge Base — Train your AI bot without engineers

The knowledge problem nobody warns you about

A chatbot is only as useful as what it knows. A bot that hasn’t been taught your refund policy will guess, hedge, or invent one — and the customer ends up arguing with the human you wanted to protect. A bot that has never seen your product catalog will recommend products you don’t sell. A bot trained on the open internet will sound confident about your business while saying things that are not true.

The fix is to ground the bot in your own content. That step — getting your content into a form the bot can search through and quote from — is where most chatbot projects either stall for months or end up depending on engineering time you don’t have. NebulaHex collapses the whole pipeline into a single screen with an Add Source button. You upload, paste, or point. We do the rest.

Every shape your knowledge lives in

A real knowledge base is messy. Some content is in Google Docs that someone exported to PDF. Some lives on your help center. Some is in a deck the team uses for onboarding. Some is in a video your founder recorded last year. NebulaHex handles all of those shapes without asking you to flatten everything into one format first.

Pre-check

https://agilehelix.com/help

✓ CRAWLABLE · 47 pages · JS rendered

Advanced

Max pages: 25

Depth: Normal

/blog/* · /admin/*

Website crawl

URLs · sitemaps · JS-rendered

Paste a URL and NebulaHex walks the site. A pre-check tells you whether it’s crawlable, how many words the page has, and whether the content is JavaScript-rendered. For JS-heavy sites we offer a heavier-duty crawler that renders the page properly so you don’t end up importing empty shells.

→ Three controls: max pages (5–100), depth (Shallow / Normal / Deep), exclude patterns
→ Pre-check before the real crawl so you know what you’re getting
→ Live progress view: pages get found and processed in real time
→ Pick what to import from a checklist; thin pages are flagged Low content
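
The crawl controls above can be pictured as a filter over the URLs a crawler discovers. This is a minimal sketch, assuming exclude patterns are glob-style and matched against the URL path. The function name `select_pages`, its parameters, and the sample sub-pages are hypothetical, and this is not a description of NebulaHex's actual crawler.

```python
from fnmatch import fnmatch
from urllib.parse import urlparse


def select_pages(urls, exclude_patterns, max_pages):
    """Keep URLs whose path matches no exclude pattern, up to max_pages."""
    selected = []
    for url in urls:
        # Stop once the page budget is used up
        if len(selected) >= max_pages:
            break
        path = urlparse(url).path or "/"
        # Skip any page whose path matches an exclude pattern such as /blog/*
        if any(fnmatch(path, pattern) for pattern in exclude_patterns):
            continue
        selected.append(url)
    return selected


# Hypothetical discovered URLs; only the /help page appears in the example above
found = [
    "https://agilehelix.com/help",
    "https://agilehelix.com/blog/launch-notes",
    "https://agilehelix.com/admin/login",
    "https://agilehelix.com/help/refunds",
]
pages = select_pages(found, ["/blog/*", "/admin/*"], max_pages=25)
print(pages)
```

With these inputs, only the two `/help` URLs survive the filter. Depth is left out of the sketch because it depends on how links are followed, which the page does not describe.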

PDF · Tables preserved
DOCX · Word styles kept
PPTX · Speaker notes
CSV · Columns as schema

File uploads

CSV · PDF · DOCX · PPTX

Drop in CSV, PDF, DOCX, or PPTX files and the bot reads them directly. No need to convert anything to plain text first. Use drag-and-drop or the file picker. Text is extracted, structure is preserved where it matters, and content is processed into searchable chunks within seconds to minutes.

→ PDFs: policy docs, manuals, reports, scanned content
→ CSVs: product catalogs, structured FAQs, member lists
→ DOCX: upload directly from Word, no export step
→ PPTX: onboarding decks, training material, sales collateral
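
To make "columns as schema" concrete, here is a small sketch of how a CSV's header row can label every value in each row before the row is indexed as text. It assumes a simple `column: value` rendering. The function `rows_to_passages` and the sample catalog are invented for illustration and do not reflect NebulaHex's internal format.

```python
import csv
import io


def rows_to_passages(csv_text):
    """Render each CSV row as 'column: value' text, using the header row as field names."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return ["; ".join(f"{col}: {val}" for col, val in row.items()) for row in reader]


# Made-up product catalog for illustration
catalog = "name,price,in_stock\nCanvas tote,18.00,yes\nEnamel mug,12.50,no\n"
passages = rows_to_passages(catalog)
for passage in passages:
    print(passage)
```

Keeping the column name next to each value means a question such as "is the enamel mug in stock?" can match a passage that carries both the product name and the stock field.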

✓ TRANSCRIPT EXTRACTED

Playlist · 8 of 12 processed

✓ Onboarding walkthrough · 8:42

✓ Feature deep dive: Templates · 14:21

○ Customer Q&A session #3 · 22:08

YouTube

Single videos · playlists (up to 50)

Paste a video URL or a playlist URL. Transcripts get extracted and indexed alongside everything else. The founder explainer on your About page, the product walkthrough on your YouTube channel, the recorded webinar nobody is going to read the transcript of — all become chat-answerable.

→ Single videos: demos, explainers, founder talks
→ Playlists: up to 50 videos in one go (a whole webinar series)
→ Bot quotes from spoken content the way it would from any other source
→ No manual transcription. No video editing tooling. Paste the link.

Google Sheets · SHIPPED

Shopify · SHIPPED

REST API · COMING SOON

Live data

Google Sheets · Shopify · REST API

Different category. Static sources are uploaded once and answer from a snapshot. Live data sources are queried at the moment the customer asks. Order status from Shopify. Inventory from a Google Sheet. Records from a custom REST endpoint. The reply is grounded in the value that exists right now, not last Tuesday.

→ Google Sheets: 4-step configuration with searchable / visible / verification fields
→ Shopify: 11 store actions across orders, inventory, products, customers
→ REST API connector: wire any HTTP endpoint to the bot for in-conversation lookups (on the roadmap). This is separate from the NebulaHex REST API for programmatic control, which is shipped today.
→ Read-only by default · verification questions · rate-limited

Full Live Data page →

Name (optional)

Weekend orders FAQ

Content

Q: Do you ship on weekends?
A: Orders placed Friday after 3pm ship Monday morning, tracked via Royal Mail.

Q: Can I rush a weekend order?
A: Email orders@... before noon Saturday.

Plain text paste

Q&A · markdown · plain text

Sometimes the knowledge you need isn’t a file or a URL. It’s three paragraphs of “this is how we handle weekend orders,” or a quick Q&A you wrote in Notion that has not been published anywhere. Pick Paste Text, give it a name, drop the content in, save. The bot has it in seconds.

→ Q&A format: question / answer pairs the bot can quote from cleanly
→ Markdown: paste straight from your editor, structure preserved
→ Plain text: anything you can type or paste
→ Auto-named with a timestamp if you don’t want to name it yourself
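
To show why the Q&A format quotes cleanly, the sketch below splits pasted text into question and answer pairs. It assumes each question starts with `Q:` and each answer with `A:` on its own line, as in the weekend orders example. `parse_qa` is a hypothetical helper, not part of the product.

```python
def parse_qa(text):
    """Turn 'Q: ...' / 'A: ...' lines into a list of (question, answer) pairs."""
    pairs = []
    question = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("Q:"):
            question = line[2:].strip()
        elif line.startswith("A:") and question is not None:
            # Pair the answer with the most recent unanswered question
            pairs.append((question, line[2:].strip()))
            question = None
    return pairs


pasted = """Q: Do you ship on weekends?
A: Orders placed Friday after 3pm ship Monday morning, tracked via Royal Mail.

Q: Can I rush a weekend order?
A: Email orders@... before noon Saturday.
"""
pairs = parse_qa(pasted)
for question, answer in pairs:
    print(question, "->", answer)
```

Each pair is already a self-contained unit, so one question maps to one quotable answer.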

COMING SOON · Pages and databases from your Notion workspace

Notion

Pages · databases · exports

Import directly from your Notion workspace once we land it. The picker entry is visible in the dashboard today with a Coming soon label — we’d rather show you what we’re working on than pretend it isn’t in development. The integration will sync pages and databases the same way other sources stay fresh.

→ OAuth-based connection to your Notion workspace
→ Pages and databases as separate source types
→ Sync cadence comparable to the URL crawler

COMING SOON · Folder sync with file-watch updates

Dropbox

Folder sync · file watch

Sync entire folders. When a file changes in Dropbox, the bot’s knowledge updates. Useful when your team already manages source documents in Dropbox and you don’t want to re-upload every time something changes. Visible in the picker with a Coming soon label.

→ Folder-level sync, not file-by-file
→ Watch mode: changes in Dropbox propagate to the bot automatically
→ Same encryption + role-based access as every other source

You can mix as many sources as you like inside one bot. Most growing businesses end up with a help center crawl, two or three policy PDFs, a product CSV, and a YouTube playlist of explainers — five to ten sources covering tens or hundreds of pages of content, all answerable through a single chat surface.

How your content becomes answers

You don’t need to understand the mechanics to use the product, but a sentence of plain-language detail helps. When you add a source, NebulaHex splits it into smaller passages — chunks — each sized to be a self-contained idea. The split is automatic and tuned so that an answer to a typical customer question lands inside one or two chunks rather than being scattered across a whole document.
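
For the curious, the splitting step can be sketched in a few lines. This is a simplified illustration that packs whole paragraphs into chunks under a character limit. The limit, the function name `chunk_text`, and the sample text are assumptions, and NebulaHex's own tuning is not described here.

```python
def chunk_text(text, max_chars=500):
    """Pack whole paragraphs into chunks, aiming for at most max_chars each."""
    chunks = []
    current = ""
    for paragraph in text.split("\n\n"):
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= max_chars:
            # The paragraph still fits, so keep growing the current chunk
            current = candidate
        else:
            if current:
                chunks.append(current)
            # A paragraph longer than max_chars becomes its own oversized chunk
            current = paragraph
    if current:
        chunks.append(current)
    return chunks


# Made-up policy text for illustration
doc = (
    "Refunds are accepted within 30 days.\n\n"
    "Shipping is free over £50.\n\n"
    "Support is open weekdays."
)
chunks = chunk_text(doc, max_chars=60)
for chunk in chunks:
    print(repr(chunk))
```

Splitting on paragraph boundaries rather than at a fixed character count keeps each chunk readable on its own, which is the property the description above relies on.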

Each chunk is then indexed in a way that lets the bot search the entire knowledge base by meaning, not just by exact keyword match. When a customer asks a question, the bot finds the chunks most relevant to what they’re actually asking — not what they happened to type — and uses those chunks to write its reply.

Example

Is shipping free if I spend over fifty quid?

Yes — complimentary delivery on orders above £50, anywhere in the UK.

The customer said “free” and “fifty quid.” The policy says “complimentary delivery” and “£50.” The bot found it anyway, because meaning matches across phrasing.
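
One common way to search by meaning is to turn each passage and each question into a vector of numbers and rank passages by cosine similarity. The page does not say which technique NebulaHex uses, so treat the sketch below as a generic illustration. The three-dimensional vectors are hand-made toy values, not the output of any real model, and `top_chunks` is a hypothetical helper.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_chunks(query_vec, chunk_vecs, k=1):
    """Return the k chunk names whose vectors are most similar to the query."""
    ranked = sorted(
        chunk_vecs,
        key=lambda name: cosine(query_vec, chunk_vecs[name]),
        reverse=True,
    )
    return ranked[:k]


# Toy vectors invented for illustration (not model output)
chunk_vecs = {
    "delivery policy": [0.9, 0.1, 0.0],
    "refund policy": [0.1, 0.9, 0.1],
    "opening hours": [0.0, 0.1, 0.9],
}
# Stands in for the question "Is shipping free if I spend over fifty quid?"
query_vec = [0.8, 0.2, 0.1]
best = top_chunks(query_vec, chunk_vecs)
print(best)
```

The ranking never compares words directly, which is why a question and a policy can match even when they share no exact phrasing.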

Keeping knowledge fresh

A static knowledge base goes stale fast. You change your refund window. You launch a new product line. You update the shipping policy because a courier rate changed. The bot needs to know.

Every URL source has an Auto-sync control with four settings: Off, Daily, Weekly, and Monthly. Pick the cadence that matches how often the source actually changes. The crawler re-runs on schedule, picks up the new content, and updates the bot’s knowledge automatically. If you can’t wait — you just updated a page and want the change live now — the Sync Now button runs the crawl immediately.
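
The four Auto-sync settings amount to a small scheduling rule. The sketch below computes the next run from the last sync time. It assumes "Monthly" means 30 days and that the schedule is measured from the previous sync, neither of which is stated above. `INTERVALS` and `next_sync` are hypothetical names.

```python
from datetime import datetime, timedelta, timezone

# Assumed intervals; "Monthly" is approximated as 30 days in this sketch
INTERVALS = {
    "Daily": timedelta(days=1),
    "Weekly": timedelta(weeks=1),
    "Monthly": timedelta(days=30),
}


def next_sync(last_synced, cadence):
    """Return the next scheduled sync time, or None when auto-sync is Off."""
    if cadence == "Off":
        return None
    return last_synced + INTERVALS[cadence]


# Arbitrary example timestamp in UTC
last = datetime(2024, 1, 1, 4, 0, tzinfo=timezone.utc)
print(next_sync(last, "Weekly"))
print(next_sync(last, "Off"))
```

Sync Now would sit outside this rule: it runs the crawl immediately regardless of the schedule.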

help.agilehelix.com

89 chunks · Last synced 2 hours ago

Auto-sync schedule

Off

Daily

Weekly

Monthly

Next sync: tomorrow at 04:00 UTC

For files and pasted text, freshness works differently: replace the source. Delete the old one, upload the new one, and the bot is updated. Most teams build a habit around this: the document the bot reads is the same document the team edits, so updating the bot is one extra step on top of an edit you were going to make anyway.

Source management at a glance

The knowledge dashboard is built so a non-technical team member can run it confidently. Each source has a status pill — Pending, Processing, Ready, or Error — so you always know what state it’s in. A Processing source shows a spinner. A source that failed for any reason shows the error message inline so you can fix it without digging into logs.

Status at a glance

Each source shows Pending / Processing / Ready / Error. The dashboard header counts sources and total chunks indexed.

Rename inline

Default names take filenames, domains, or playlist titles. Edit any name with a click, which is useful when you have ten “Shipping policy” PDFs from different vintages.

Two-step delete

Deleting a source removes its chunks immediately and the bot stops drawing on it. A confirmation step prevents misclicks.

Page-level visibility

For website sources, expand the page list to see every page imported with its title, URL, and chunk count. Spot extraction problems before customers do.

Good chatbots are auditable. If a customer ever gets an answer that surprises you, scan the chunks for that source and you can see exactly what content produced it. The dashboard’s reader view shows the extracted content the way the bot sees it — the same passages, the same structure.

Live data, not just static knowledge

The knowledge base is one half of how the bot works. The other half is live data — the ability to look up things that change in real time, mid-conversation.

STATIC · Knowledge sources

How your business works

Policies, product docs, FAQs, recorded videos. The kind of content that doesn’t change every hour. Uploaded once (or auto-synced periodically), available to every conversation.

“What’s your refund policy?”

LIVE · Real-time lookups

What’s happening right now

The bot looks up the actual data at the moment of the conversation. An order’s shipping status. A deal’s stage. A ticket’s current state. Powered by the integrations catalog.

“Where’s my order?”

Live data flows through the integrations catalog: look up real orders in Shopify, real tickets in your helpdesk, real records in your CRM. The Live Data picker in the dashboard also handles direct connections — Google Sheets, product catalogs, custom API endpoints — for the cases where you don’t need a full integration but you do need the bot to read something that updates regularly.

Static knowledge tells the bot how your business works. Live data tells the bot what’s happening in your business right now. Together, they cover the substantial majority of questions a customer-facing chatbot ever needs to answer.

Use cases your team will recognize

📚

The company with a real help center

You already have a help center. Customers don’t read it. They open the chat instead. Crawl the help center into your bot and every published article becomes a source the bot can answer from. You stop being judged on what’s in the help center and start being judged on whether the answer is right, which it is, because it came from the article you wrote.

🧰

The team with sprawling product documentation

You sell software, hardware, or a complex service. You have a docs site, a couple of PDFs, and an internal Notion. Crawl the public docs. Upload the PDFs. Paste relevant Notion paragraphs as text. The bot answers product questions across all three without the customer needing to know which doc they should be looking in.

📌

The growing business with FAQs everywhere

FAQs in your terms, FAQs at the bottom of product pages, FAQs your founder pasted in Slack two years ago, FAQs your customer success team built up in their heads. NebulaHex is the consolidation layer. Paste them in. The bot becomes the single answer surface for the FAQs that were scattered across your business.

🎬

The team that creates video knowledge

You record demos. You run webinars. You have a YouTube channel where the founder answers customer questions on camera. None of it is searchable. Importing the videos turns spoken knowledge into chat-answerable knowledge. Customers ask the question. The bot answers from what was said in the video.

🔄

The seasonal business with rotating knowledge

You ship products in waves — a clothing brand with seasonal drops, an event company with a calendar that turns over every few months, a school with a yearly programme. Auto-sync handles the recurrence: the bot’s knowledge follows your site as the site changes. You do not manually re-import anything.

Why NebulaHex over building it yourself

You can build this in-house. Many teams have. We’ve talked to enough of them to know how it usually goes: it takes a quarter, it costs more than the build estimate suggested, the maintenance burden eats into the next quarter, and it ends up looking suspiciously like what we ship — except yours has more bugs and your engineers are tired.

NebulaHex sits at the intersection: zero engineering, knowledge-grounded answers, and source management any team member can run. You feed it anything. It learns everything. Auto-sync keeps the bot up to date as your content changes, the way it was meant to be from the start.

Trust & privacy

Your knowledge content is encrypted at rest and in transit. Access inside your team is role-based: Owner, Admin, Editor, and Viewer roles let you decide who in the workspace can edit knowledge versus who can only see it. Sensitive actions require confirmation.

We do not use your content to train AI models, and we do not share it across customers. When you delete a knowledge source, the chunks the bot was drawing on are removed and the bot stops referencing that content in subsequent conversations. The same applies when you delete a bot or close your account. See our privacy policy and terms.

Get your bot trained in an afternoon

The whole point of this product is that the gap between “we have content” and “we have a bot answering customers from that content” should be measured in minutes, not months. You don’t need to hire an engineer. You don’t need to learn what chunking is.

Upload. Paste. Point. Done.

The first source can be added inside the first ten minutes of signing up. The first time the bot answers something only your business could answer is the moment the rest of the work makes sense.

Build your bot for free Or see integrations →