Intellect (AI)

Upgrade your chatbox to a live AI assistant running on your own infrastructure. Local models only — no token costs, no data leaving your stack.

Do You Need Intellect?

Intellect runs AI on infrastructure you pay for. Live mode is always-on; Batch mode is on-demand Spot. Both are fixed costs — you know what you'll pay before you deploy.

Add Intellect if:

  • Visitors are sending messages your team can't keep up with
  • Visitors ask specific, answerable questions about your products, menu, or services
  • You want AI to process content in the background (summarize, translate, extract)

Don't add Intellect if:

  • The chatbox is sitting idle most of the time
  • Your questions are simple enough for an FAQ page or accordion
  • You only want batch processing — Batch mode is significantly cheaper than Live

Start with Core's static chatbox. Add Intellect when you have real volume that justifies the infrastructure.

Ma's Bakery: Real Usage

Ma's Bakery started with a simple contact form — visitors filled it out, Ma responded via email within a day. It worked fine until the bakery went viral on a local social media post.

Overnight, Ma was getting 80–120 chatbox messages a day. Half of them were the same questions:

  • "What's fresh today?"
  • "Do you have anything gluten-free?"
  • "Can I order a custom cake?"
  • "What are your hours on Sunday?"

Ma added Intellect. She wrote a system prompt that introduced the AI as her bakery assistant and gave it access to the week's menu and her FAQ. The AI now handles the first 90% of conversations — the routine ones. When a conversation needs Ma, the AI escalates to her email.

Pa's Donuts hasn't added Intellect — their chatbox gets maybe 3–5 messages a week.

Cousin's Coffeehouse uses Intellect in Batch mode only — they run a nightly job that reads that week's Coffee Notes blog post and generates a newsletter blurb. No always-on instance.

Chatbox Upgrade

Core ships a ChatboxWidget that works as a contact form out of the box. When Intellect is provisioned and OLLAMA_HOST is configured, the chatbox automatically upgrades to a live AI assistant.

Without Intellect: visitor messages are submitted as a contact form.
With Intellect Live: visitor messages get real-time AI responses, streamed from your Ollama instance.

No code changes required — the upgrade is automatic when the env var is set.

Context-Aware AI

The server-side chatbox route can inject context from your own data before sending a message to the model.

Ma's Bakery example: The chatbox route reads Ma's current menu and injects it into the system prompt before every conversation. When someone asks "what's fresh today?", the AI reads the actual menu — not its training data.

If Commerce is provisioned, the chatbox can check your Products table and respond with live inventory:

"We have 6 of those almond croissants left today — would you like to start an order?"

The AI only accesses data you explicitly provide to it. You control what it can see and what it's allowed to say.

Batch Processing

The BatchRunner handles background AI workloads that don't need real-time responses:

  • Nightly content summarization (Cousin's Coffeehouse blog → newsletter blurb)
  • Bulk entity extraction from uploaded documents
  • Page translation into other languages
  • Structured data generation from unstructured input
  • Weekly reporting from Ocean pipeline data

Batch uses Spot EC2 instances — spun up on demand, terminated when done. Significantly cheaper than an always-on Live instance.

Mode & Tier

Intellect runs in one of two modes. The mode determines the cost model and what the instance can do.

Live provisions an always-on EC2 instance at your chosen tier. It handles real-time chatbox traffic continuously, and any batch jobs you queue run on that same instance. No additional infrastructure is needed for background processing if you already have Live running.

Batch provisions Spot EC2 on demand. An SQS trigger spins up an instance, runs the job, and terminates it. This is background-only — no real-time chatbox. The right choice when you want AI capability without the cost of an always-on instance.

If you have Live, you do not need Batch. The instance is already running.

Tiers:

Tier Specs Best for
Lean CPU-only or small GPU (Gemma 2B) Batch-only deployments, dev/eval
Small Entry GPU, 7B-class models Single-user chatbox or light batch
Medium Mid GPU, 13B-class models Moderate chatbox traffic
Large Performance GPU, 30B-class models High-traffic chatbox, multi-user
Max High-end GPU, 70B+ models Enterprise scale

Your tier choice drives your solvency cap directly. A larger instance means more infrastructure cost, which means more ad impressions running your publisher ID before SirSluginston Co earns anything above that line.

Local Models Only

Intellect ships only OllamaProvider. There are no wrappers for OpenAI, Anthropic, Bedrock, or any other cloud AI provider.

The InferenceProvider interface is open (SLIME source-available) — if you want to wire in a cloud AI provider yourself, the interface is there. That integration, and any token costs, are yours to manage.

Why? Cloud AI costs are unpredictable, don't belong in the solvency cap formula, and vary per call. Intellect only works cleanly with infrastructure you own and pay for at a fixed rate.

How to Add Intellect

Intellect is selected during adapter init:

npx @sirsluginston/aws-adapter init
# ...
? AI / Intellect?
  ❯ Live (always-on EC2 — handles chatbox and batch)
    Batch (Spot EC2 — background jobs only, no chatbox)
    No

Then select a tier (Lean → Small → Medium → Large → Max).

Once provisioned, set your model:

OLLAMA_HOST=http://your-ec2-private-ip:11434
OLLAMA_MODEL=mistral

The chatbox upgrades automatically when OLLAMA_HOST is set. No code changes required.

Cost & Solvency Cap

Intellect costs depend on mode and tier. Everything runs through your solvency cap — your ad publisher ID covers the full infrastructure bill before SirSluginston Co earns anything above it. A larger instance means a higher cap, which means more impressions running your publisher ID. The AI earns its keep.

Mode Tier Approximate monthly
Batch (Spot) Lean ~$5–20
Batch (Spot) Small ~$20–60
Live Lean ~$30–80
Live Small ~$100–200
Live Medium ~$200–400
Live Large ~$400–800
Live Max $800+

Batch rows apply to deployments without a Live instance. If you have Live, batch jobs run on that instance — no separate cost.

Actual numbers vary by AWS region and current EC2 pricing. The Cost Explorer Lambda updates your solvency cap within roughly 24 hours of any instance change.

Works With

Intellect + Ocean — The chatbox can query Lighthouse snapshots from your Ocean pipeline to give live, accurate answers. Ma's AI knows today's prices because it reads from Ocean's Lighthouse cache — not from training data.

Intellect + Commerce — The chatbox can check your Products table to answer inventory questions and guide visitors toward checkout. Requires both Intellect and Commerce.

Intellect + Core (Batch only) — Batch mode can process your Core page content — summarizing new blog posts, generating social copy, translating Rich Text blocks. Requires only Core + Batch Intellect.

Loading Advertisement...