Semantic tool dispatch. The agent knows what to do — smallchat figures out which tool does it.
Your agent has 50 tools. The LLM sees all 50 in its context window every single turn, burning tokens and degrading selection accuracy. You write routing logic, maintain tool registries, and pray the model picks the right one.
smallchat compiles your tools into a dispatch table. The LLM expresses intent. The runtime resolves it — semantically, deterministically, in microseconds. No prompt stuffing. No selection lottery.
Dispatch, not retrieval. smallchat is not a knowledge engine or a RAG layer — it doesn't compile documents or answer questions. It compiles which tool to call from a natural-language intent. The data substrate is your agent's tool registry (`.toolkit.json`), not enterprise documents.
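The exact shape of the compiled artifact isn't shown here; as a rough sketch, a toolkit entry might pair a tool's manifest with its embedded vector and origin server. All field names below are hypothetical, not smallchat's documented schema:

```json
{
  "version": "0.5.0",
  "tools": [
    {
      "name": "code.search",
      "description": "Search for code symbols in the repository",
      "vector": [0.12, -0.03, 0.91],
      "server": "loommcp"
    }
  ]
}
```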
```shell
npx @smallchat/core compile --source ~/.mcp.json
```

One command. Point it at your MCP config, a directory of manifests, or any MCP server repo. Out comes a compiled artifact with embedded vectors, dispatch tables, and resolution caching — ready to serve.
An agent in production needs two things: what it knows and what it can do. These are solved at different layers.
| Layer | Problem | Example |
|---|---|---|
| Knowledge layer | Pre-compile documents and data into governed, answer-shaped artifacts. Reduces tokens by moving retrieval upstream from inference time. | RAG pipelines, enterprise knowledge engines |
| Dispatch layer | Pre-compile which tool to call from a natural-language intent. Reduces tokens by moving tool-selection upstream from inference time. | smallchat |
smallchat runs in the agent process — no SaaS dependency, no external round-trip. Your tool registry lives with your agent.
An agent backed by any knowledge engine still has to decide whether to run a query, post to Slack, or write a file. That decision is exactly what smallchat handles. The two layers compose cleanly:
Use your knowledge engine for what the agent knows. Use smallchat for what the agent does.
Get up and running in under a minute:
```shell
# Install smallchat
npm install -g @smallchat/core

# Run the interactive setup wizard
smallchat setup
```

The setup wizard will:
- Discover your existing MCP server configurations (Claude Code, Gemini CLI, OpenCode, Codex, or any `.mcp.json`)
- Compile them into an optimized smallchat toolkit with embedded vectors and dispatch tables
- Optionally replace your `mcpServers` config so all tools are served through smallchat
That's it — your agent now dispatches tools semantically instead of stuffing them all into the context window.
Prefer non-interactive mode? Run `smallchat setup --no-interactive` to auto-detect and compile without prompts.
```shell
npm install @smallchat/core
```

Requires Node.js >= 20.
```shell
# Compile tools from your MCP servers
npx @smallchat/core compile --source ~/.mcp.json

# Ask it a question — see which tool it picks and why
npx @smallchat/core resolve tools.toolkit.json "search for code"

# Start an MCP-compatible server
npx @smallchat/core serve --source ./manifests --port 3001

# Scaffold a new project
npx @smallchat/core init my-app --template agent

# Interactive REPL
npx @smallchat/core repl tools.toolkit.json
```

```typescript
import { ToolRuntime, MemoryVectorIndex, LocalEmbedder } from '@smallchat/core';

const runtime = new ToolRuntime(
  new MemoryVectorIndex(),
  new LocalEmbedder(),
);

const result = await runtime.dispatch('find flights', { to: 'NYC' });

// Fluent API with TypeScript inference
const content = await runtime
  .intent<{ to: string }>('find flights')
  .withArgs({ to: 'NYC' })
  .execContent<FlightResult>();

// Or stream token-by-token
for await (const token of runtime.inferenceStream('find flights', { to: 'NYC' })) {
  process.stdout.write(token);
}
```

- LoomMCP integration guide — Compile LoomMCP's 17 MCP tools through smallchat for semantic dispatch on top of exact-symbol retrieval. See the LoomMCP integration page.
- Synchronized package versions — Every workspace package is now aligned at 0.5.0.
- Refreshed runtime version metadata — MCP server, channel server, MCP client, REPL banner, and compiled artifacts now report 0.5.0.
Plus everything from 0.4.0:
- Confidence-tiered dispatch — Every dispatch returns EXACT/HIGH/MEDIUM/LOW/NONE and branches accordingly
- Resolution proof — Serializable trace documenting why a tool was chosen
- Pre-flight verification —
respondsToSelector:gate between resolution and execution - Intent decomposition —
doesNotUnderstand:handler breaks complex intents into sub-intents - Refinement protocol —
forwardInvocation:dialogue for NONE-confidence dispatches - Observation & adaptation — KVO-inspired observer adapts thresholds in real time
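As a toy sketch of how a caller might branch on those confidence tiers (the tier names come from the changelog above; `handleDispatch` and its branching policy are illustrative, not smallchat's API):

```typescript
// Hypothetical handler: decide what to do with a dispatch result
// based on its confidence tier.
type Tier = 'EXACT' | 'HIGH' | 'MEDIUM' | 'LOW' | 'NONE';

function handleDispatch(tier: Tier, tool: string): string {
  switch (tier) {
    case 'EXACT':
    case 'HIGH':
      return `execute:${tool}`;  // confident match: run the tool directly
    case 'MEDIUM':
      return `verify:${tool}`;   // run a pre-flight verification gate first
    case 'LOW':
      return `confirm:${tool}`;  // ask the agent to confirm before executing
    case 'NONE':
      return 'refine';           // fall back to the refinement dialogue
  }
}
```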
See the full Changelog for details.
smallchat borrows its architecture from the Smalltalk/Objective-C runtime. Tools are objects. Intents are messages. Dispatch is semantic.
The LLM says what it wants. The runtime figures out which tool handles it — using vector similarity, resolution caching, superclass traversal, and fallback chains. No routing code. No tool selection prompts.
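The core mechanic can be sketched in a few lines. This is a toy illustration of similarity-based tool selection, not smallchat's implementation: real embeddings and the caching, traversal, and fallback machinery are replaced here by bag-of-words vectors and a plain argmax. `embed`, `cosine`, and `resolveTool` are hypothetical names:

```typescript
// Toy "embedding": bag-of-words term counts.
function embed(text: string): Map<string, number> {
  const v = new Map<string, number>();
  for (const w of text.toLowerCase().split(/\W+/).filter(Boolean)) {
    v.set(w, (v.get(w) ?? 0) + 1);
  }
  return v;
}

// Cosine similarity between two sparse vectors.
function cosine(a: Map<string, number>, b: Map<string, number>): number {
  let dot = 0, na = 0, nb = 0;
  for (const [w, x] of a) { dot += x * (b.get(w) ?? 0); na += x * x; }
  for (const [, y] of b) nb += y * y;
  return na && nb ? dot / Math.sqrt(na * nb) : 0;
}

// Pick the tool whose description is most similar to the intent.
function resolveTool(intent: string, tools: Record<string, string>): string {
  const iv = embed(intent);
  let best = '', bestScore = -1;
  for (const [name, desc] of Object.entries(tools)) {
    const score = cosine(iv, embed(desc));
    if (score > bestScore) { best = name; bestScore = score; }
  }
  return best;
}

const tools = {
  'code.search': 'search for code symbols in the repository',
  'slack.post': 'post a message to a slack channel',
  'fs.write': 'write a file to disk',
};
// resolveTool('search for code', tools) → 'code.search'
```

The same intent-to-tool question the LLM would otherwise answer in-context is answered here by arithmetic over precomputed vectors, which is what makes microsecond dispatch plausible.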
See the Architecture doc for the full design and the Reference for runtime details, dispatch mechanics, and the concept mapping from Smalltalk/Obj-C to smallchat.
| Command | Description |
|---|---|
| `compile` | Compile manifests into a dispatch artifact |
| `serve` | Start an MCP-compatible server |
| `resolve` | Test intent-to-tool resolution |
| `inspect` | Examine a compiled artifact |
| `doctor` | Check your environment |
| `init` | Scaffold a new project from a template |
| `docs` | Generate Markdown docs from a compiled artifact |
| `repl` | Interactive shell for testing resolution |
| Package | Description |
|---|---|
| `@smallchat/core` | Core runtime, compiler, MCP server, CLI |
| `@smallchat/react` | React hooks: `useToolDispatch`, `useToolStream`, `SmallchatProvider` |
| `@smallchat/nextjs` | Next.js App Router helpers |
| `@smallchat/testing` | `MockEmbedder`, `MockVectorIndex`, assertion helpers |
| `smallchat-vscode` | VS Code syntax highlighting, manifest schema validation, snippets |
| `@smallchat/playground` | Browser-based resolution chain visualizer |
| Doc | What's inside |
|---|---|
| Quickstart | Zero to dispatching in 5 minutes |
| Architecture | Full design document |
| Reference | Runtime, dispatch, streaming, MCP server, CLI details |
| Concept Mapping | Smalltalk/Obj-C → smallchat translation table |
| Migration Guide | Upgrading from 0.1.0 to 0.2.0 |
| LoomMCP integration | Pair smallchat with LoomMCP for semantic dispatch over symbol-level retrieval |
| Changelog | Release history |
```shell
npm test          # 274+ specs
npm run dev       # Watch mode
npm run lint      # Type check
npm run docs:api  # Generate API reference
```

MIT