Forge

Forge is a lightweight LLM-based agent runtime built around a router + tool execution model. Instead of relying on a monolithic prompt or complex reasoning loops, Forge delegates actions to explicit tools selected by a structured LLM router.

Core Concept

Forge works in 3 steps:

User Input
   ↓
LLM Router (JSON output)
   ↓
Tool Dispatcher
   ├── chat
   └── code

The model does not “decide freely” — it must output a strict JSON instruction describing which tool to use.

Router Format

The LLM must return:

{
  "tool": "chat" | "code",
  "content": "string"
}

Example

Input:

write a python script to print hello world

Output:

{
  "tool": "code",
  "content": "print('Hello World')"
}

Available Tools

chat

Used for conversational responses.

chat(content: str) -> str

code

Used to generate executable Python code.

generate_code(content: str) -> str

Architecture

Forge v2.2 ("Clean Runtime") enforces a strict separation between the three layers: the LLM (router prompt + providers), tools (dispatch + handlers), and logs (the only place anything is printed). The orchestrator is the single point where they meet, and it is the only place a loop guard (MAX_STEPS) can ever apply.

src/forge/
│
├── orchestrator.py    # the only orchestrator (agent.py is a compat alias)
├── llm.py             # LLM dispatch — never called from anywhere else
├── config.py          # the only module that reads os.getenv()
├── logger.py           # the only module allowed to print/log
├── errors.py           # typed exception hierarchy (ForgeError, ...)
├── types.py             # RouterDecision / ToolResult / AgentResult dataclasses
│
├── router/
│   ├── prompt.py        # router prompt template — isolated, nothing else builds prompts
│   └── parser.py         # raw LLM text -> RouterDecision
│
├── tools/
│   ├── registry.py        # discovery; failures are logged, never swallowed
│   ├── chat.py
│   ├── code.py
│   └── files.py / git.py / shell.py   # roadmap stubs, no run() yet --
│                                       # and won't be reachable just
│                                       # for having one (ENABLED_TOOLS)
│
├── memory.py             # JSON-backed rolling history + key/value facts
│                          # (see Memory section below)
│
└── providers/
    ├── llama_cpp.py
    ├── ollama.py
    └── openrouter.py

Data flow:

user_input
   ↓
Orchestrator._route()      -> RouterDecision   (LLM layer)
   ↓
Orchestrator._dispatch()   -> ToolResult        (tools layer)
   ↓
AgentResult                                      (returned to caller)

Every step along the way emits a structured event through log.event() (visible only when SHOW_DEBUG=true), and is bounded by MAX_STEPS with cycle detection so the same (tool, content) pair can never be dispatched twice in a single run.

Usage

podman run -it --rm \
  --env-file .env.local \
  -v $(pwd)/data:/app/data \
  forge-core

Configuration

Environment variables:

Variable	Description	Default
FORGE_PROVIDER	LLM backend (ollama, llama_cpp, openrouter)	llama_cpp
LLM_MODEL	Model name	default
OLLAMA_URL	Ollama endpoint	http://127.0.0.1:11434/api/generate
LLAMA_CPP_URL	llama.cpp endpoint	http://127.0.0.1:8080
OPENROUTER_URL	OpenRouter endpoint	https://openrouter.ai/api/v1/chat/completions
OPENROUTER_API_KEY	OpenRouter API key	(empty)
MAX_STEPS	Hard ceiling on router→tool steps per run (loop guard)	1
MEMORY_ENABLED	Persist and recall conversation history across turns	true
MEMORY_FILE	Path to the JSON memory file (mount a volume here for persistence)	data/memory.json
MEMORY_MAX_HISTORY	Number of past messages kept and replayed to the router	20
ENABLED_TOOLS	Comma-separated allowlist of dispatchable tools. Having a run() function is not enough -- a tool must also be listed here	chat,code
SHOW_DEBUG	Emit structured debug trace (router prompt, raw output, tool dispatch, timings)	false

Memory

Forge keeps a rolling window of the last MEMORY_MAX_HISTORY messages in MEMORY_FILE (JSON, mounted via -v $(pwd)/data:/app/data) and replays it as context in the router prompt on every turn, so it can answer "do you remember..." correctly.

It's plain JSON, not a database. For a single-user, single-process local runtime, that's a feature, not a shortcut: no schema, no migrations, cat data/memory.json to debug it directly. The module also exposes a facts key/value store (add_fact / get_facts) for durable facts, not yet wired into the router prompt — that's the natural next step if/when a "remember this" tool is added. If Forge ever needs concurrent writers or queries beyond "last N messages", that's when SQLite earns its place — not before.

Design Philosophy

Forge is built on three principles:

Deterministic routing over free-form reasoning Explicit tool execution instead of implicit behavior Minimal agent core, maximal extensibility

The goal is to make LLM behavior predictable, debuggable, and composable.

Roadmap

Planned extensions:

memory tool (persistent context)
filesystem tool (read/write local files)
shell tool (sandboxed execution)
multi-agent routing layer
plugin-based tool registry

Status

Forge v2 is an experimental runtime, not a production framework. Expect frequent breaking changes.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
src/forge		src/forge
tests		tests
.gitignore		.gitignore
Containerfile		Containerfile
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Forge

Core Concept

Router Format

Available Tools

Architecture

Usage

Configuration

Memory

Design Philosophy

Roadmap

Status

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Forge

Core Concept

Router Format

Available Tools

Architecture

Usage

Configuration

Memory

Design Philosophy

Roadmap

Status

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages