AI SOC Alert

AI SOC Alert is a multi-agent incident investigation platform for security alerts. It combines deterministic detection logic, governed LLM reasoning, enrichment, response playbook generation, human approval gates, and MCP tool exposure behind a FastAPI service.

The project is structured as an AI engineering system rather than a prompt-only prototype. The core design separates orchestration, model access, guardrails, governance, persistence, API delivery, MCP delivery, and evaluation so each layer can be tested and replaced independently.

Architecture

Security alert
    |
    v
FastAPI / MCP tool interface
    |
    v
Investigation service
    |
    v
Supervisor agent
    |
    +--> Input guardrails
    +--> Deterministic rules engine
    +--> LLM triage agent
    +--> Enrichment agent
    +--> Response agent
    +--> Output guardrails
    +--> HITL approval workflow
    |
    v
Investigation report, trace, recommendations, metrics

Core Capabilities

Multi-agent alert investigation coordinated by a supervisor agent.
Deterministic rules for high-confidence security patterns before LLM escalation.
LLM triage with MITRE ATT&CK mapping, confidence scoring, and structured JSON outputs.
Threat enrichment abstraction for IPs, domains, and hashes.
Response playbook generation with action risk classification.
Human-in-the-loop approval gates for high-risk or destructive actions.
Input guardrails for prompt injection and oversized payloads.
Output guardrails for hallucinated indicators, severity downgrades, invalid confidence, and incomplete actions.
Token and call budget enforcement per investigation.
Structured JSON logging with trace IDs, latency, token usage, and decision source.
FastAPI endpoints for application integration.
MCP server exposing investigation tools to AI clients such as VS Code, Claude Desktop, and MCP Inspector.
Evaluation harness with golden alerts for regression testing investigation quality.
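
As an illustration of the output guardrails listed above, the following is a minimal sketch, not the repository's implementation. The `Report` dataclass, its field names, the severity scale, and `validate_report` are all hypothetical; the sketch only shows the kind of checks described (indicators that do not appear in the alert, severity downgrades, confidence outside the 0 to 1 range, and actions missing required fields).

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical severity scale, ordered from least to most severe.
SEVERITY_ORDER = ["low", "medium", "high", "critical"]


@dataclass
class Report:
    """Hypothetical report shape used only for this sketch."""
    severity: str
    confidence: float
    indicators: list = field(default_factory=list)
    actions: list = field(default_factory=list)


def validate_report(report: Report, alert_text: str,
                    rule_severity: Optional[str] = None) -> list:
    """Return a list of guardrail violations; an empty list means the report passes."""
    violations = []

    # Hallucinated indicators: each reported indicator must appear in the alert text.
    for indicator in report.indicators:
        if indicator not in alert_text:
            violations.append(f"hallucinated indicator: {indicator}")

    # Severity downgrade: the report may not rank below a deterministic rule verdict.
    if rule_severity is not None:
        if SEVERITY_ORDER.index(report.severity) < SEVERITY_ORDER.index(rule_severity):
            violations.append("severity downgrade")

    # Invalid confidence: must be a probability-like value in [0, 1].
    if not 0.0 <= report.confidence <= 1.0:
        violations.append("invalid confidence")

    # Incomplete actions: every action needs a description and a risk level.
    for action in report.actions:
        if not action.get("description") or not action.get("risk"):
            violations.append("incomplete action")

    return violations
```

A caller would typically block or flag the report when the returned list is non-empty, rather than silently correcting it.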

Repository Layout

backend/app/api/              FastAPI routes
backend/app/agents/           Triage, enrichment, response, and supervisor agents
backend/app/core/             Configuration, LLM client, logging, deterministic rules
backend/app/evals/            Golden-alert evaluation harness
backend/app/governance/       Budgets, permissions, audit log, human approvals
backend/app/guardrails/       Input and output safety controls
backend/app/integrations/     MCP server and connector abstractions
backend/app/investigations/   Shared investigation service and store
backend/tests/                API, governance, guardrail, observability, and MCP tests

Runtime Interfaces

FastAPI

The HTTP API is intended for service-to-service or dashboard integration.

uv run uvicorn main:app --reload

Primary endpoints:

POST /alerts/investigate
GET  /investigations/{investigation_id}
GET  /approvals/pending
POST /approvals/{request_id}/approve
POST /approvals/{request_id}/reject
GET  /health
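
A minimal client sketch for the investigate endpoint, using only the Python standard library. The endpoint path comes from the list above; the alert payload fields are not documented here, so any field names passed in are assumptions to be checked against the actual request schema. The helper names `build_investigate_request` and `investigate` are hypothetical.

```python
import json
import urllib.request


def build_investigate_request(base_url: str, alert: dict) -> urllib.request.Request:
    """Build a POST request for /alerts/investigate with a JSON body."""
    body = json.dumps(alert).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/alerts/investigate",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def investigate(base_url: str, alert: dict, timeout: float = 30.0) -> dict:
    """Send the alert and return the decoded JSON response."""
    request = build_investigate_request(base_url, alert)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the service running locally, a call such as `investigate("http://localhost:8000", {"title": "Suspicious login"})` would submit an alert; the `title` field is illustrative only.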

MCP Server

The MCP server exposes the investigation system as tools for AI hosts.

uv run python -m backend.app.integrations.mcp_server

Available MCP tools:

investigate_alert
lookup_threat_intel
get_investigation

Example VS Code workspace configuration:

{
  "servers": {
    "soc-investigator": {
      "command": "uv",
      "args": [
        "--directory",
        "/absolute/path/to/ai-soc-investigator",
        "run",
        "python",
        "-m",
        "backend.app.integrations.mcp_server"
      ]
    }
  }
}

This repository includes .vscode/mcp.json with the same workspace server definition. Use MCP: List Servers from the VS Code command palette and start soc-investigator.

Investigation Flow

The API or MCP layer receives an alert and validates the request.
The investigation service builds a domain Alert and delegates to the supervisor.
Input guardrails block prompt injection and malformed or oversized alert text before model access.
The rules engine handles known patterns such as credential dumping tools, scheduled scanner activity, and certificate lifecycle events.
Ambiguous alerts are escalated to the triage agent for severity classification, MITRE mapping, confidence scoring, and rationale generation.
The enrichment agent checks relevant indicators and normalizes results into the report.
The response agent generates a playbook with action risk levels and approval requirements.
Governance controls apply token budgets, least-privilege policy, and human approval gates.
Output guardrails validate the report before returning it to the caller.
The final report includes severity, confidence, evidence, enrichment, recommended actions, trace entries, token usage, and latency.
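
The first steps of the flow above (input guardrails, then deterministic rules, then LLM escalation only for ambiguous alerts) can be sketched as follows. This is a simplified illustration under stated assumptions, not the supervisor's actual code: the `Verdict` type, the size limit, the injection markers, and the rule patterns are all hypothetical, and the triage agent is passed in as a plain callable.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MAX_ALERT_CHARS = 10_000  # hypothetical size limit
INJECTION_MARKERS = (     # hypothetical, deliberately tiny marker list
    "ignore previous instructions",
    "disregard the system prompt",
)


@dataclass
class Verdict:
    severity: str
    confidence: float
    decision_source: str  # "rules" or "llm"


class GuardrailError(ValueError):
    """Raised when an alert is rejected before any model access."""


def check_input(alert_text: str) -> None:
    """Reject oversized text and obvious prompt-injection phrases."""
    if len(alert_text) > MAX_ALERT_CHARS:
        raise GuardrailError("alert text too large")
    lowered = alert_text.lower()
    if any(marker in lowered for marker in INJECTION_MARKERS):
        raise GuardrailError("possible prompt injection")


def apply_rules(alert_text: str) -> Optional[Verdict]:
    """Return a verdict for known patterns, or None if the alert is ambiguous."""
    lowered = alert_text.lower()
    if "mimikatz" in lowered:  # credential dumping tool
        return Verdict("critical", 0.95, "rules")
    if "certificate renewal" in lowered:  # certificate lifecycle event
        return Verdict("low", 0.9, "rules")
    return None


def investigate(alert_text: str, triage: Callable[[str], Verdict]) -> Verdict:
    """Guardrails first, then rules; call the triage agent only as a fallback."""
    check_input(alert_text)
    verdict = apply_rules(alert_text)
    if verdict is not None:
        return verdict
    return triage(alert_text)
```

Passing the triage agent as a callable keeps the rules path testable without any model access, which matches the stated goal of preferring deterministic decisions when confidence is high.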

Configuration

Configuration is environment-driven.

cp .env.example .env

Important settings:

OPENAI_API_KEY
LLM_MODEL
LLM_TEMPERATURE
MAX_TOKENS_PER_INVESTIGATION
MAX_LLM_CALLS_PER_INVESTIGATION
MAX_TOOL_CALLS_PER_INVESTIGATION
LOG_LEVEL
DB_PATH

When OPENAI_API_KEY=demo-key, the system uses deterministic local fixtures for repeatable development and CI behavior. Set a real key to exercise live LLM calls.
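
A minimal sketch of how these settings might be loaded into a typed object. The variable names come from the list above; the `Settings` class, the `load_settings` helper, and every default value are assumptions for illustration and may differ from the project's configuration module.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    openai_api_key: str
    llm_model: str
    llm_temperature: float
    max_tokens_per_investigation: int
    max_llm_calls_per_investigation: int
    max_tool_calls_per_investigation: int
    log_level: str
    db_path: str

    @property
    def demo_mode(self) -> bool:
        # The demo-key value selects deterministic local fixtures.
        return self.openai_api_key == "demo-key"


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from an environment mapping; all defaults are assumptions."""
    return Settings(
        openai_api_key=env.get("OPENAI_API_KEY", "demo-key"),
        llm_model=env.get("LLM_MODEL", "example-model"),  # placeholder name
        llm_temperature=float(env.get("LLM_TEMPERATURE", "0.0")),
        max_tokens_per_investigation=int(env.get("MAX_TOKENS_PER_INVESTIGATION", "20000")),
        max_llm_calls_per_investigation=int(env.get("MAX_LLM_CALLS_PER_INVESTIGATION", "5")),
        max_tool_calls_per_investigation=int(env.get("MAX_TOOL_CALLS_PER_INVESTIGATION", "10")),
        log_level=env.get("LOG_LEVEL", "INFO"),
        db_path=env.get("DB_PATH", "investigations.db"),
    )
```

Accepting the environment as a mapping lets tests pass a plain dictionary instead of mutating `os.environ`.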

Testing

Run the full test suite:

uv run pytest

Run the evaluation harness:

uv run python -m backend.app.evals.evaluate

Run MCP-focused tests:

uv run pytest backend/tests/test_mcp_server.py

Docker

Build and run the API service:

docker compose up --build

The API is exposed on port 8000.

Engineering Notes

The current store is process-local and intentionally isolated behind backend/app/investigations/store.py. Replacing it with SQLite, Postgres, or a queue-backed workflow store does not require changing the API or MCP tool layer.
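
To illustrate why the store can be swapped without touching callers, here is a sketch of a store interface with an in-memory and a SQLite implementation. The `InvestigationStore` protocol, its method names, and the table layout are hypothetical; the actual interface in store.py may differ.

```python
import json
import sqlite3
from typing import Optional, Protocol


class InvestigationStore(Protocol):
    """Hypothetical interface the API and MCP layers would depend on."""

    def save(self, investigation_id: str, report: dict) -> None: ...
    def get(self, investigation_id: str) -> Optional[dict]: ...


class InMemoryStore:
    """Process-local store: contents are lost when the process exits."""

    def __init__(self) -> None:
        self._reports = {}

    def save(self, investigation_id: str, report: dict) -> None:
        self._reports[investigation_id] = report

    def get(self, investigation_id: str) -> Optional[dict]:
        return self._reports.get(investigation_id)


class SQLiteStore:
    """Same interface backed by SQLite; reports are stored as JSON text."""

    def __init__(self, db_path: str) -> None:
        self._conn = sqlite3.connect(db_path)
        self._conn.execute(
            "CREATE TABLE IF NOT EXISTS investigations "
            "(id TEXT PRIMARY KEY, report TEXT NOT NULL)"
        )
        self._conn.commit()

    def save(self, investigation_id: str, report: dict) -> None:
        self._conn.execute(
            "INSERT OR REPLACE INTO investigations (id, report) VALUES (?, ?)",
            (investigation_id, json.dumps(report)),
        )
        self._conn.commit()

    def get(self, investigation_id: str) -> Optional[dict]:
        row = self._conn.execute(
            "SELECT report FROM investigations WHERE id = ?", (investigation_id,)
        ).fetchone()
        return json.loads(row[0]) if row else None
```

Code that accepts an `InvestigationStore` works with either class, so the choice of backend becomes a wiring decision at startup.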

The threat-intel implementation uses an injectable provider interface. Production deployment should wire this to VirusTotal, AbuseIPDB, MISP, a commercial TIP, or an internal intelligence service.
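
A sketch of what an injectable provider interface can look like. The names `ThreatIntelProvider`, `IntelResult`, `FixtureProvider`, and `enrich` are hypothetical and chosen for illustration; a production provider would implement the same `lookup` method against a real intelligence service.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class IntelResult:
    indicator: str
    malicious: bool
    source: str


class ThreatIntelProvider(Protocol):
    """Hypothetical provider interface: one lookup per indicator."""

    def lookup(self, indicator: str) -> IntelResult: ...


class FixtureProvider:
    """Deterministic provider for development and tests."""

    def __init__(self, known_bad: set) -> None:
        self._known_bad = known_bad

    def lookup(self, indicator: str) -> IntelResult:
        return IntelResult(
            indicator=indicator,
            malicious=indicator in self._known_bad,
            source="fixture",
        )


def enrich(indicators: list, provider: ThreatIntelProvider) -> list:
    """Look up every indicator with whichever provider was injected."""
    return [provider.lookup(indicator) for indicator in indicators]
```

For example, `enrich(["203.0.113.7"], FixtureProvider({"203.0.113.7"}))` returns a single result marked malicious; the address is from a documentation-only range.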

The system is designed around bounded autonomy: model calls are budgeted, tool outputs are validated, high-risk actions require approval, and deterministic rules are preferred when confidence is high.
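
The budgeting part of bounded autonomy can be sketched as a small per-investigation counter. This is an assumption-laden illustration: the `InvestigationBudget` class, the `charge` method, and the choice to check limits before each call are hypothetical and not taken from the governance module.

```python
from dataclasses import dataclass


class BudgetExceeded(RuntimeError):
    """Raised when an investigation would exceed its token or call budget."""


@dataclass
class InvestigationBudget:
    max_tokens: int
    max_llm_calls: int
    tokens_used: int = 0
    llm_calls: int = 0

    def charge(self, tokens: int) -> None:
        """Record one LLM call of the given token cost, or raise if over budget."""
        if self.llm_calls + 1 > self.max_llm_calls:
            raise BudgetExceeded("LLM call budget exhausted")
        if self.tokens_used + tokens > self.max_tokens:
            raise BudgetExceeded("token budget exhausted")
        # Counters are only updated once both checks have passed.
        self.llm_calls += 1
        self.tokens_used += tokens
```

Because the counters change only after both checks pass, a rejected call leaves the budget state untouched, which keeps the reported usage accurate.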
