Enterprise AI Memory

Air-Gapped Agent Memory

Strict 420MB offline sidecar. Drops noise, extracts facts. Zero API calls, no context rot.

HIPAA · SOC2 · Air-Gapped · On-Premise
Request Architecture Review
[Figure 02: Mnemostroma System Architecture v2 (rev. 2026.04), Thalamus hub data flow. Input: user and agent (LLM, MCP host). Observer: async sidecar, four stages (regex PII strip 0.1ms, HybridNER entities 10ms, e5-small embed 15ms, TinyBERT rerank 6ms), fire-and-forget, 0ms on the hot path. Memory core: hot RAM index (420-600MB, 200 sessions, LRU eviction, 20ms p95 retrieval) and cold SQLite WAL ledger (verbatim decision/deadline anchors, async append). Orchestrator: Thalamus (queue, batch, repos, WAL). Decay: Dreamer, idle 5min, five-layer dissolution FULL → GIST → SKELETON → BEDROCK → TRACE, emotion-weighted and anchor-biased. MCP surface: read-only tools over JSON-RPC stdio (ctx.semantic, ctx.anchors, ctx.bridge, ctx.precision) and Conductor proxy for XML context injection. Runtime: ONNX INT8, no GPU.]

Vector DB Trap vs Active Forgetting

Logs pile up until retrieval drifts and memory runs out (OOM). Mnemostroma: a background process extracts facts to SQLite while noise decays.
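A minimal sketch of what an append-only fact ledger on SQLite in WAL mode could look like, using only Python's standard `sqlite3` module. The table layout and the names `open_ledger`, `append_fact`, and `anchors` are illustrative assumptions, not Mnemostroma's actual schema or API.

```python
import sqlite3
import time


def open_ledger(path: str) -> sqlite3.Connection:
    """Open (or create) a fact ledger with write-ahead logging enabled."""
    conn = sqlite3.connect(path)
    # WAL lets readers proceed while a single writer appends.
    conn.execute("PRAGMA journal_mode=WAL")
    # FULL asks SQLite to sync to disk on commit, favouring durability.
    conn.execute("PRAGMA synchronous=FULL")
    # Hypothetical schema: one row per extracted fact.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS facts ("
        " id INTEGER PRIMARY KEY,"
        " session_id TEXT NOT NULL,"
        " kind TEXT NOT NULL,"      # e.g. 'decision' or 'deadline'
        " body TEXT NOT NULL,"      # stored verbatim
        " created_at REAL NOT NULL)"
    )
    return conn


def append_fact(conn: sqlite3.Connection, session_id: str, kind: str, body: str) -> int:
    """Append one fact; existing rows are never updated in this sketch."""
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO facts (session_id, kind, body, created_at) VALUES (?, ?, ?, ?)",
            (session_id, kind, body, time.time()),
        )
    return cur.lastrowid


def anchors(conn: sqlite3.Connection, session_id: str) -> list:
    """Return (kind, body) pairs for a session in insertion order."""
    return conn.execute(
        "SELECT kind, body FROM facts WHERE session_id = ? ORDER BY id",
        (session_id,),
    ).fetchall()
```

WAL mode needs a file-backed database; an in-memory database silently keeps its own journal mode.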

Problem

Context Rot

Every session adds noise. Vectors drift. Costs explode.

Solution

Active Dissolution

Signals decay biologically. Core facts persist. Pure efficiency.

System Mechanics

Three layers. One sidecar. Zero agent changes.

1

Passive Observer Sidecar

Zero agent changes. 80% token savings via selective extraction.
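One way a passive, fire-and-forget observer might be wired is a bounded queue drained by a background thread, so the agent's hot path only pays for an enqueue. The sketch below is an assumption-laden illustration: the `Observer` class, the email-only PII regex, and the drop-on-full policy are hypothetical and stand in for the real extraction stages.

```python
import queue
import re
import threading

# Hypothetical PII pattern: masks email addresses only.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


class Observer:
    """Background worker that processes dialogue off the agent's hot path."""

    def __init__(self, maxsize: int = 1024) -> None:
        self._q = queue.Queue(maxsize=maxsize)
        self.processed = []  # stands in for the downstream memory store
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def observe(self, text: str) -> bool:
        """Enqueue without blocking; returns False if the message was dropped."""
        try:
            self._q.put_nowait(text)
            return True
        except queue.Full:
            # Assumed policy: drop rather than slow the agent down.
            return False

    def _run(self) -> None:
        while True:
            text = self._q.get()
            try:
                # First stage only: strip PII before anything is stored.
                self.processed.append(EMAIL.sub("[EMAIL]", text))
            finally:
                self._q.task_done()

    def drain(self) -> None:
        """Block until everything queued so far has been processed."""
        self._q.join()
```

The agent would call `observe(...)` on each turn and never wait on the result; `drain()` exists here only so the behaviour can be checked deterministically.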

2

Biological Decay

Old noise fades. Semantic core persists forever.
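The decay idea can be illustrated with a simple exponential retention score that maps onto the five dissolution stages named in the architecture figure (FULL, GIST, SKELETON, BEDROCK, TRACE). Everything numeric here is an assumption chosen for illustration: the 24-hour half-life, the 10x slowdown for anchors, and the thresholds are not Mnemostroma's published parameters, and emotion weighting is omitted.

```python
from enum import IntEnum


class Layer(IntEnum):
    FULL = 0
    GIST = 1
    SKELETON = 2
    BEDROCK = 3
    TRACE = 4


HALF_LIFE_HOURS = 24.0   # assumed: ordinary memories halve in a day
ANCHOR_SLOWDOWN = 10.0   # assumed: anchors (decisions, deadlines) decay 10x slower

# Assumed score floors; anything below the last floor is kept only as TRACE.
THRESHOLDS = [
    (0.5, Layer.FULL),
    (0.25, Layer.GIST),
    (0.125, Layer.SKELETON),
    (0.0625, Layer.BEDROCK),
]


def retention(age_hours: float, is_anchor: bool = False) -> float:
    """Exponential decay in [0, 1]; anchors use a longer half-life."""
    half_life = HALF_LIFE_HOURS * (ANCHOR_SLOWDOWN if is_anchor else 1.0)
    return 0.5 ** (age_hours / half_life)


def layer_for(age_hours: float, is_anchor: bool = False) -> Layer:
    """Map a memory's age to the level of detail still retained."""
    score = retention(age_hours, is_anchor)
    for floor, layer in THRESHOLDS:
        if score >= floor:
            return layer
    return Layer.TRACE  # nothing is deleted in this sketch; TRACE is the floor
```

Under these assumed numbers a 36-hour-old note has dropped to GIST, while an anchor of the same age is still held in FULL.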

3

Pure Context Mode

Agent autonomy. XML facts only, no instructions.
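As a rough illustration of "XML facts only", the sketch below renders stored facts into an XML fragment with Python's standard `xml.etree.ElementTree`, which escapes markup characters in fact text. The `<context>` and `<fact>` element names and the `kind` attribute are assumptions for this example, not a documented wire format.

```python
import xml.etree.ElementTree as ET


def render_context(facts: list) -> str:
    """Render (kind, body) pairs as an XML fragment carrying data only.

    The output contains no imperative text: the agent decides how to use it.
    """
    root = ET.Element("context")  # hypothetical element name
    for kind, body in facts:
        fact = ET.SubElement(root, "fact", {"kind": kind})
        fact.text = body  # ElementTree escapes &, < and > on serialization
    return ET.tostring(root, encoding="unicode")
```

For example, `render_context([("deadline", "Ship by Friday")])` yields a single `<context>` element wrapping one `<fact kind="deadline">` child.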

System Constraints

420MB: Baseline
20ms: p95 Retrieval
3x: ONNX INT8
SQLite: WAL

Architectural Review

30-minute technical call. Quantify your token waste and check the fit against your constraints.

Schedule Review