ContextLattice

Write-optimized control plane that agents inhabit as a runtime contract — durable handoff over amnesia

What's Peculiar

Orthogonal component: A durable orchestrator with staged dual-path retrieval, write-latency-optimized at the core. As of v3.4.0 (2026-06-05) it reframed itself as a runtime contract agents inhabit, not a memory service they call — a universal adapter lifecycle (bootstrap / context-pack / checkpoint / handoff / complete) that prevents agent amnesia and guarantees durable handoff. "Context freshness" is now secondary to that message.
Isolation boundary: None / Prompt-level — local-first, API-key auth, HTTP+MCP only. No container or micro-VM isolation.
Multi-tenancy: Single-node local-first by design — private-by-default, at the cost of cross-tenant isolation guarantees.
Validated usefulness: Active deployments with payment processing (Stripe/PayPal); v3.4.23 (2026-06-13), repo github.com/sheawinkler/ContextLattice alive and very active (pushed 2026-06-14, 113 stars, 7 forks, Go, not archived; 30 releases v3.3.x–v3.4.23 ~Feb–June 2026).

The Write-Path Problem

Most agent memory systems optimize the read path: Mem0 for knowledge graphs, Zep for temporal reasoning, Hindsight for multi-strategy retrieval. ContextLattice uniquely optimizes the write path.

Context freshness under load: In long-horizon agents, write latency cascades → queue congestion → stale retrieval indexes → poor context → unreliable decisions. ContextLattice explicitly models the efficacy chain:

Write speed ↑ → queue pressure ↓ → sink freshness ↑ → retrieval freshness ↑ → recall quality ↑ → task completion reliability ↑

Architecture: Staged Dual-Path Retrieval

Write Path

┌─────────────────────────────────────────────┐
│ Single Orchestrator Entry Point             │
│ - Durable raw write (Mongo ledger)         │
│ - Async fanout to specialized sinks         │
│ - Admission control + queue monitoring      │
└──────┬──────────────────────────────────────┘
       │
       ├─→ Fast Sinks: Topic rollups, Qdrant, pgvector
       ├─→ Deep Sinks: MindsDB, Letta, memory-bank
       └─→ Durable Outbox: Coalescing, retries, backpressure

Read Path (Two-Lane Retrieval)

Fast Lane (p50: 450ms-2s)
├─ Topic rollups (high-signal summaries)
├─ Qdrant (semantic vectors)
└─ PostgreSQL-pgvector (dense embeddings)
   ↓
Fusion + Learning Reranking
├─ Multi-source result merging
├─ Feedback-driven ranking improvement
└─ Code-context enrichment (symbol overlap, file-path proximity)
   ↓
Deep Continuation (async, non-blocking)
├─ MindsDB (ML/analytics queries)
├─ Mongo raw (full ledger search)
├─ Letta (agent memory integration)
└─ memory-bank (specialized retrieval)

Fail-open degradation: Returns fast-lane results even if deep lanes timeout. Retrieval never blocks on slow backends.

Agent Runtime Contract (v3.4.0)

v3.4.0 (2026-06-05) crosses CL from a callable memory service to an inhabitable agent runtime. The website headline shifted from write-optimization / freshness framing to amnesia prevention: "Stop giving your agents amnesia and calling it workflow."

Universal adapter: a single contextlattice_agent_adapter lifecycle — bootstrap, context-pack, checkpoint, handoff, event, complete — that any agent runtime implements once.
Native runtime sessions: first-class sessions via /v1/agents/sessions with /telemetry/agents/runtime, plus objective_runtime_state.v1 threaded across preflight / context-pack / Dream Mode.
Compiled context packets: prompt-ready compiled context packages plus a surfaced Skills Index (commits #272, #274, June 2026) — the shift from raw transcript replay to clean compiled packets for cleaner prompts and cross-agent handoff.
Async-continuation steering (v3.4.21, 2026-06-13): agents can steer the async slow-lane deep continuation rather than just consume it.

Technical Stack

Gateway: Go service (41 files, 212KB) — HTTP ingress, write admission, retrieval coordination
Core Engine: Rust crates — context_engine (memory graph primitives), context_retrieval (ranking/indexing), context_codec (serialization)
Vector Stores: Qdrant, Weaviate, LanceDB backends
Relational: PostgreSQL-pgvector, MongoDB, MindsDB
LLM Routing: Ollama (local models), Letta integration
Protocol: MCP-native (Model Context Protocol, 100 msgs/sec writes)

Comparison With Read-Optimized Systems

System	Primary Focus	Write Path	Read Path
ContextLattice	Context freshness + efficacy	Durable orchestrator + fanout	Staged multi-lane + learning rerank
Letta	Stateful agent memory blocks	Agent tools (editable)	In-context injection
Mem0	Structured knowledge graphs	Vector + entity extraction	Graph + vector search
Zep	Temporal knowledge graphs	Entity extraction	Temporal graph traversal
Hindsight	Multi-strategy retrieval	Standard RAG	4 parallel strategies (semantic, BM25, graph, temporal)

ContextLattice's niche: Long-horizon agent reliability where write latency matters. Less useful if you need temporal reasoning (Zep) or knowledge graphs (Mem0/Cognee).

Isolation & Multi-Tenancy

Isolation mechanism: None / Prompt-level on the spectrum. Local-first, API-key auth on protected endpoints, HTTP+MCP transport only. No container, micro-VM, or network-level isolation.

Multi-tenancy trade-offs: Single-node local-first by design — there is no cross-tenant firewall because there is no multi-node tenancy model. The boundary is the box it runs in.

✅ Private-by-default (local-first eliminates SaaS trust boundaries; data never leaves the node)
✅ Single control plane (operational simplicity, cost efficiency)
⚠️ The trade-off: private-by-default at the cost of cross-tenant isolation guarantees
⚠️ Shared resource pools (one Qdrant, one Mongo → noisy neighbor risk if co-tenanted)

Recommended architecture: Single-tenant on-premises for air-gapped deployments; any multi-tenant SaaS use must layer RBAC and process isolation on top.

Production Evidence

Active deployments: Payment processing features (Stripe webhooks, PayPal verification) indicate real paying customers.
Release velocity: Very high — 30 GitHub releases, v3.3.x through v3.4.23 in roughly Feb–June 2026; repo pushed 2026-06-14, not archived.
Community: 113 stars, 7 forks (small but focused), registered on Glama MCP server platform.
License: Business Source License 1.1 (non-open, commons clause).
Version boundary: v3.4.0 is the stable public agent-runtime-contract baseline; v4 remains a private tuning/experiment lane gated on benchmark/recall/soak.

Performance baseline concerns: Public perf-baseline.md shows degradation on complex workloads (multi-agent: p50=21s with 33% timeout rate). No external benchmarks comparing retrieval efficacy against competitors.

When to Use ContextLattice

Choose ContextLattice if:

Long-running agents where write latency compounds into stale context
High write throughput (100+ msgs/sec) with real-time retrieval needs
On-premises deployment (local-first, air-gapped environments)
MCP-native integration (Model Context Protocol ecosystem)

Choose alternatives if:

Need temporal reasoning → Zep
Need knowledge graph induction → Mem0, Cognee
Need stateful agent memory blocks → Letta
Need multi-strategy retrieval precision → Hindsight

Links

GitHub Repository · Official Website · Glama MCP Registry

← Agents Hub · Memory Systems