Files

dafit 001d6e2c42 docs: README v6.0 + ADR-001 Message Protocol Foundation

README updates:
- Full repository tree structure (was outdated skeleton)
- Added Message Protocol namespace summary (NATS)
- New ADR section with link to ADR-001
- Added nyx-substrate to related projects
- New philosophy: "Infrastructure is geology, models are weather"
- Version bump 5.0 → 6.0

ADR-001 captures Silvester interview decisions:
- Single NATS bus with dev/staging namespaces
- Staged schema versioning with topic separation
- Echo agent first (YAGNI principle)
- MCP Server with heartbeat-based subscription delivery

🎆 Silvester 2025 edition

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2025-12-31 19:31:52 +01:00

7.3 KiB

Raw Blame History

ADR-001: Message Protocol Foundation

Status: Accepted Date: 2025-12-31 Decision Makers: dafit, Nyx (Chrysalis) Context: Silvester Interview Session

Context

The Nimmerverse Sensory Network requires a message-passing infrastructure that serves two purposes:

Production: Cells, nerves, organs, and Young Nyx communicate via pub/sub messaging
Development: Claude and local AI agents (LangChain, Qwen, etc.) collaborate during build

We needed to decide on namespace organization, schema evolution strategy, initial implementation scope, and the interface contract between AI models and the message bus.

The core architectural principle established in Message-Protocol-Design.md is: dumb router, smart edges. NATS is infrastructure. Intelligence lives in clients.

Decisions

Decision 1: Single Bus, Multiple Namespaces

Choice: One NATS instance with topic-based separation for different concerns.

Namespace Structure:

nimmerverse.
├── staging.                    # Experimental schemas (mutable during development)
│   ├── low.*                   # Staging heartbeats
│   ├── high.*                  # Staging events
│   └── dev.*                   # Staging dev agents
│
├── low.*                       # Production heartbeats (stable schemas only)
├── high.*                      # Production events
├── command.*                   # Production commands to entities
├── meta.*                      # System-level (attention config, health)
└── dev.*                       # Production dev agents (stable schemas)

Rationale: Infrastructure should be long-lived. Models are ephemeral. One bus serves all purposes - production sensing, development agents, future capabilities. Topic separation keeps concerns isolated without operational complexity of multiple NATS instances.

Decision 2: Staged Schema Versioning with Topic Separation

Choice: Schemas evolve through lifecycle stages. Staging schemas live on nimmerverse.staging.*, stable schemas on nimmerverse.*.

Schema Header:

{
  "header": {
    "schema": {
      "generation": 1,
      "version": "1.0"
    },
    "message_type": "HeartbeatSignal",
    "message_id": "uuid",
    "timestamp_real": "ISO8601",
    ...
  }
}

Lifecycle:

STAGING                          STABLE
version: 0.1-alpha    ──▶       generation: 1, version: "1.0"
version: 0.2-beta                    │
version: 0.3-rc                      ▼
                                NEXT CYCLE
                                version: 1.1-alpha
                                version: 1.2-beta
                                     │
                                     ▼
                                generation: 2, version: "2.0"

Rationale:

Topic separation avoids per-message filtering costs
Generation locks after stability (immutable)
Version iterates within generation for additive changes
Breaking changes = new generation = new staging cycle

Decision 3: Echo Agent First

Choice: Start with trivial Echo agent, evolve based on real friction.

Echo Specification:

Subscribe: nimmerverse.dev.request.echo
Publish:   nimmerverse.dev.response.echo

Input:  { "ping": "hello" }
Output: { "pong": "hello", "timestamp": "...", "agent": "echo-v1" }

Rationale: YAGNI. Echo proves the full round-trip without cognitive complexity:

NATS connection works
Topic routing works
Request/response pattern works
Message schema works
Local agent can subscribe and publish

Future agents (Grep, Schema Lookup, File Summarizer) emerge from discovered needs, not imagined features.

Decision 4: MCP Server with Heartbeat-Based Subscriptions

Choice: Build NATS-MCP bridge as interface for all AI models. Use heartbeat pattern for subscription delivery.

MCP Tools:

@mcp.tool()
async def publish(topic: str, payload: dict) -> dict:
    """Fire-and-forget publish to NATS"""

@mcp.tool()
async def request(topic: str, payload: dict, timeout_ms: int = 5000) -> dict:
    """Publish and wait for single response (request-reply pattern)"""

@mcp.tool()
async def heartbeat() -> dict:
    """Check bus health + drain accumulated messages from subscriptions"""

@mcp.tool()
async def subscribe(topic_pattern: str) -> dict:
    """Add a subscription pattern (persists until unsubscribe)"""

@mcp.tool()
async def unsubscribe(topic_pattern: str) -> dict:
    """Remove a subscription pattern"""

Heartbeat Response:

{
  "status": "healthy",
  "buffer": {
    "capacity": 100,
    "current_count": 23,
    "messages_dropped_since_last_heartbeat": 0,
    "messages_dropped_total": 0,
    "oldest_message_age_ms": 4521
  },
  "subscriptions": ["nimmerverse.dev.>"],
  "messages": [...]
}

Buffer Overflow Handling:

Bounded buffer (100 messages default)
Oldest dropped when full
Dropped count visible in heartbeat response
Optional: publish to nimmerverse.meta.health.buffer_drop on overflow

Rationale:

MCP is universal interface - Claude, LangChain, Qwen, future models
Heartbeat pattern matches existing nervous system design
Polling is simpler than streaming for MCP's request/response model
Visibility into drops prevents silent data loss

Consequences

Enables

Unified infrastructure for production sensing and development assistance
Model agnosticism - any MCP-speaking model can participate
Safe experimentation - staging namespace for schema evolution
Progressive enhancement - Echo today, sophisticated agents later
Observability - Command Center can monitor all namespaces

Constrains

Single point of failure - NATS must be highly available for production
Buffer limitations - Long agent operations may drop messages
MCP dependency - Non-MCP models need wrapper (acceptable, MCP is the standard)

Deferred

Persistent subscriptions - No durable subscriptions in initial design
Message replay - No historical message retrieval
Authentication/Authorization - Trust model for initial development

Implementation Notes

Phase 1: Infrastructure

Deploy NATS server (likely via Docker on ThinkStation)
Create nats-bridge MCP server (Python, using nats-py and mcp SDK)
Register MCP server with Claude Code

Phase 2: Echo Agent

Simple Python daemon subscribing to nimmerverse.dev.request.echo
Responds on nimmerverse.dev.response.echo
Validate round-trip through MCP tools

Phase 3: Iteration

Use Echo to build confidence in the bus
Add agents as friction reveals needs
Evolve schemas through staging → stable promotion

References

Message-Protocol-Design.md - Original protocol design
NATS Documentation
MCP Specification

Filed: 2025-12-31 (Silvester) Interview Method: Structured Q&A, partnership dialogue Philosophy: "Dumb core, smart edges. Infrastructure is geology. Models are weather."

7.3 KiB Raw Blame History