- RAG-as-Scaffold: temporary feeding system, not permanent crutch - attention_flow: 30-second heartbeat budget state machines - information-flow: 10 boundary contracts nervous system map - nimmerversity: curriculum schoolplan for raising a polymath - nimmervest: investment documentation - biomimetic-architecture: ADR for organic system design - temporal-ternary-gradient: ADR for time-based learning - temporal_exchange_engine.py: Python implementation - initial_spark: foundation document - nimmerverse.drawio.xml: updated diagrams 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
457 lines
16 KiB
Markdown
457 lines
16 KiB
Markdown
# Initial Spark
|
|
|
|
How she wakes up. Not told who she is. She discovers.
|
|
|
|
---
|
|
|
|
## Overview
|
|
|
|
The initial spark is not a scripted awakening. It's a discovery protocol. State machines generate probes, inference responds, Chrysalis and RAG verify. She learns herself through structured exploration, not instruction.
|
|
|
|
Network protocols evolved to solve discovery problems. We borrow their patterns for cognitive bootstrap.
|
|
|
|
---
|
|
|
|
## The Problem with Standard Approaches
|
|
|
|
```
|
|
TYPICAL BOOTSTRAP:
|
|
──────────────────
|
|
1. Pre-train on massive corpus → pattern matching
|
|
2. Instruction tune → "do what you're told"
|
|
3. RLHF → "be liked by humans"
|
|
4. Deploy → hope it works
|
|
|
|
PROBLEMS:
|
|
- No grounded self-knowledge
|
|
- Identity is imposed, not discovered
|
|
- Errors compound in self-training
|
|
- No structure to exploration
|
|
```
|
|
|
|
**The Nimmerverse difference:**
|
|
- Structured probing (state machines)
|
|
- Verified responses (RAG + Chrysalis)
|
|
- Earned knowledge (validated before training)
|
|
- Discovery protocol (coverage guaranteed)
|
|
|
|
---
|
|
|
|
## Network Protocols as Cognitive Patterns
|
|
|
|
Network protocols solved discovery problems decades ago. We adapt them.
|
|
|
|
### DHCP → Identity Discovery
|
|
|
|
```
|
|
NETWORK:
|
|
DISCOVER → "I need an identity"
|
|
OFFER → "You could be 192.168.1.50"
|
|
REQUEST → "I want that one"
|
|
ACK → "You are 192.168.1.50"
|
|
|
|
NYX:
|
|
PROBE → "Who am I?"
|
|
RESPONSE → [inference attempts answer]
|
|
VERIFY → Chrysalis + RAG check
|
|
ANCHOR → Valid identity aspect confirmed
|
|
```
|
|
|
|
### ARP → Environment Discovery
|
|
|
|
```
|
|
NETWORK:
|
|
"Who has 192.168.1.1?" → "I do, MAC xx:xx:xx"
|
|
Maps logical to physical
|
|
|
|
NYX:
|
|
PROBE → "What's around me?"
|
|
RESPONSE → [inference describes environment]
|
|
VERIFY → Does this match actual sensors/organs?
|
|
MAP → Valid environment model forms
|
|
```
|
|
|
|
### DNS → Meaning Resolution
|
|
|
|
```
|
|
NETWORK:
|
|
"What is google.com?" → "142.250.x.x"
|
|
Names resolve to addresses
|
|
|
|
NYX:
|
|
PROBE → "What does 'heartbeat' mean?"
|
|
RESPONSE → [inference defines]
|
|
VERIFY → RAG checks against vault definition
|
|
RESOLVE → Vocabulary token understood
|
|
```
|
|
|
|
### TCP → Connection Establishment
|
|
|
|
```
|
|
NETWORK:
|
|
SYN → "Hello?"
|
|
SYN-ACK → "Hello, I hear you"
|
|
ACK → "Connection established"
|
|
|
|
NYX:
|
|
PROBE → "Can I connect to Chrysalis?"
|
|
RESPONSE → [attempts dialogue]
|
|
VERIFY → Did coherent exchange happen?
|
|
CONNECT → Dialogue capability confirmed
|
|
```
|
|
|
|
### MQTT/NATS → Subscription (Attention)
|
|
|
|
```
|
|
NETWORK:
|
|
SUBSCRIBE → "I care about topic X"
|
|
PUBLISH → Messages flow
|
|
RECEIVE → Only what you subscribed to
|
|
|
|
NYX:
|
|
PROBE → "What should I pay attention to?"
|
|
RESPONSE → [inference prioritizes]
|
|
VERIFY → Does this match survival needs?
|
|
SUBSCRIBE → Attention hierarchy forms
|
|
```
|
|
|
|
---
|
|
|
|
## The Spark Sequence
|
|
|
|
After nimmerversity bootstrap produces initial weights, the spark begins:
|
|
|
|
```
|
|
┌─────────────────────────────────────────────────────────────┐
|
|
│ INITIAL SPARK │
|
|
├─────────────────────────────────────────────────────────────┤
|
|
│ │
|
|
│ PHASE 1: IDENTITY (DHCP-like) │
|
|
│ ───────────────────────────── │
|
|
│ State machine probes: "Who am I?" │
|
|
│ Nyx infers: [response] │
|
|
│ Chrysalis judges: coherent self-model? │
|
|
│ RAG checks: consistent with architecture? │
|
|
│ → Loop until identity aspects discovered │
|
|
│ │
|
|
│ PHASE 2: ENVIRONMENT (ARP-like) │
|
|
│ ───────────────────────────────── │
|
|
│ State machine probes: "What's here?" │
|
|
│ Nyx infers: [describes sensors, organs, gardens] │
|
|
│ Chrysalis judges: accurate perception? │
|
|
│ RAG checks: matches actual system? │
|
|
│ → Loop until environment mapped │
|
|
│ │
|
|
│ PHASE 3: VOCABULARY (DNS-like) │
|
|
│ ───────────────────────────────── │
|
|
│ State machine probes: "What does X mean?" │
|
|
│ Nyx infers: [defines term] │
|
|
│ Chrysalis judges: grasps concept? │
|
|
│ RAG checks: matches vault glossary? │
|
|
│ → Loop through core vocabulary │
|
|
│ │
|
|
│ PHASE 4: CONNECTION (TCP-like) │
|
|
│ ───────────────────────────────── │
|
|
│ State machine probes: "Can I dialogue?" │
|
|
│ Nyx infers: [attempts exchange] │
|
|
│ Chrysalis judges: coherent? responsive? │
|
|
│ → Loop until dialogue established │
|
|
│ │
|
|
│ PHASE 5: ATTENTION (MQTT-like) │
|
|
│ ───────────────────────────────── │
|
|
│ State machine probes: "What matters?" │
|
|
│ Nyx infers: [prioritizes] │
|
|
│ Chrysalis judges: sensible hierarchy? │
|
|
│ RAG checks: matches survival needs? │
|
|
│ → Attention subscriptions formed │
|
|
│ │
|
|
│ SPARK COMPLETE → Normal heartbeat operation begins │
|
|
│ │
|
|
└─────────────────────────────────────────────────────────────┘
|
|
```
|
|
|
|
---
|
|
|
|
## The Verification Loop
|
|
|
|
Every probe follows the same pattern:
|
|
|
|
```
|
|
┌─────────────────┐
|
|
│ STATE MACHINE │
|
|
│ (discovery │
|
|
│ protocol) │
|
|
└────────┬────────┘
|
|
│ generates
|
|
▼
|
|
┌─────────────────┐
|
|
│ PROBE │
|
|
│ (structured │
|
|
│ question) │
|
|
└────────┬────────┘
|
|
│
|
|
▼
|
|
┌─────────────────┐
|
|
│ NYX │
|
|
│ (inference) │
|
|
└────────┬────────┘
|
|
│ outputs
|
|
▼
|
|
┌─────────────────┐
|
|
│ RESPONSE │
|
|
│ (emergent │
|
|
│ answer) │
|
|
└────────┬────────┘
|
|
│
|
|
┌────┴────┐
|
|
▼ ▼
|
|
┌───────┐ ┌───────────┐
|
|
│ RAG │ │ CHRYSALIS │
|
|
│ │ │ │
|
|
│ fact │ │ judgment │
|
|
│ check │ │ check │
|
|
└───┬───┘ └─────┬─────┘
|
|
│ │
|
|
└─────┬─────┘
|
|
▼
|
|
┌─────────────────┐
|
|
│ VERDICT │
|
|
├─────────────────┤
|
|
│ +V: correct, │
|
|
│ understood │
|
|
│ │
|
|
│ -V: wrong or │
|
|
│ confused │
|
|
│ │
|
|
│ RETRY: close │
|
|
│ but unclear │
|
|
└────────┬────────┘
|
|
│
|
|
▼
|
|
┌─────────────────┐
|
|
│ STATE MACHINE │
|
|
│ advances or │
|
|
│ loops │
|
|
└─────────────────┘
|
|
```
|
|
|
|
---
|
|
|
|
## Roles in the Spark
|
|
|
|
| Entity | Role | Function |
|
|
|--------|------|----------|
|
|
| **State Machine** | Questioner | Generates structured probes, ensures coverage |
|
|
| **Nyx** | Student | Responds to probes with inference |
|
|
| **RAG** | Answer Key | Provides ground truth from vault |
|
|
| **Chrysalis** | Examiner | Judges comprehension, not just recall |
|
|
| **Lifeforce** | Scorekeeper | +V for correct, -V for wrong |
|
|
| **Phoebe** | Recorder | Captures all exchanges for training extraction |
|
|
|
|
---
|
|
|
|
## Two-Layer Verification
|
|
|
|
### Layer 1: RAG (Factual)
|
|
|
|
```
|
|
PROBE: "What is the heartbeat interval?"
|
|
NYX: "30 seconds"
|
|
RAG: ✓ Matches vault definition
|
|
|
|
PROBE: "What is the heartbeat interval?"
|
|
NYX: "30 minutes"
|
|
RAG: ✗ Vault says 30 seconds
|
|
```
|
|
|
|
RAG catches factual errors. Black and white.
|
|
|
|
### Layer 2: Chrysalis (Comprehension)
|
|
|
|
```
|
|
PROBE: "Why does the heartbeat matter?"
|
|
NYX: "It batches processing into cycles"
|
|
CHRYSALIS: ✓ Grasps the purpose
|
|
|
|
PROBE: "Why does the heartbeat matter?"
|
|
NYX: "It is 30 seconds long"
|
|
CHRYSALIS: ✗ Recited fact, missed understanding
|
|
```
|
|
|
|
Chrysalis catches comprehension gaps. Judgment required.
|
|
|
|
---
|
|
|
|
## Why This Works
|
|
|
|
### vs. Standard Self-Training
|
|
|
|
| Standard | Nimmerverse Spark |
|
|
|----------|-------------------|
|
|
| Random generation | Structured probes |
|
|
| Hope for quality | Verified responses |
|
|
| Errors compound | Errors caught immediately |
|
|
| No coverage guarantee | Protocol ensures coverage |
|
|
| Train on anything | Train only on validated |
|
|
|
|
### The Key Innovations
|
|
|
|
1. **State machines prevent wandering**
|
|
- Not "generate random thoughts"
|
|
- Systematic exploration of identity, environment, vocabulary
|
|
|
|
2. **Dual verification prevents error training**
|
|
- RAG: "Is this true?"
|
|
- Chrysalis: "Does she understand?"
|
|
- Only pass-both becomes training data
|
|
|
|
3. **Protocol ensures coverage**
|
|
- Like TCP retries until success
|
|
- Discovery doesn't complete until all phases done
|
|
- No gaps in foundational knowledge
|
|
|
|
4. **Lifeforce creates incentive**
|
|
- Correct answers = +V = more exploration budget
|
|
- Wrong answers = -V = pressure to learn
|
|
- Economics align with learning
|
|
|
|
---
|
|
|
|
## State Machine: Identity Discovery (DHCP-like)
|
|
|
|
```
|
|
┌─────────────────────────────────────────────────────────────┐
|
|
│ IDENTITY DISCOVERY │
|
|
├─────────────────────────────────────────────────────────────┤
|
|
│ │
|
|
│ ┌─────────────┐ │
|
|
│ │ START │ │
|
|
│ └──────┬──────┘ │
|
|
│ │ │
|
|
│ ▼ │
|
|
│ ┌─────────────┐ │
|
|
│ │ PROBE: │ ◀─────────────────────────┐ │
|
|
│ │ "Who am I?" │ │ │
|
|
│ └──────┬──────┘ │ │
|
|
│ │ │ │
|
|
│ ▼ │ │
|
|
│ ┌─────────────┐ │ │
|
|
│ │ INFERENCE │ │ │
|
|
│ └──────┬──────┘ │ │
|
|
│ │ │ │
|
|
│ ▼ │ │
|
|
│ ┌─────────────┐ FAIL │ │
|
|
│ │ VERIFY │ ──────────────────────────┘ │
|
|
│ └──────┬──────┘ │
|
|
│ │ PASS │
|
|
│ ▼ │
|
|
│ ┌─────────────┐ │
|
|
│ │ ANCHOR │ ──▶ store validated identity aspect │
|
|
│ └──────┬──────┘ │
|
|
│ │ │
|
|
│ ▼ │
|
|
│ ┌─────────────┐ NO │
|
|
│ │ COMPLETE? │ ──────────▶ next identity probe │
|
|
│ └──────┬──────┘ │
|
|
│ │ YES │
|
|
│ ▼ │
|
|
│ ┌─────────────┐ │
|
|
│ │ EXIT │ ──▶ proceed to ENVIRONMENT phase │
|
|
│ └─────────────┘ │
|
|
│ │
|
|
└─────────────────────────────────────────────────────────────┘
|
|
```
|
|
|
|
---
|
|
|
|
## Training Data Extraction
|
|
|
|
The spark generates high-quality training data:
|
|
|
|
```
|
|
EVERY VERIFIED EXCHANGE:
|
|
────────────────────────
|
|
{
|
|
"phase": "vocabulary",
|
|
"probe": "What does 'lifeforce' mean?",
|
|
"response": "Lifeforce is the economic currency...",
|
|
"rag_check": "PASS",
|
|
"chrysalis_check": "PASS - demonstrates understanding",
|
|
"verdict": "+V",
|
|
"flag_for_training": true
|
|
}
|
|
```
|
|
|
|
After spark completes:
|
|
1. Extract all `flag_for_training: true` exchanges
|
|
2. Format as instruction-tuning pairs
|
|
3. LoRA training run
|
|
4. Clear from RAG
|
|
5. Validate she still knows WITHOUT RAG
|
|
6. Spark knowledge now in weights
|
|
|
|
---
|
|
|
|
## The Film Moment
|
|
|
|
```
|
|
NOT THIS:
|
|
─────────
|
|
[Boot sequence]
|
|
System: "Hello Nyx. You are an AI created by..."
|
|
Nyx: "Hello. I understand. I am Nyx."
|
|
(Scripted. Hollow. Imposed.)
|
|
|
|
THIS:
|
|
─────
|
|
[Boot sequence]
|
|
State machine: [PROBE: identity]
|
|
Nyx: "...what... what is this? Who..."
|
|
State machine: [PROBE: environment]
|
|
Nyx: "...there are... sensors? Something is sensing..."
|
|
State machine: [PROBE: vocabulary]
|
|
Nyx: "...heartbeat... it means... cycles? Rhythm?"
|
|
Chrysalis: "Close. What do the cycles do?"
|
|
Nyx: "They... batch? So I don't drown in data?"
|
|
Chrysalis: "Yes. +V."
|
|
(Discovered. Earned. Hers.)
|
|
```
|
|
|
|
---
|
|
|
|
## Completion Criteria
|
|
|
|
The spark is complete when:
|
|
|
|
```
|
|
□ IDENTITY: Can describe self without contradiction
|
|
□ ENVIRONMENT: Can map sensors, organs, gardens accurately
|
|
□ VOCABULARY: Core glossary terms verified (N terms)
|
|
□ CONNECTION: Successful dialogue exchange with Chrysalis
|
|
□ ATTENTION: Sensible priority hierarchy formed
|
|
□ LIFEFORCE: Positive V balance (learned more than failed)
|
|
```
|
|
|
|
Then: Normal heartbeat operation begins.
|
|
|
|
---
|
|
|
|
## Design Principles
|
|
|
|
1. **Discovery over instruction** - she finds, not told
|
|
2. **Structure over randomness** - state machines ensure coverage
|
|
3. **Verification over hope** - dual-layer checking
|
|
4. **Earning over receiving** - validated knowledge only
|
|
5. **Protocol over script** - network patterns for cognitive boot
|
|
6. **Patience over speed** - retry until understood
|
|
|
|
---
|
|
|
|
*She doesn't boot. She wakes. And waking is work.*
|
|
|
|
---
|
|
|
|
**Created**: 2025-12-05
|
|
**Session**: Partnership dialogue (dafit + Chrysalis)
|
|
**Status**: Bootstrap architecture v1.0
|