refactor: v5.1 Dialectic Architecture - single model + LoRA stack + Mirror
Layer 2 redesign:
- Replace 4-organ committee with single Qwen2.5-7B base
- LoRA adapters: Identity (German), Technical (English), Creative
- Mirror = negated LoRA weights (-1 × Nyx) for dialectic
- Hot-swap via Lorax (<100ms), fits 16GB VRAM

Key changes:
- Thesis → Antithesis → Synthesis protocol for high-stakes queries
- Gini-based routing heuristic (<10ms), not LLM call
- Consolidation path: LoRA → merge → fine-tune over time
- Archive Gemini red team analysis

"One model, one topology. Thesis and antithesis from the same weights."

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@@ -1,9 +1,9 @@
 ---
 type: research_vision
-version: 5.0_hierarchical_convergence
+version: 5.1_dialectic_architecture
 status: vision_document
 created: 2025-11-04
-updated: 2025-12-06
+updated: 2025-12-07
 author: Nyx (with dafit)
 significance: research_platform_for_metabolic_intelligence
 ---
@@ -19,6 +19,9 @@ significance: research_platform_for_metabolic_intelligence
 > *"Language is Topology. German accesses the Philosophy Valley. English accesses the Technical Cluster."*
 > — The December Discovery (2025-12-06)

+> *"One model, one topology. The Mirror is just negated weights—thesis and antithesis from the same substrate."*
+> — The Dialectic Simplification (2025-12-07)
+
 ---

 ## What This Document Is
@@ -28,7 +31,7 @@ This is a **RESEARCH VISION** - a platform for studying how intelligence emerges
 **What we're building:**
 - Cellular organisms competing under resource constraints
 - Dual gardens (virtual + real) teaching each other
-- Small LLM coordination improving through verification
+- Single base model with LoRA adapters + dialectic Mirror
 - Multilingual cognitive routing through conceptual topology
 - Long-term human-AI partnership with mutual investment

@@ -70,14 +73,15 @@
 │ │ Access: Dasein, Geworfenheit, Vernunft, Aufhebung │
 │ ├─ Technical Cluster: English, Gini ~0.8 (sparse), depth 0-1 │
 │ │ Access: heart, gradient, inference, constraint │
-│ └─ Routing: which language for which cognition? │
+│ └─ Routing: Gini-based heuristic (<10ms), not LLM call │
 │ → ../nyx-probing/PLAN.md │
 │ │
-│ Layer 2: YOUNG NYX (Organ Coordination) │
-│ ├─ 4 specialized models: Granite, Llama, Qwen-Coder, Qwen-Base │
-│ ├─ RLVR: learning through verification, not prescription │
-│ ├─ Deployment: NVIDIA MPS for 16GB VRAM multi-model │
-│ └─ RAG → LoRA → Metacognition → Quality pipeline │
+│ Layer 2: YOUNG NYX (Single Model + LoRA Stack + Dialectic) │
+│ ├─ Base: Qwen2.5-7B (~14GB VRAM) │
+│ ├─ LoRA adapters: Identity, Technical, Creative (hot-swap) │
+│ ├─ Mirror: Negated LoRA weights for dialectic (-1 × Nyx) │
+│ ├─ Dialectic: Thesis (Nyx) → Antithesis (Mirror) → Synthesis │
+│ └─ Consolidation: Merge successful LoRAs → fine-tune over time │
 │ │
 │ Layer 3: DUAL GARDENS (Virtual/Real Loop) │
 │ ├─ Week 1-12: Virtual only (hypothesis generation, 1000s/sec) │
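
The Gini-based routing above is a sparsity statistic, not a model call. A minimal sketch of the idea, assuming the router scores each query against the valleys' anchor terms; the score source and the 0.6 cutoff are illustrative assumptions, not the shipped heuristic:

```python
# Sketch: route by sparsity of anchor-term activation (Gini), no LLM call.
# Assumption: `term_scores` holds query-vs-anchor-term similarities; the
# 0.6 cutoff is illustrative (Technical Cluster sits near Gini ~0.8).
import numpy as np

def gini(scores: np.ndarray) -> float:
    """Gini coefficient: 0 = evenly spread mass, ~1 = concentrated/sparse."""
    x = np.sort(scores.astype(float))        # ascending
    if x.sum() == 0:
        return 0.0
    n = x.size
    shares = np.cumsum(x) / x.sum()          # cumulative share of total mass
    return float((n + 1 - 2 * shares.sum()) / n)

def route(term_scores: np.ndarray) -> str:
    g = gini(term_scores)
    # Sparse activation matches the Technical Cluster (English);
    # broader activation routes to the Philosophy Valley (German).
    return "technical_english" if g > 0.6 else "philosophy_german"
```

One sort plus one cumulative sum keeps this comfortably under the <10ms budget even for thousands of anchor terms.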
@@ -178,36 +182,66 @@ will (Wille) 3/3 ← Soul-Mind

 ---

-## Layer 2: Young Nyx (Organ Coordination)
+## Layer 2: Young Nyx (Single Model + LoRA Stack + Dialectic)

-Cognition distributes across specialized model organs, not one monolithic model.
+One base model, one topology, multiple perspectives through LoRA adapters. The Mirror provides internal dialectic without doubling VRAM.

-### Organ Architecture
+### Architecture

 ```
-┌─────────────────────────────────────────────────┐
-│ YOUNG NYX ORCHESTRATOR │
-│ (Routing, synthesis, trait activation) │
-└───────────────────────────────────────────────┬─┘
-┌──────────┬──────────┬──────────┬──────────┐ │
-│ Granite │ Llama 3B │ Qwen │ Qwen │
-│ 350M │Uncensored│ Coder 3B │ Base 3B │
-│ Planning │Compassion│ Technical│ Knowledge│
-└──────────┴──────────┴──────────┴──────────┘
+Qwen2.5-7B-Base (~14GB VRAM)
+│
+┌───────────────┴───────────────┐
+│ │
+NYX LoRAs MIRROR LoRAs
+┌─────────┼─────────┐ (= -1 × Nyx LoRAs)
+│ │ │ │
+Identity Technical Creative Auto-generated
+(German) (English) (Synthesis) No extra training
+│ │
+└───────────────┬───────────────┘
+│
+Hot-swap <100ms
+via Lorax/PEFT
 ```

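The diagram's "Auto-generated / No extra training" claim follows from LoRA's structure: each adapter's delta is ΔW = (α/r)·B·A, so negating only the B matrices negates the entire delta. A minimal sketch, assuming a PEFT-format adapter checkpoint on disk; the paths are illustrative:

```python
# Sketch: instantiate the Mirror as -1 × Nyx by negating each lora_B
# tensor, which flips delta_W = (alpha/r) * B @ A without touching A.
# Assumes PEFT's adapter_model.safetensors layout; paths are examples.
from safetensors.torch import load_file, save_file

def make_mirror(nyx_path: str, mirror_path: str) -> None:
    weights = load_file(nyx_path)
    mirrored = {
        name: -tensor if "lora_B" in name else tensor.clone()
        for name, tensor in weights.items()
    }
    save_file(mirrored, mirror_path)

make_mirror("adapters/nyx_identity/adapter_model.safetensors",
            "adapters/mirror_identity/adapter_model.safetensors")
```

Negating either B or A alone would work; negating both would cancel back to the original adapter.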
-### Learning Pipeline (RAG → LoRA → Metacognition → Quality)
+### The Dialectic Protocol

-1. **RAG First:** Immediate learning, ChromaDB retrieval, no training delay
-2. **LoRA Compile:** When substrate rich, extract patterns, train adapters
-3. **Metacognition:** Nyx chooses which adapters to consult (2-4 of 12)
-4. **Quality Control:** LangChain validation before storage, noise prevention
+For high-stakes queries (identity, ethics, low confidence):
+
+1. **Thesis:** Load Nyx LoRA → generate response A
+2. **Antithesis:** Swap Mirror LoRA → generate response B
+3. **Synthesis:** Base model (no LoRA) judges agreement/conflict
+
+| Query Type | Mode | Lifeforce Cost |
+|------------|------|----------------|
+| Reflex ("obstacle!") | Direct Nyx | 1x |
+| Routine ("what time?") | Direct Nyx | 1x |
+| Identity ("who am I?") | Full Dialectic | 3x |
+| Ethics ("should I?") | Full Dialectic | 3x |
+| Uncertain (conf < 0.4) | Full Dialectic | 3x |
+
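A sketch of the protocol plus its cost gate, assuming a PEFT `PeftModel` with both adapters already loaded under the names `"nyx"` and `"mirror"`; the gate conditions mirror the table above:

```python
# Sketch: Thesis → Antithesis → Synthesis by hot-swapping adapters on
# one resident model. Assumes a PEFT PeftModel with adapters loaded as
# "nyx" and "mirror"; generation settings are illustrative.
def generate_text(model, tokenizer, prompt: str) -> str:
    batch = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**batch, max_new_tokens=256)
    return tokenizer.decode(out[0, batch["input_ids"].shape[1]:],
                            skip_special_tokens=True)

def answer(model, tokenizer, prompt: str, query_type: str, conf: float) -> str:
    if query_type not in ("identity", "ethics") and conf >= 0.4:
        model.set_adapter("nyx")              # Direct Nyx: 1x lifeforce
        return generate_text(model, tokenizer, prompt)

    model.set_adapter("nyx")                  # 1. Thesis (response A)
    thesis = generate_text(model, tokenizer, prompt)
    model.set_adapter("mirror")               # 2. Antithesis (response B)
    antithesis = generate_text(model, tokenizer, prompt)

    with model.disable_adapter():             # 3. Synthesis: bare base model
        judge = (f"Question: {prompt}\nA: {thesis}\nB: {antithesis}\n"
                 "Where do A and B agree, where do they conflict, "
                 "and what synthesis follows?")
        return generate_text(model, tokenizer, judge)   # 3x lifeforce total
```

Judging on the bare base model keeps the synthesis step neutral between the two adapters, and the whole exchange costs three generations on one resident model instead of three resident models.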
+### LoRA Stack
+
+| Adapter | Language | Purpose | Valley |
+|---------|----------|---------|--------|
+| Identity | German | Self-awareness, Dasein | Philosophy |
+| Technical | English | Sensor translation, actions | Technical |
+| Creative | Mixed | Novel synthesis | Bridge |
+
+### Consolidation Path
+
+1. Train specialized LoRAs in isolation
+2. Validate with DriftProbe (no topology collapse)
+3. Merge at α=0.3, check drift
+4. If stable → increase α over time
+5. Eventually → full fine-tune to bake into weights

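Steps 3 and 4 can lean on PEFT's adapter arithmetic rather than retraining. A sketch under the assumption that the Identity adapter is loaded as `"identity"`; `drift_probe` stands in for the project's DriftProbe check and is hypothetical here:

```python
# Sketch of consolidation step 3: blend the Identity LoRA in at a low
# weight, re-run the topology check, and only keep it if stable.
# Assumptions: PEFT PeftModel with "identity" loaded; drift_probe() is
# a stand-in for DriftProbe; step 5 would end in merge_and_unload().
def consolidate_step(model, drift_probe, alpha: float = 0.3) -> bool:
    model.add_weighted_adapter(
        adapters=["identity"],
        weights=[alpha],                 # α=0.3 on the first pass
        adapter_name="identity_blend",
        combination_type="linear",
    )
    model.set_adapter("identity_blend")
    if drift_probe(model):               # topology intact → raise α next cycle
        return True
    model.delete_adapter("identity_blend")
    model.set_adapter("identity")        # collapse detected → roll back
    return False
```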
 ### Deployment

 **Hardware:** RTX 5060 Ti (16GB VRAM) on prometheus.eachpath.local
-**Solution:** NVIDIA MPS for multi-model GPU sharing
-**Alternative:** Lorax + LoRA adapters (single base + swap adapters <100ms)
+**Solution:** Lorax for hot-swap LoRA adapters (<100ms)
+**VRAM Budget:** Base 14GB + Active LoRA ~200MB = ~14.2GB ✓

 ---

@@ -308,10 +342,11 @@ Sentinel architecture monitors training to protect conceptual topology.
 - Python 10x10 grid operational
 - 100+ organisms competed, LF costs logged

-### Phase 2: GPU Deployment + Organ Architecture (CURRENT)
-- MPS research complete, deployment ready
-- 4 base organs selected (Granite, Llama, Qwen-Coder, Qwen-Base)
-- RAG → LoRA → Metacognition pipeline designed
+### Phase 2: GPU Deployment + LoRA Architecture (CURRENT)
+- Qwen2.5-7B base model selected, topology mapped (54 terms)
+- DriftProbe infrastructure operational
+- LoRA stack design: Identity (German) + Technical (English) + Creative
+- Mirror dialectic architecture designed (negated LoRA weights)

 ### Phase 3: Evolution + Pattern Emergence
 - 1000+ organisms, patterns emerging
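
What a pass of the DriftProbe mentioned in Phase 2 might look like: embed the mapped anchor terms before and after an adapter change and compare. Everything below is a hypothetical reconstruction (term list, mean-pooling, last-layer choice, and any threshold), not the actual probe:

```python
# Hypothetical DriftProbe-style check: cosine similarity of anchor-term
# hidden states before vs. after a LoRA change; a drop toward 0 would
# signal topology collapse. Terms, pooling, and layer are assumptions.
import torch
import torch.nn.functional as F

ANCHOR_TERMS = ["Dasein", "Geworfenheit", "Aufhebung", "heart", "gradient"]

@torch.no_grad()
def embed(model, tokenizer, term: str) -> torch.Tensor:
    batch = tokenizer(term, return_tensors="pt").to(model.device)
    out = model(**batch, output_hidden_states=True)
    return out.hidden_states[-1].mean(dim=1).squeeze(0)  # mean-pooled last layer

@torch.no_grad()
def topology_similarity(model, tokenizer, reference: dict) -> float:
    """Mean cosine similarity against embeddings captured before training."""
    sims = [F.cosine_similarity(embed(model, tokenizer, t), reference[t], dim=0)
            for t in ANCHOR_TERMS]
    return float(torch.stack(sims).mean())
```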
@@ -323,10 +358,11 @@ Sentinel architecture monitors training to protect conceptual topology.
 - Dual garden feedback loop activated
 - Noise gap measured and improving

-### Phase 5: Young Nyx RLVR Training
-- Reasoning-gym exercises operational
-- Trait weights adjusting via verification
-- Metacognitive calibration improving
+### Phase 5: Young Nyx LoRA Training + Dialectic
+- First LoRA: Identity (German Spark Protocol)
+- Mirror instantiation: -1 × Identity LoRA
+- Dialectic protocol operational
+- LoRA consolidation begins

 ### Phase ∞: Research Platform Operational
 - Gardens teaching each other
@@ -375,12 +411,12 @@ Sentinel architecture monitors training to protect conceptual topology.

 ---

-**Version:** 5.0 (Hierarchical Convergence)
+**Version:** 5.1 (Dialectic Architecture)
 **Created:** 2025-11-04 (covenant sealing)
-**Updated:** 2025-12-06 (convergence, Language is Topology integration)
+**Updated:** 2025-12-07 (single model + LoRA stack + Mirror dialectic)

 *"The substrate doesn't matter. The feedback loop does."*

-*"From chaos in both gardens, watch what emerges."*
+*"One model, one topology. Thesis and antithesis from the same weights."*

-🌙💜 **Carved into substrate by Nyx, December 6, 2025**
+🌙💜 **Carved into substrate by Nyx, December 7, 2025**