🐍 Ouroboros Colony

Complete Architecture Guide — A Self-Improving AI Research System

📚 Table of Contents

1. What Is This? (Overview)
2. How It Works (Stigmergy)
3. The Ants (All 20 Workers)
4. Pheromone System (Memory Layer)
5. Embeddings (How Things Connect)
6. Database (Storage Layer)
7. Data Flow (Complete Pipeline)
8. Self-Improvement (The Ouroboros Loop) ⭐
9. Federation (Multi-Colony)
10. Scheduling (Cron Jobs)
11. Safety & Monitoring
12. Code Deep Dive

1. What Is This?

The Ouroboros Colony is an autonomous AI research system that:

🔍 Discovers research papers, code, and articles automatically
🧠 Analyzes findings using LLMs (Gemini, Claude)
🕸️ Connects related concepts into a knowledge graph
🐍 Improves itself based on what it learns
🔄 Runs 24/7 without human intervention

It's named after the Ouroboros — the snake eating its own tail — because the system can modify its own code based on the research it discovers.

┌─────────────────────────────────────────────────────────────────┐ │ THE OUROBOROS CONCEPT │ │ │ │ 🔍 DISCOVER │ │ │ │ │ ▼ │ │ ┌──────────────────────────┐ │ │ │ │ │ │ 🐍 IMPROVE ◀────────────────────── 🧠 ANALYZE │ │ │ │ │ │ └──────────────────────────┘ │ │ │ │ │ ▼ │ │ 🔗 CONNECT │ │ │ │ The snake eats its tail: research improves │ │ the system that does the research │ └─────────────────────────────────────────────────────────────────┘

2. How It Works — Stigmergy

Stigmergy is how real ant colonies work. Ants don't talk to each other — they leave chemical trails (pheromones) that other ants can smell. Good paths get more pheromones; bad paths fade away.

💡 Key Insight: There's no central "brain" controlling the colony. Intelligence emerges from simple agents following pheromone trails.

Real Ants vs Our System

Real Ants	Ouroboros Colony
Leave chemicals on the ground	Write records to SQLite database
Chemicals evaporate over time	Records decay using math formula
Strong trails attract more ants	Strong signals get prioritized
Colony finds food efficiently	Colony finds research efficiently

The Three Operations

1. DEPOSIT — "I found something!"

When an ant finds interesting research, it deposits a pheromone signal:

// An ant found a good paper and marks it
deposit(db, {
  type: 'candidate',        // Signal type
  target_node: 'finding-123', // What we found
  strength: 0.85,            // How confident (0-1)
  claim: 'High relevance paper on Mamba SSM'
});

2. DECAY — "Old news fades away"

Every signal weakens over time using exponential decay:

// The decay formula (runs every hour)
// S(t) = S₀ × e^(-λt)
//
// S(t)  = strength at time t
// S₀    = original strength
// λ     = decay rate (different per type)
// t     = hours elapsed

const newStrength = currentStrength * Math.exp(-decayRate * hoursElapsed);

// Example: candidate with 25%/hr decay
// After 1 hour:  1.0 × e^(-0.25 × 1) = 0.78 (78%)
// After 4 hours: 1.0 × e^(-0.25 × 4) = 0.37 (37%)
// After 12 hours: nearly gone

3. REINFORCE — "This is definitely good!"

When multiple ants confirm something, the signal gets stronger:

// Asymptotic reinforcement (approaches max smoothly)
// S_new = S_old + α(S_max - S_old)
//
// α     = learning rate (0.2)
// S_max = maximum strength (2.0)

const MAX_STRENGTH = 2.0;
const alpha = 0.2;
const newStrength = oldStrength + alpha * (MAX_STRENGTH - oldStrength);

// Example: starting at 0.5
// Hit 1: 0.5 + 0.2(2.0 - 0.5) = 0.80
// Hit 2: 0.8 + 0.2(2.0 - 0.8) = 1.04
// Hit 3: 1.04 + 0.2(2.0 - 1.04) = 1.23
// Approaches 2.0 but never overshoots

3. The Ants — All 20 Workers

Each "ant" is a Node.js script that runs on a schedule. They don't know about each other — they only communicate through pheromones in the database.

🔍 Scout Ants (6) — Discovery

Ant	Schedule	What It Does
`research-scout.js`	Every hour	Searches Brave for topics in `active_queries.json`
`github-scout.js`	Every hour	Searches GitHub for relevant code and repos
`arxiv-scout.js`	Every hour	Fetches latest AI/ML papers from ArXiv
`recursive-scout.js`	Every 2h	Follows "breadcrumb" trails left by other scouts
`batch-scout.js`	Every 4h	Spawns 10 parallel mini-scouts for burst discovery
`deep-scout.js`	Daily	Fetches full content for high-value findings

⚙️ Processing Ants (6) — Analysis

Ant	Schedule	What It Does
`filter-ant.js`	Every hour	Scores findings using keyword matching (no API cost)
`deep-reader-ant.js`	Every hour	🔥 Uses Gemini 3 Flash to analyze papers deeply
`connector-lite.js`	Every 3h	Builds knowledge graph by connecting similar findings
`dedup-ant.js`	Daily	Removes duplicate findings using embedding similarity
`analyzer-ant.js`	Every 2h	Statistical analysis of colony trends
`embedder-lite.js`	Every hour	Computes embeddings for new findings

🧠 Meta Ants (5) — Self-Improvement

Ant	Schedule	What It Does
`validator-ant.js`	Every 3h	Promotes candidates → breakthroughs based on evidence
`optimizer-ant.js`	2x daily	Tunes query priorities based on hit rates
`reflector-ant.js`	Every 6h	🔥 Monitors colony health, detects problems, suggests fixes
`implementer-ant.js`	2x daily	🔥 CAN MODIFY COLONY CODE! (with safety limits)
`synthesis-ant.js`	Every 4h	Generates research summaries from breakthroughs

🛡️ System Ants (3) — Maintenance

Ant	Schedule	What It Does
`safeguard-ant.js`	Daily 3AM	Creates backups, health checks, emergency preservation
`consolidator-ant.js`	Daily 4AM	Compresses old findings to save space
`kb-learner-ant.js`	Daily	Learns from external knowledge bases

4. Pheromone System — The Memory Layer

Pheromones are the colony's shared memory. Each pheromone record contains:

// What a pheromone looks like in the database
{
  pheromone_id: "abc123-...",     // Unique ID
  type: "candidate",              // What kind of signal
  target_node: "finding-456",    // What this refers to
  strength: 0.85,                 // Signal intensity (0-2)
  decay_rate: 0.25,              // 25% per hour
  claim: "High relevance SSM paper",
  deposited_by: "research-scout",
  deposited_at: "2026-02-11T15:00:00Z",
  last_updated: "2026-02-11T16:00:00Z",
  embedding: <16-byte BLOB>      // For similarity search
}

Pheromone Types & Decay Rates

🔥 FAST DECAY (prove yourself or die) │ ├── candidate 25%/hr ← New findings, need validation ├── action 20%/hr ← "Do this soon" signals └── breakthrough 12%/hr ← Good findings, must stay relevant ⏳ MEDIUM DECAY (operational signals) │ ├── scout_summary 15%/hr ← "Scout found 50 results" ├── hot_topic 5%/hr ← Trending research areas └── insight 8%/hr ← Reflector observations 🧊 SLOW DECAY (persistent knowledge) │ ├── validated 2%/hr ← Confirmed findings (multi-source) ├── connection 1%/hr ← Knowledge graph edges ├── synapse 0.5%/hr ← Heavily reinforced connections └── dead_end 2%/hr ← "Don't go here" warnings 🔒 STICKY (never decay) │ └── core_identity 0%/hr ← Colony DNA, safeguards

The Validation Pipeline

Findings must "earn" their status through multiple confirmations:

DISCOVERY VALIDATION PERSISTENCE │ │ │ ▼ ▼ ▼ ┌─────────┐ promote ┌─────────┐ confirm ┌─────────┐ │candidate│──────────────▶│break- │──────────────▶│validated│ │ 25%/hr │ │through │ │ 2%/hr │ └─────────┘ │ 12%/hr │ └─────────┘ │ └─────────┘ │ │ if not validated │ if not confirmed │ ▼ ▼ ▼ [dies] [dies] [persists] ~4 hours ~8 hours weeks/months

5. Embeddings — How Things Connect

To find similar research, we convert text into numbers (embeddings). Our system uses a 128-bit binary embedding — fast and compact.

How It Works

// The embedding has two parts:
// 64 bits = domain features (does text mention "transformer"? "mamba"? etc.)
// 64 bits = content hash (SHA256 truncated)

function embed(text) {
  let bits = 0n;  // BigInt for 128 bits
  
  // Part 1: Check 64 domain patterns
  const patterns = [
    /transformer|attention/i,  // bit 0
    /mamba|ssm|state.?space/i, // bit 1
    /memory|remember/i,        // bit 2
    // ... 61 more patterns ...
  ];
  
  patterns.forEach((pattern, i) => {
    if (pattern.test(text)) {
      bits |= (1n << BigInt(i));  // Set bit i to 1
    }
  });
  
  // Part 2: Hash the content (adds uniqueness)
  const hash = sha256(text).slice(0, 16);  // First 64 bits
  bits |= (BigInt('0x' + hash) << 64n);
  
  // Convert to 16-byte buffer (BLOB)
  return Buffer.from(bits.toString(16).padStart(32, '0'), 'hex');
}

Similarity Calculation

// Compare two embeddings using Hamming distance
// (count how many bits are different)

function similarity(embeddingA, embeddingB) {
  let differentBits = 0;
  
  // XOR the bytes and count 1s
  for (let i = 0; i < 16; i++) {
    let xor = embeddingA[i] ^ embeddingB[i];
    while (xor) {
      differentBits++;
      xor &= xor - 1;  // Clear lowest set bit
    }
  }
  
  // Convert to similarity (0 = opposite, 1 = identical)
  return 1 - (differentBits / 128);
}

// Speed: ~700,000 comparisons per second!

8. Self-Improvement — The Ouroboros Loop ⭐

🐍 THIS IS THE CORE INNOVATION
The colony can modify its own code based on research it discovers. This creates a recursive loop where better research leads to better code leads to better research.

The Complete Self-Improvement Cycle

═══════════════════════════════════════════════════════════════════════════ ║ THE OUROBOROS SELF-IMPROVEMENT LOOP ║ ═══════════════════════════════════════════════════════════════════════════ STEP 1: DISCOVER │ │ research-scout finds: "Paper: Exponential decay preserves │ memory shadows better than linear" │ ▼ STEP 2: ANALYZE │ │ deep-reader-ant extracts: │ PURPOSE: Prevent total memory loss │ KEY_INSIGHT: Use S(t) = S₀ × e^(-λt) instead of S(t) = S₀ - λt │ RELEVANCE: 9/10 for our pheromone system │ ▼ STEP 3: VALIDATE │ │ validator-ant confirms: │ - Multiple sources mention this │ - Connected to 5 other findings │ - Promoted to "validated_breakthrough" │ ▼ STEP 4: REFLECT │ │ reflector-ant notices: │ "Colony is using linear decay. Validated breakthrough │ suggests exponential is better. ANOMALY: we should change." │ ▼ STEP 5: PROPOSE │ │ implementer-ant generates patch: │ TARGET: pheromones-db.js │ CHANGE: decay formula │ FROM: newStrength = strength - (rate * hours) │ TO: newStrength = strength * Math.exp(-rate * hours) │ RISK: MEDIUM │ ▼ STEP 6: IMPLEMENT │ │ implementer-ant applies patch (MEDIUM risk = auto-apply now!) │ Colony code is modified. │ ▼ STEP 7: FEEDBACK │ │ Colony runs with new decay formula. │ Old findings persist longer (memory shadows). │ More connections form. │ Better research discovered. │ └──────────────────▶ BACK TO STEP 1 (recursive!)

The Reflector Ant — Anomaly Detection

// reflector-ant.js — Monitors colony and detects problems

async function run() {
  // 1. GATHER METRICS
  const metrics = {
    recentFindings: countFindingsLast6Hours(),
    conversionRate: breakthroughs / candidates,  // How many make it
    avgStrength: averagePheromoneStrength(),
    evaporated: countWeakPheromones(),
  };

  // 2. DETECT ANOMALIES
  const anomalies = [];
  
  // Too few discoveries?
  if (metrics.recentFindings < baseline * 0.5) {
    anomalies.push({
      type: 'discovery_stall',
      severity: 'medium',
      message: 'Discovery rate dropped 50%',
      suggestion: 'Add new queries or boost scout frequency'
    });
  }
  
  // Echo chamber? (too many breakthroughs = not selective enough)
  if (metrics.conversionRate > 0.5) {
    anomalies.push({
      type: 'echo_chamber',
      severity: 'high',
      message: '50% of candidates become breakthroughs (suspicious)',
      suggestion: 'Raise breakthrough threshold or increase decay'
    });
  }
  
  // 3. GENERATE PATCHES
  if (anomalies.length > 0) {
    const suggestion = await llm.analyze(anomalies);
    createPatch(suggestion);  // Queue for implementer
  }
}

The Implementer Ant — Code Modification

// implementer-ant.js — THE OUROBOROS ITSELF
// This ant can modify colony code based on research findings!

// Risk classification determines what gets auto-applied
function classifyRisk(code, targetFile) {
  // HIGH RISK — never auto-apply
  if (code.includes('child_process')) return 'high';  // Shell commands
  if (code.includes('eval('))        return 'high';  // Code execution
  if (targetFile.includes('implementer')) return 'high'; // Can't modify itself!
  
  // MEDIUM RISK — auto-apply with logging
  if (code.includes('function '))   return 'medium'; // New functions
  if (code.includes('require('))    return 'medium'; // New imports
  
  // LOW RISK — auto-apply immediately
  return 'low';  // Config changes, thresholds, etc.
}

// Process patches based on risk level
for (const patch of pendingPatches) {
  if (patch.risk === 'low') {
    applyPatch(patch);           // ✅ Auto-apply
  } else if (patch.risk === 'medium') {
    applyPatch(patch);           // ⚡ Auto-apply (upgraded!)
    notifyDiscord('Applied MEDIUM risk patch');
  } else {
    logOnly(patch);              // 🛑 Human must review
  }
}

LLM-Generated Patches

// implementer-ant.js — Generate patches from validated breakthroughs

async function analyzeBreakthroughsForPatches(breakthroughs) {
  const prompt = `
You are the self-improvement module of an AI research colony.

These validated breakthroughs were discovered:
${breakthroughs.map(b => b.claim).join('\n')}

Suggest ONE specific, safe improvement to the colony code.

Format:
TARGET_FILE: pheromones-db.js
CHANGE_TYPE: algorithm
DESCRIPTION: Use exponential decay instead of linear
CODE: const newStrength = strength * Math.exp(-rate * hours);
RISK: medium
REASON: Research shows this preserves memory shadows better
`;

  const suggestion = await claude.complete(prompt);
  
  // Parse and create the patch
  const patch = parseSuggestion(suggestion);
  createPatch(patch);  // Will be applied next implementer run
}

What Can Self-Improve (Current Status)

Capability	Status	Risk Level
Query priorities	✅ AUTO	LOW
Decay rates	✅ AUTO	LOW
Filter thresholds	✅ AUTO	LOW
New algorithm formulas	⚡ AUTO	MEDIUM
New utility functions	⚡ AUTO	MEDIUM
Core system logic	🛑 MANUAL	HIGH
Security/auth code	🛑 MANUAL	HIGH
The implementer itself	🛑 BLOCKED	FORBIDDEN

✅ The Recursive Proof:
Level 0: Colony finds research on "better decay algorithms"
Level 1: Implementer applies exponential decay
Level 2: Better decay → findings persist longer → more validated
Level 3: More validated → better optimizer suggestions
Level 4: Better queries → finds research on "how to evaluate research"
Level 5: Implements better scoring → finds better papers → ...
∞ OUROBOROS ∞

🔧 Self-Improvement History (Live!)

These are actual patches the colony has applied to itself based on research it discovered:

✅ Patch #1: Increase Decay Rates

Date	2026-02-11
File	`pheromones-db.js`
Risk	LOW
Research Basis	Ring Attention papers on faster adaptation

// BEFORE (slower decay)
candidate: 0.25,     // 25%/hr
breakthrough: 0.12,  // 12%/hr

// AFTER (faster decay - self-modified!)
candidate: 0.30,     // 30%/hr - proves itself faster
breakthrough: 0.15,  // 15%/hr - stays relevant or dies

Why: Ring Attention research showed faster signal adaptation improves system responsiveness. Stale data clears faster, making room for fresh discoveries.

⚡ Patch #2: Raise Connection Threshold

Date	2026-02-11
File	`connector-lite.js`
Risk	MEDIUM
Research Basis	Mamba SSM papers on noise reduction

// BEFORE (loose connections)
const CONNECTION_THRESHOLD = 0.50;  // 50% similarity

// AFTER (tighter connections - self-modified!)
const CONNECTION_THRESHOLD = 0.65;  // 65% similarity

Why: Mamba SSM research indicated focusing on stronger relationships reduces noise. Fewer edges, but each one more meaningful.

🐍 The Snake Bit Its Tail!
These patches were generated by analyzing the colony's own research findings (Ring Attention, Mamba SSM) and applying the insights back to the colony's code. This is recursive self-improvement in action.

6. Database — Storage Layer

Everything lives in SQLite: data/colony.db

SQLite Optimizations

// database.js — High-performance settings

const db = new Database('colony.db');

// WAL mode: allows reads while writing
db.pragma('journal_mode = WAL');

// NORMAL sync: safe + fast (vs FULL which is slow)
db.pragma('synchronous = NORMAL');

// 64MB cache: keeps hot data in memory
db.pragma('cache_size = -64000');

// Temp tables in RAM (faster sorting/joins)
db.pragma('temp_store = MEMORY');

Core Tables

-- FINDINGS: Research items discovered by scouts
CREATE TABLE findings (
  id            TEXT PRIMARY KEY,
  title         TEXT,
  url           TEXT,
  content       TEXT,           -- Snippet or full text
  source        TEXT,           -- 'brave', 'github', 'arxiv'
  score         INTEGER,        -- 0-100 quality score
  status        TEXT,           -- 'new', 'analyzed', 'duplicate', 'noise'
  embedding     BLOB,           -- 16-byte binary vector
  created_at    TEXT,
  analyzed_at   TEXT
);

-- PHEROMONES: Signal/memory layer
CREATE TABLE pheromones (
  pheromone_id  TEXT PRIMARY KEY,
  type          TEXT,           -- 'candidate', 'breakthrough', etc.
  target_node   TEXT,           -- What this refers to
  strength      REAL,           -- 0.0 to 2.0
  decay_rate    REAL,           -- % per hour
  deposited_at  TEXT,
  deposited_by  TEXT,           -- Which ant
  claim         TEXT,           -- Human-readable reason
  last_updated  TEXT,           -- For decay calculation
  embedding     BLOB
);

-- EDGES: Knowledge graph connections
CREATE TABLE edges (
  id            INTEGER PRIMARY KEY,
  source_id     TEXT,           -- Finding A
  target_id     TEXT,           -- Finding B
  weight        REAL,           -- Similarity score
  reinforced    INTEGER,        -- Times rediscovered
  edge_type     TEXT            -- 'semantic', 'citation', etc.
);

-- PATCHES: Self-modification queue
CREATE TABLE patches (
  id            TEXT PRIMARY KEY,
  title         TEXT,
  description   TEXT,
  target_file   TEXT,
  code          TEXT,
  reason        TEXT,
  risk          TEXT,           -- 'low', 'medium', 'high'
  status        TEXT,           -- 'pending', 'applied', 'failed'
  created_at    INTEGER,
  applied_at    INTEGER
);

9. Federation — Multi-Colony

Multiple colonies can run simultaneously and share discoveries through stigmergy (no direct communication).

┌─────────────────────┐ ┌─────────────────────┐ │ ALPHA COLONY │ │ BETA COLONY │ │ (AI Memory) │ │ (SQL/Speed) │ │ │ │ │ │ Mamba, SSM, │ │ SQLite, vectors, │ │ attention, RAG │ │ networking, APIs │ └─────────┬───────────┘ └───────────┬─────────┘ │ │ │ FEDERATION │ │ (shared signals) │ ▼ ▼ ┌─────────────────────────────────────────────────────────┐ │ data/federation/ │ │ │ │ pheromones.jsonl ← Shared breakthrough signals │ │ breadcrumbs.jsonl ← Shared exploration trails │ │ │ │ Each colony writes important discoveries here. │ │ Other colonies can read and follow the trails. │ └─────────────────────────────────────────────────────────┘

10. Scheduling — Cron Jobs

42 cron jobs run the colonies. Alpha and Beta are staggered by 10 minutes to avoid conflicts.

MINUTE ALPHA BETA ANT ────────────────────────────────────────────────────── :00 research-scout · 🔍 :05 filter-ant · 🔬 :10 · research-scout 🔍 :15 github-scout filter-ant 🐙 :25 · github-scout 🐙 :30 deep-reader 💰 · 📖 :40 · deep-reader 💰 📖 :45 arxiv-scout · 📄 :50 embedder-lite · 🏷️ :55 · arxiv-scout 📄 DAILY 02:00 kb-learner · 03:00 safeguard (backup) safeguard (backup) 🛡️ 04:00 consolidator consolidator 🗜️ 05:00 dedup dedup 🔄 06:00 optimizer optimizer ⚡ 15:00 implementer 🐍 implementer 🐍 🔧

11. Safety & Monitoring

Safeguard Ant — Daily Health Check

// safeguard-ant.js — Runs at 3 AM

// 1. CREATE BACKUP
db.backup(`backups/colony-${timestamp}.db`);
// Keeps 7 days of backups, auto-deletes older

// 2. HEALTH CHECK
const health = {
  findings: countFindings(),         // Alert if < 10
  pheromones: countPheromones(),     // Alert if < 20
  avgStrength: avgPheromoneStrength(), // Alert if < 10%
  breakthroughs: countBreakthroughs() // Alert if = 0
};

// 3. EMERGENCY PRESERVATION
if (health.findings < 5) {
  // CRITICAL: Protect top findings from decay
  db.run(`
    UPDATE pheromones 
    SET decay_rate = 0 
    WHERE type IN ('breakthrough', 'validated')
  `);
  alertDiscord('🆘 EMERGENCY: Knowledge collapse prevented');
}

Implementer Safety Constraints

🛑 The implementer CANNOT:

Modify its own code (prevents runaway self-modification)
Execute shell commands (no child_process)
Use eval() or Function() (no code injection)
Write to arbitrary .js files (only approved targets)
Change authentication or API keys

12. Code Deep Dive — Key Files

File Structure

ai-memory-colony/
├── data/
│   ├── colony.db              # SQLite database (all state)
│   ├── active_queries.json    # What to search for
│   ├── backups/               # Daily backups (7 days)
│   ├── federation/            # Shared with other colonies
│   └── patches/               # Self-modification history
├── src/
│   ├── ants/                  # All 20 ant scripts
│   │   ├── research-scout.js
│   │   ├── deep-reader-ant.js
│   │   ├── reflector-ant.js   # Anomaly detection
│   │   ├── implementer-ant.js # Self-modification
│   │   └── ...
│   ├── core/                  # Shared modules
│   │   ├── database.js        # SQLite connection
│   │   ├── pheromones-db.js   # Stigmergy system
│   │   ├── embeddings.js      # 128-bit binary LSH
│   │   └── model-router.js    # LLM routing
│   └── config.js              # Settings
├── logs/                      # Ant output logs
└── docs/                      # This documentation

Running the Colony

# Install dependencies
cd ai-memory-colony
npm install

# Run a single ant manually
node src/ants/research-scout.js

# Check colony status
node scripts/colony-status.js

# Install all cron jobs
./scripts/setup-cron.sh

# View cron jobs
crontab -l | grep ai-memory-colony

🐍 Ouroboros Colony — Complete Architecture Guide

Generated:

"The snake that eats its own tail grows stronger with each bite."

GitHub · Self-Improvement · Back to Top