COLM 2026 Submission

Context Management
Meets Epigenetics

EpiContext treats your agent's history like a genome: preserving everything, but dynamically regulating what gets expressed. Inspired by how your cells decide which genes to activate.

−90%
Token Reduction
−64%
Fewer Turns
p<0.001
Statistical Significance
6
Strategies Compared
Your Agent's Memory Problem
Modern AI agents accumulate massive context histories across hundreds of turns. Keeping everything is expensive; throwing things away loses critical information. There's a better way.
📋

Full Context

Keep every turn in the context window. Token costs explode, and LLM performance degrades from "lost in the middle" effects.

🗑️

Sliding Window

Only keep the last N turns. Simple and fast, but critical early decisions are permanently lost when they scroll out of view.

🧬

EpiContext

Store everything (the genome), but dynamically select what to include in each request (epigenetic expression). Content-aware, not time-based.

The Epigenetic Pipeline
Your agent's complete history is the genome. Three operators decide what gets expressed in each LLM call.

🧬 Context Graph

Complete history
(Immutable Genome)
→

🔇 Methylation

Silence resolved
or noisy blocks
→

📢 Acetylation

Activate relevant
tools & context
→

🎯 Fitness Function

F(P) = αR − βC + γI
Optimize & feedback
→

⚑ Optimized Payload

Minimal, relevant
context to LLM
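Read end-to-end, the pipeline is a pure function from the full history (the genome) to a minimal payload. The sketch below is illustrative only: the `resolved` flag, `tags`-free keyword matching, and both operator rules are assumptions, not the paper's implementation.

```python
def express(genome, task_keywords):
    """Illustrative single pass over the context graph:
    methylation silences resolved blocks, acetylation keeps
    only blocks relevant to the current task."""
    # Methylation: drop blocks already marked resolved (noise)
    active = [b for b in genome if not b.get("resolved", False)]
    # Acetylation: keep blocks that mention a current task keyword
    return [b for b in active
            if any(k in b["text"].lower() for k in task_keywords)]

history = [
    {"text": "Opened repo, listed files"},
    {"text": "Fixed bug after 12 attempts", "resolved": True},
    {"text": "Now writing tests for the parser"},
]
payload = express(history, ["parser", "tests"])  # only the last block survives
```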
How Epigenetic Regulation Works
Each operator addresses a different failure mode of context accumulation. Together they form a self-tuning system.

Memory Methylation

M: w(v) → 0 for resolved blocks

When the agent resolves a subproblem after extensive trial-and-error, those detailed logs become noise. Methylation detects low-progress segments and replaces them with compact summaries, preserving the insight while discarding the noise.
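One way to sketch this operator in Python. The run detection (a `resolved` flag) and the placeholder summarizer are assumptions; the paper's actual segmentation and summarization are richer.

```python
def methylate(blocks, min_run=3):
    """Replace each maximal run of >= min_run resolved blocks with one
    compact summary block (a naive stand-in for the real operator)."""
    def flush(run, out):
        if len(run) >= min_run:
            out.append({"text": f"[summary of {len(run)} resolved steps]",
                        "summary": True})
        else:
            out.extend(run)  # short runs are kept verbatim

    out, run = [], []
    for b in blocks:
        if b.get("resolved"):
            run.append(b)          # accumulate the resolved segment
        else:
            flush(run, out)
            run = []
            out.append(b)          # active blocks pass through untouched
    flush(run, out)                # handle a trailing resolved run
    return out
```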

Tool Acetylation

A: r(f, τ) = λ₁·name + λ₂·desc + λ₃·param

Modern agents carry 20+ tools, but any task needs only 2–3. Acetylation computes multi-level relevance scores between each tool and the current task, activating only what's needed.
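The multi-level score r(f, τ) = λ₁·name + λ₂·desc + λ₃·param can be sketched with plain word overlap. The overlap measure, the λ values, and the tool-dict shape below are all assumptions for illustration.

```python
def relevance(tool, task, lams=(0.5, 0.3, 0.2)):
    """Score one tool against the task at three levels:
    name, description, and parameter names (word-overlap stand-in)."""
    task_words = set(task.lower().split())
    def overlap(text):
        words = set(text.lower().replace("_", " ").split())
        return len(words & task_words) / max(len(words), 1)
    l1, l2, l3 = lams
    return (l1 * overlap(tool["name"])
            + l2 * overlap(tool["desc"])
            + l3 * overlap(" ".join(tool["params"])))

def acetylate(tools, task, k=3):
    """Activate only the k most relevant tools for this turn."""
    return sorted(tools, key=lambda t: relevance(t, task), reverse=True)[:k]
```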

Fitness Feedback

F(P) = α·R_task − β·C_token + γ·I_density

A joint objective over task success, token cost, and information density. After each turn, weights update: successful context is reinforced, failed context is suppressed. The system learns what to keep.
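In code, the objective and a per-turn update might look like the sketch below. The coefficient values and the exponential-style update rule are assumed for illustration, not taken from the paper.

```python
def fitness(reward, tokens, density, alpha=1.0, beta=1e-4, gamma=0.5):
    """F(P) = alpha*R_task - beta*C_token + gamma*I_density."""
    return alpha * reward - beta * tokens + gamma * density

def reinforce(weights, used_ids, success, lr=0.2):
    """Nudge the expression weights of the context blocks that were
    used this turn: toward 1 on success, toward 0 on failure."""
    target = 1.0 if success else 0.0
    for i in used_ids:
        weights[i] += lr * (target - weights[i])
    return weights
```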

Adaptive Strategy Switching
The breakthrough: don't apply epigenetic regulation on every turn. Use a simple sliding window for the early turns, then switch to content-aware filtering once the context volume exceeds the window.

🪟 Turns 1–10

Sliding Window
Simple & fast
→

🔄 Turn 10+

Switch to
EpiContext
→

✨ Result

Best of both:
efficient + intelligent

Like driving: use first gear in the parking lot, fifth gear on the highway. Don't use fifth gear everywhere.
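The switching rule itself is a one-line dispatcher. In the sketch below, the turn-10 threshold matches the description above, but the content-aware branch is only a placeholder (dropping blocks marked `resolved`), not the full EpiContext pipeline.

```python
def build_context(history, turn, switch_at=10, window=10):
    """Sliding window while the history is short; content-aware
    filtering once it outgrows the window."""
    if turn <= switch_at:
        return history[-window:]  # cheap early-phase path
    # Placeholder for the epigenetic path: silence resolved blocks
    return [b for b in history if not b.get("resolved", False)]
```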

Experimental Results
81 successful runs across 5 containerized tasks in the Harbor evaluation framework. 6 strategies compared under identical conditions.
Strategy                  | N  | Avg Turns | Avg Time (s) | Avg Input Tokens | Avg Output Tokens
Full-Context              | 15 | 10.6      | 69.6         | 11,980           | 2,581
Sliding Window            | 15 | 8.7       | 42.3         | 4,665            | 1,801
Methylation-Only          | 9  | 10.0      | 31.7         | 5,919            | 1,854
Acetylation-Only          | 12 | 12.5      | 75.6         | 14,264           | 3,362
EpiContext (v1)           | 15 | 12.5      | 61.6         | 13,791           | 3,351
Adaptive EpiContext (v2)  | 15 | 3.8       | 15.9         | 1,153            | 667

describe-image

−96%

1,801 vs 43,600 tokens
5 turns vs 20 turns

Statistical Tests

Turns: p = 0.0001
Tokens: p = 0.022

Paired t-test, 15 matched runs
Both significant at the 0.05 level

v1 → v2 Journey

12×

Token efficiency improvement
From worst to best strategy

Ready to dive deeper?

Read the full paper: 17 pages with architecture diagrams, mathematical proofs, ablation studies, and complete experimental data.

📄 Download Paper (PDF) 🔗 GitHub Repository