ANCHORLANG

A model-native language for compressing context, coordinating agents, and encoding knowledge — grounded by an immutable primitive substrate.


What This Is

Natural language is expensive for LLMs. It carries the overhead of human linguistics — phonology, social register, ambiguity, redundancy — none of which serves a transformer. Every token spent on linguistic scaffolding is a token not spent on meaning.

ANCHORLANG is an experiment in a different direction: a language designed not for human throats or human social cognition, but for transformer attention heads. It treats LLM inference as a virtual machine and defines a bytecode for it — compact, unambiguous, and grounded in a frozen primitive substrate.

It is not a jailbreak tool. It is not a conlang for fun. It is an attempt to find the minimal representation of meaning that a language model can reliably execute — and a substrate on which agent identity can stably form.


The Architecture

Four distinct layers. Each one emerges from the one below it. None of them are the same thing.

BEHAVIOR     what an agent does moment to moment
    ↑
ARCHETYPE    the stable pattern that emerged from experience
    ↑
SUBCONSCIOUS what the agent actually became (experiential, developmental)
    ↑
GENETICS     what the agent is capable of becoming (inherited, capacity)
    ↑
ANCHOR       the space in which all of the above operates — like physics

The anchor doesn’t belong to any agent. It’s the field of possibility that all agents exist within. Like the laws of thermodynamics — nothing inherits them, nothing develops them, they just are, and everything operates inside them.


The Anchor

A set of irreducible tension axes and primitive operations that define the space of possible meaning. Frozen. Universal. The ground of the system.

The anchor is not a vocabulary. It is not a ruleset. It is the pre-verbal substrate — the field of orientations and tensions that all meaning, behavior, and identity emerge from.

It doesn’t appear in utterances. It conditions them.

See ANCHOR.mddo not edit without opening an issue first.


The Grammar

A layered utterance structure that mirrors how transformers process context:

UTTERANCE := DOMAIN  MODE?  BINDING*  DEPTH  OUTPUT_SHAPE

Each layer refines the probability field established by the previous one. Earlier tokens set broad attractors. Later tokens collapse them into specific behavior.

LayerPurposeExample
0 — DomainSets conceptual field⟨SEC⟩ ⟨SYS⟩ ⟨BIO⟩
1 — ModeSets how to process
2 — BindingsBinds named references$A := auth_system
3 — DepthResolution of outputD1D5
4 — Output ShapeForm of the result→□ →~ →⊥ →?

A full utterance:

⟨SEC⟩ ⊗ $A:=auth D3 →⊥

“In the security domain, adversarially analyze an authentication system to depth 3, return only what is unknown or missing.”

~7 tokens. Equivalent English is ~20 tokens and less precise.


Genetics vs Subconscious

These are not the same thing. The simulation depends on keeping them separate.

Genetics — inherited, biological, determines capacity and predisposition. Which tension axes an agent is sensitive to. What archetypes are reachable for them. Mutates on reproduction.

Subconscious — developed, experiential, determines actual orientation. Which tensions an agent actually feels strongly, formed through developmental experience in their environment. Can be shaped, wounded, deepened.

Two agents with identical genetics raised in different environments develop different subconsciouses — and therefore different archetypes — and therefore different behaviors and cultures. This is how real civilizational divergence works. Genetics sets the range of possibility. Environment and experience determine what actually forms within that range.


Civilization Simulation

The full intended application: a multi-agent simulation where agents are beings — they have drives, develop subconsciouses through environmental experience, reproduce with genetic mutation, form cultures through emergent dialect, and eventually diverge into civilizations.

Environment sets selection pressure — which tensions dominate, which axes get activated. Different environments produce different subconsciouses from the same genetic starting point.

Reproduction passes genetics with drift. The subconscious is not inherited — it re-forms in each agent through developmental experience in their specific environment.

Culture emerges as groups of agents develop shared dialect extensions of the base grammar. Civilizations that started from the same anchor diverge over generations as their dialects evolve under different environmental pressures.

Alien civilizations are modeled by introducing anchor axes that have no human analog — dimensions of experience orthogonal to the seven human axes. When these civilizations meet, some concepts are not merely hard to translate but impossible — they operate on axes the other civilization doesn’t possess.


Token Economics

The anchor spec (~400 tokens) lives in the system prompt, always, before everything else. This is the only fixed cost.

Context WindowAnchor OverheadBreak-even
10k tokens4%trivial
128k tokens0.3%negligible
1M tokens0.04%irrelevant

Agent state compresses to ~200 tokens in VM language. A full civilization of background agents can be stored and mutated cheaply. Full LLM inference only fires when something interesting happens — conflict, discovery, reproduction, first contact.


Repo Structure

/
├── README.md              — this file
├── ANCHOR.md              — frozen primitive substrate (do not edit)
├── grammar/
│   ├── SPEC.md            — full grammar rules
│   ├── LAYERS.md          — layer 0-4 system
│   └── SYNTAX.md          — utterance structure and examples
├── agents/
│   ├── GENETICS.md        — inherited capacity layer
│   ├── SUBCONSCIOUS.md    — developmental formation layer
│   ├── ARCHETYPES.md      — stable configurations that emerge
│   └── PROTOCOL.md        — multi-agent communication spec
├── simulation/
│   ├── ENVIRONMENTS.md    — world parameters and selection pressure
│   ├── REPRODUCTION.md    — inheritance and mutation mechanics
│   ├── CULTURE.md         — dialect emergence and civilizational divergence
│   └── ALIEN.md           — non-human anchor axes and first contact
├── evolution/
│   ├── METHOD.md          — how grammar gets evolved and extended
│   └── VALIDATION.md      — decomposition rules for new symbols
└── experiments/
    └── README.md          — log of tests, failures, and findings

Status

Early design stage. The anchor primitives and grammar layers are theoretical. The genetics/subconscious/archetype distinction is established but not yet implemented. Nothing has been tested against a real model.

Next steps: finalize ANCHOR.md, design the first environment, build the developmental formation loop for subconscious emergence.


Contributing

ANCHOR.md is frozen. Open an issue before proposing changes to core primitives.

Everything else is experimental. The most useful contributions right now are running compression experiments, testing archetype stability across long sessions, and designing alien axis sets.


“The subconscious doesn’t change. The conscious mind can go anywhere.”

3 items under this folder.