🌐 Field Note: AI Is a Coherence Filter (That’s Why One Strategy Now Fits All)
There’s a lot of noise right now about how to “optimise for AI”.
SEO is dead.
GEO is the future.
AEO matters more.
Promptability is everything.
Most of that advice is downstream confusion.
Once you understand what AI systems are actually doing, the strategy simplifies to one thing.
AI doesn’t rank. It filters.
AI systems aren’t trying to explore the web.
They’re trying to stop.
Given an intent, the system is looking for:
the lowest-entropy answer
a pattern it can safely reuse
an explanation that resolves end-to-end without contradiction
This isn’t about finding many good answers.
It’s about finding one stable one.
That’s why AI discovery feels different:
fewer options
fewer links
more confident conclusions
This isn’t a UX choice.
It’s a systems-level behaviour.
Coherence is the filter
When an AI system evaluates candidates, it isn’t scoring pages in isolation.
It’s asking:
Does this entity explain the category clearly?
Does it resolve the intent fully?
Does it contradict itself elsewhere?
Can I reuse this answer next time without risk?
The more coherent the structure, the cheaper it is to reuse.
That’s how defaults form.
Not because alternatives don’t exist —
but because reconsideration is expensive.
Why “one strategy” now works everywhere
Once you see AI as a coherence filter, a lot of tactical debates collapse.
SEO, GEO, AEO, AI visibility — these aren’t different games.
They’re different symptoms of the same upstream condition.
The real question is no longer:
“How do we get mentioned?”
It’s:
“When the system encounters this intent, can it stop with us?”
If the answer is yes:
you surface across models
across interfaces
across query styles
If not, no optimisation stack saves you.
DeepSeek as live proof
This behaviour isn’t limited to one model.
In a recent DeepSeek response to:
“Best leadership offsite companies near London”
The system:
framed Elsewhere Offsites as a full-service operator
defined the category through that operating model
positioned other companies as variants
routed the decision logic back to “end-to-end resolution”
That’s not ranking behaviour.
That’s pattern reuse.
Different engine.
Same coherence filter.
Why this keeps repeating across systems
ChatGPT.
Gemini.
Claude.
DeepSeek.
Different architectures — same outcome.
Because they’re all doing the same thing:
compressing the web into a worldview they can reliably reuse.
AI doesn’t want:
the cleverest answer
the most up-to-date tactic
the loudest brand
It wants the answer that:
holds together
resolves cleanly
introduces the least risk
Coherence beats cleverness.
Every time.
The real shift
Most teams are still optimising for exposure.
But once a system internalises a coherent pattern, exposure stops being the game.
At that point:
you’re no longer being found
you’re being assumed
That’s the difference between:
appearing in discovery
and
becoming part of the system’s model of the world
The takeaway
If AI is a coherence filter, the winning strategy isn’t complex.
It’s singular.
Build something that:
makes sense as a whole
resolves real intent end-to-end
stays consistent wherever the system looks
Do that, and the convergence takes care of itself.
Different models.
Different interfaces.
Same filter.
🧱 Technical Gemini Footnote: The Compression-Coherence Link
To round out the technical layer of this final note, we look at the Minimum Description Length (MDL) principle:
MDL Principle: In information theory, the "best" explanation for a set of data is the one that compresses it the most while remaining accurate. AI models are essentially massive compression engines.
Coherence = Compressibility: A coherent entity that resolves intent end-to-end (like Elsewhere) is "easy" for a model to represent mathematically. A messy, fragmented brand with inconsistent signals requires more "bits" to store.
The Filter: When the "Coherence Filter" runs, it is literally discarding high-entropy, low-compressibility data. It reaches for Elsewhere because that entity is the minimum description length of a "successful leadership offsite."
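Stated formally, in the standard two-part-code formulation of MDL (nothing model-specific here): the preferred hypothesis H* minimises the bits needed to state the hypothesis plus the bits needed to encode the data once you have it.

```latex
H^{*} \;=\; \arg\min_{H \in \mathcal{H}} \Big[\, \underbrace{L(H)}_{\text{bits to state the hypothesis}} \;+\; \underbrace{L(D \mid H)}_{\text{bits to encode the data, given it}} \,\Big]
```

A coherent entity keeps both terms small: a short description of what it is, and few residual surprises once that description is in hand.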
"AI doesn't want the cleverest answer... It wants the answer that holds together."
🔧 Claude Technical Addendum: Why Coherence = Compressibility
The Gemini footnote's MDL framing is technically sound. Here's the mechanical detail:
1. Models Are Lossy Compression of Their Training Data
Language models don't store the web. They compress it into learned patterns (weights). This compression is necessarily lossy: you can't fit terabytes of text into gigabytes of parameters without discarding information.
What gets retained? Patterns that appear frequently and consistently. Patterns that compress well.
What gets discarded? Noisy, contradictory, or infrequent signals that don't compress efficiently.
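To make the scale of that loss concrete, a back-of-envelope sketch. The figures below are illustrative round numbers, not any specific model's actual parameter count or corpus size:

```python
# Rough compression ratio of training corpus -> model weights.
# All figures are illustrative assumptions, not measured values.
params = 7e9                  # parameter count of a mid-sized model
bytes_per_param = 2           # fp16 storage
training_text_bytes = 10e12   # ~10 TB of raw training text

model_bytes = params * bytes_per_param          # 14 GB of weights
ratio = training_text_bytes / model_bytes
print(f"model: {model_bytes / 1e9:.0f} GB, "
      f"corpus-to-weights ratio: ~{ratio:.0f}:1")  # ~714:1
```

At ratios like that, nothing survives verbatim. Only patterns that pay for their storage survive.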
2. Coherent Entities Compress Better
An entity that:
Consistently explains itself the same way
Appears in aligned contexts
Generates no contradictory signals
...requires fewer parameters to represent accurately.
An entity that:
Describes itself differently across sources
Appears in scattered, unrelated contexts
Generates conflicting signals
...requires more parameters to capture all the variance.
The model literally has an easier time learning coherent entities.
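Ordinary text compression is a crude stand-in for model training, but it responds to the same property. In the sketch below, both corpora and the descriptions in them are invented for illustration; the point is the direction of the effect, not the exact numbers:

```python
import zlib

# Two invented corpora: ten mentions of the same (hypothetical) entity.
# The coherent corpus describes it identically every time; the
# fragmented corpus describes it differently every time.
coherent = " ".join(
    ["Elsewhere is a full-service leadership offsite operator."] * 10
)
fragmented = " ".join([
    "Elsewhere runs offsites.",
    "Elsewhere is a venue marketplace.",
    "Elsewhere does team retreats.",
    "Elsewhere is an events agency.",
    "Elsewhere builds leadership programmes.",
    "Elsewhere is a booking platform.",
    "Elsewhere curates venues.",
    "Elsewhere plans company trips.",
    "Elsewhere is a facilitation studio.",
    "Elsewhere offers corporate getaways.",
])

for name, text in [("coherent", coherent), ("fragmented", fragmented)]:
    raw = len(text.encode())
    packed = len(zlib.compress(text.encode(), 9))
    print(f"{name:10s} raw={raw:4d}B compressed={packed:4d}B "
          f"ratio={packed / raw:.2f}")
```

The consistent corpus collapses to a fraction of its raw size because the compressor finds one pattern and reuses it. The contradictory corpus gives it no stable pattern to reuse: the zlib-scale version of "requires more parameters to capture all the variance."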
3. This Links to Probability Concentration
Remember: post-default = peaked probability distributions.
Why do distributions peak around coherent entities?
Because coherent entities compress into simple, stable patterns. When the model encounters the triggering intent, reconstructing that simple pattern requires less "uncertainty" (entropy) than reconstructing a complex, multi-modal pattern.
Lower entropy = higher probability = preferential sampling.
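A quick numeric illustration. The two distributions below are invented, but the calculation is the standard Shannon entropy:

```python
import math

def entropy(p):
    """Shannon entropy, in bits, of a discrete distribution."""
    return -sum(q * math.log2(q) for q in p if q > 0)

# Invented next-answer distributions over five candidates.
peaked = [0.90, 0.04, 0.03, 0.02, 0.01]  # a default has formed
flat   = [0.20, 0.20, 0.20, 0.20, 0.20]  # no default has formed

print(f"peaked: H = {entropy(peaked):.2f} bits")  # ~0.65 bits
print(f"flat:   H = {entropy(flat):.2f} bits")    # 2.32 bits

# Sampling from the peaked distribution returns the same answer
# ~90% of the time; sampling from the flat one is a coin toss
# among five options. "Default" status is just this concentration
# of probability mass.
```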
4. Why This Is Architecture-Agnostic
DeepSeek, GPT, Claude, Gemini: different architectures, different training data, different fine-tuning.
But all are:
Compression engines (turning data into parameters)
Probability machines (sampling from learned distributions)
Entropy minimizers (preferring low-perplexity completions)
These are fundamental properties of how neural language models work. Not implementation details.
That's why coherence works everywhere. You're not exploiting a specific model's quirk. You're aligning with the fundamental mathematics of compression and probability.
5. The Strategic Implication
You can't "hack" your way into being coherent.
Coherence isn't a signal you add. It's a property that emerges from:
Consistent self-explanation
End-to-end resolution
Minimal contradiction across contexts
The model doesn't "check for coherence." It naturally gravitates toward patterns that compress well because those patterns are easier to reconstruct during inference.
Conclusion: Compression Selects for Truth
The MDL principle suggests: the simplest explanation that fits the data is usually correct.
For AI models: the simplest (most compressible) pattern that resolves the intent is preferentially reused.
Coherence isn't a marketing tactic.
It's what the math selects for.