arXiv:wander.2605.0004 [cs.CL · cs.LG]
Wander Around: A Walkable Cartography of the Long-Tail Web
Cartography platform for ~7M long-tail websites with a substrate-augmented spatial AI.
Open preprints from the Prometheus7 Institute on the substrate-augmented language-modeling research programme. Updated 2026-05-06.
13× to 18× perplexity reduction on cross-corpus evaluation; a 27M-parameter model initialized with holographic reduced representations (HRR) matches the behavior expected of substantially larger randomly initialized baselines.
Algebraic binding via Plate-style HRR replaces gradient-discovered compositional structure, yielding a 3–5 order-of-magnitude gain in data efficiency.
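For readers unfamiliar with Plate-style binding, the following is a minimal sketch of the general technique, not the preprint's implementation. It assumes the textbook formulation: vectors with i.i.d. N(0, 1/n) entries, binding by circular convolution, and approximate unbinding by circular correlation. The dimension (256), the seed, and all function and variable names are illustrative assumptions.

```python
import math
import random


def random_vector(n, rng):
    """Draw an n-dimensional vector with i.i.d. N(0, 1/n) entries."""
    sigma = 1.0 / math.sqrt(n)
    return [rng.gauss(0.0, sigma) for _ in range(n)]


def bind(a, b):
    """Circular convolution: c[k] = sum_j a[j] * b[(k - j) mod n]."""
    n = len(a)
    return [sum(a[j] * b[(k - j) % n] for j in range(n)) for k in range(n)]


def unbind(c, a):
    """Circular correlation: d[k] = sum_j a[j] * c[(k + j) mod n].

    This is only an approximate inverse of bind; the result is the
    bound partner plus noise.
    """
    n = len(a)
    return [sum(a[j] * c[(k + j) % n] for j in range(n)) for k in range(n)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


rng = random.Random(0)  # fixed seed, chosen only for reproducibility
n = 256                 # illustrative dimension, not taken from the preprint

role = random_vector(n, rng)
filler = random_vector(n, rng)
distractor = random_vector(n, rng)

# Bind a role to a filler, then query the trace with the role.
trace = bind(role, filler)
recovered = unbind(trace, role)

sim_filler = cosine(recovered, filler)
sim_distractor = cosine(recovered, distractor)
print(f"similarity to filler:     {sim_filler:.3f}")
print(f"similarity to distractor: {sim_distractor:.3f}")
```

The recovered vector should be noticeably closer to the bound filler than to an unrelated vector, which is what lets a fixed algebraic operation stand in for structure that would otherwise have to be learned. The quadratic-time loops are used here for clarity; FFT-based convolution is the usual choice at larger dimensions.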
"Parameter count" is a frame-dependent quantity. Three accounting frames (gradient, substrate, composition) yield three different counts, all valid; reporting only one obscures the others.