A substrate-level operational falsification of strong semantic manifold assumptions, now sharpened by an obstruction-theoretic view of transformer inference.
This document targets a strong operational form of the Manifold Hypothesis as invoked in semantic space, concept vectors, smooth steering, semantic distance, latent traversal, and global continuity claims about transformer representations.
Local linear operability is not denied. Narrow operational regimes may admit approximate linearity, temporary feature extraction, useful probes, and short-horizon steering.
What is denied is the existence of a globally coherent manifold supporting stable coordinates, smooth transition maps, predictable transport, and semantic continuity across the reachable state space of real transformer inference.
Disconnected linear islands do not constitute a manifold. Useful fragments do not rescue a false global picture.
This work is operational and negative. It does not require a replacement semantic ontology in order to reject a false one. Its task is to show that widely invoked semantic-geometric assumptions are structurally incompatible with transformer computation as actually performed.
Once falsified, explanatory burden shifts. Proponents of manifold-based semantics must either weaken their claims until manifold structure is no longer required, or demonstrate validity under explicit discontinuity, aliasing, non-invertibility, finite precision, and path-dependent state collapse.
This work does not claim that transformers are uninterpretable, that probes never work, that useful structure does not exist, or that internal mechanisms cannot be studied.
It claims only that such successes do not license global semantic-geometric interpretation.
| Object | Description |
|---|---|
| X | Raw text prompts |
| T | Token sequences τ(X) |
| Hℓ ⊂ ℝᵈ | Reachable hidden states at layer ℓ |
| K,V | Cached attention state derived from prior sequence history |
| Y | Output token distribution after logit projection |
| Name | Claim |
|---|---|
| MH-A | Hℓ approximates a smooth low-dimensional manifold |
| MH-B | Stable coordinates correspond to semantic variation |
| MH-C | Small perturbations induce small, predictable changes |
| MH-D | Local validity can be coherently glued into global structure |
A fracture is a structural or operational mechanism that violates MH-A, MH-B, MH-C, or MH-D during real transformer inference.
An obstruction is not merely a local defect. It is a reason that local validity cannot be extended, glued, or globally completed. In this document the twelve fractures are treated as surface manifestations of deeper obstruction classes. The list shows where manifold assumptions fail. The obstruction view explains why those failures are not accidental.
The new addition is simple but powerful: the fractures can be regrouped as obstruction classes. This sharpens the falsification from a list of breaks into a theory of nonexistence.
H1 obstruction concerns local failure. A neighborhood cannot be made stably smooth, linearly transportable, or predictably controllable. H2 obstruction concerns gluing failure. Even if small regions appear workable, the overlaps between them do not compose coherently. H3 obstruction concerns higher-order global failure. Even after attempted repairs, there is no consistent global section, no stable atlas, and no valid manifold picture left.
In short: the twelve fractures show repeated breakage; H1, H2, and H3 show why the breakage is principled.
| Obstruction | Operational Meaning | What Fails | Representative Fractures |
|---|---|---|---|
| H1 | Local differential or neighborhood failure | Local smoothness, local predictability, local transport | 4, 7, 8, 12 |
| H2 | Transition and gluing failure across regions or histories | Patch consistency, path composition, overlap agreement | 5, 6, 9 |
| H3 | Higher-order global incompatibility | Existence of a coherent global atlas or section | 1, 2, 3, 10, 11 |
This does not mean every fracture is only one thing. Some fractures participate in more than one obstruction class. But the grouping is still useful because it separates three kinds of failure: local break, glue break, and global nonclosure.
| Idx | Fracture | Mechanism | Break Type | Manifold Property Violated | Obstruction | Status | Notes |
|---|---|---|---|---|---|---|---|
| 1 | Tokenization Quotient Break | Many-to-one non-invertible mapping | Topological | Global topology | H3 | Structural | Quotient singularities preclude stable manifold structure |
| 2 | Embedding Table Folding | Intersecting embeddings under training pressure | Geometric | Local injectivity | H3 | Structural | Self-intersections destroy coordinate uniqueness |
| 3 | Positional Phase Wrap | Periodic or rotary coordinate identification | Topological | Global charts | H3 | Structural | Phase seams enforce coordinate singularities |
| 4 | Attention Softmax Saturation | Exponentiation and normalization cliffs | Differential | Smooth transport | H1 | Structural | Degenerate response regimes fracture local continuity |
| 5 | Residual Dominance Shift | Abrupt pathway switching | Differential | Tangent stability | H2 | Structural | Nearby states can follow different effective compute paths |
| 6 | KV-Cache Aliasing | Distinct histories collapse to identical states | Topological | Trajectory injectivity | H2 | Structural | History cannot embed as a single faithful path |
| 7 | MLP Activation Saturation | Flat or clipped nonlinear regions | Differential | Local diffeomorphism | H1 | Structural | Neighborhood collapse blocks stable local coordinates |
| 8 | Finite Precision Quantization | Floating-point discretization | Numeric | Continuity | H1 | Structural | Lattice effects replace continuous geometry |
| 9 | Normalization Geometry Rewriting | LayerNorm or RMSNorm erase scale and rewrite relations | Numeric/Geometric | Metric persistence | H2 | Structural | Distances are recomputed rather than preserved across transport |
| 10 | Undefined Numeric States | NaN or Inf from overflow or instability | Topological | Totality | H3 | Operational | Hard representational holes break total state coverage |
| 11 | Logit Rank Collapse | Anisotropic vocabulary projection | Geometric | Dimensional regularity | H3 | Structural | Effective output dimension varies by regime |
| 12 | Stress-Prompt Discontinuities | Tiny prompt changes trigger large jumps | Empirical | Predictable response | H1 | Operational | Ordinary prompt variation can cross hidden fracture boundaries |
The original twelve fractures already constituted a strong structural falsification. The obstruction view does not replace them. It compresses them into three deeper forms of failure.
The list falsifies by accumulation: too many incompatible breaks must be ignored in order to preserve the manifold story. The obstruction view falsifies by necessity: once local, gluing, and global obstructions are present, manifold structure is not merely damaged. It is unavailable.
| Obstruction | Core Question | If the Answer is No | Result |
|---|---|---|---|
| H1 | Can a neighborhood be treated as stably smooth and locally predictive? | Local linearity is regime-bound and brittle | No reliable local manifold patch |
| H2 | Can workable local patches be glued across histories, overlaps, or transitions? | Transport fails across boundaries or alternative paths | No coherent transition structure |
| H3 | Can all local and overlap information be completed into a single global object? | Global closure fails even after attempted repair | No valid manifold exists |
There is a deep and broad dogma of semantics. It cascades from language into methods, metrics, steering claims, alignment narratives, and safety rhetoric. The danger is not merely philosophical. It is operational. A false geometric picture encourages false confidence about control.
| Method | Domain | Assumes MH | Fracture Index | Obstruction Exposure | Conflict | Risk if MH False | Notes |
|---|---|---|---|---|---|---|---|
| Sparse Autoencoders | McInt | Yes | 2,7,8,9,11 | H1,H2,H3 | Assumes smooth separable feature space | Feature drift and false atomization | Locally useful only |
| Steering Vectors | McInt | Yes | 4,5,7,12 | H1,H2 | Assumes linear semantic control | Brittle and regime dependent behavior | Context sensitive |
| Representation Similarity | McInt | Yes | 2,8,11 | H1,H3 | Metric continuity assumed | False similarity and false persistence | Correlational only |
| Belief Probes | Align | Yes | 4,5,6,7 | H1,H2 | Stable semantic coordinates assumed | False confidence in hidden state attribution | Axes are non-persistent |
| RLHF | Align | Implicit | 4,5,9,12 | H1,H2 | Assumes smooth reward landscape | Reward hacking and brittle control | Surface shaping only |
| Constitutional AI | Safety | Implicit | 4,5,7,12 | H1,H2 | Assumes continuous steerability | Sudden failure at fracture boundaries | Governance veneer |
| Logit Lens | McInt | No | – | – | Syntactic readout | Low manifold dependence | Pre-semantic and operational |
| Causal Tracing | McInt | No | – | – | Perturbational testing | Low manifold dependence | Model-agnostic |
| Red Teaming | Safety | No | 12 | H1 | Direct fracture probing | Ground truth over theory | Empirical check |
The following table evaluates commonly used transformer and interpretability terms by their scope of validity under the twelve structural falsifications and their obstruction collapse.
Classifications are operational, not ontological. They describe what a term can safely be used to claim, and where it silently overclaims.
| Term | Classification | Valid Use | Overclaim Risk | Obstruction Pressure | Notes |
|---|---|---|---|---|---|
| Token | Structural | Discrete algebraic primitive | None | Low | Foundation of computation; non-semantic by construction |
| Attention | Structural | Routing and weighting mechanism | Semantic attribution | H1 | Operationally precise; semantics often projected post hoc |
| Residual Stream | Structural | Additive state composition | Continuous trajectory claim | H2 | Additivity does not imply geometric smoothness |
| Embedding | Operational | Lookup-based representational handle | Semantic distance and neighborhood meaning | H3 | Folding and normalization undermine global geometry |
| Feature | Context-Bound | Repeatable activation motif in restricted regimes | Global semantic primitive | H1,H2 | Feature identity drifts across context and scale |
| Sparse Feature | Context-Bound | Local basis element under fixed conditions | Monosemantic interpretation | H1,H2 | Useful diagnostically; unstable under perturbation |
| Latent Space | Context-Bound | Visualization and local linear analysis | Global geometry and smooth traversal | H1,H2,H3 | Fails under normalization, aliasing, and rank collapse |
| Representation | Operational | Intermediate computational state | Semantic encoding claim | H1,H2,H3 | Representation does not equal meaning storage |
| Semantic Space | Category Error | None as internal substrate | Meaning-as-geometry projection | Total | Observer ontology, not model structure |
| Concept Vector | Category Error | None beyond heuristic steering | Stable semantic axis assumption | H1,H2,H3 | Violates coordinate persistence |
| Concept Neuron | Narrative | Pedagogical shorthand | Unit-level semantic attribution | H1,H2 | Fails under distribution shift |
| Belief | Narrative | External behavioral description | Internal state attribution | H1,H2 | Useful for UX, weak for mechanics |
| Knowledge Storage | Narrative | Informal behavioral description | Memory localization claims | H2,H3 | Computation is reconstructive, not archival |
| Understanding | Narrative | Human-facing evaluation | Internal competence inference | Total | Non-operational internally |
| Steering | Context-Bound | Short-horizon bias injection | Global control guarantee | H1,H2 | Sharp regime edges persist |
| Linear Probe | Operational | Telemetry and correlation detection | Causal or semantic inference | Low | Can work without global manifold commitments |
| SAE Feature | Context-Bound | Local coordinate extraction | Semantic atom claim | H1,H2,H3 | Feature identity is not invariant |
| Mechanistic Circuit | Context-Bound | Reusable execution fragment | Global module interpretation | H2 | Regime dependent |
| World Model | Narrative | Behavioral abstraction | Internal simulation claim | Total | Observer convenience term |
| Alignment | Operational | Behavioral constraint satisfaction | Internal value shaping | H1,H2 | Surface-level property |
| Safety | Operational | Failure avoidance and monitoring | Semantic guarantee inference | H1,H2 | Engineering discipline, not ontology |
| Context | Structural | Total boundary condition of computation | Verb-like usage | Low | Substrate, not operation |
| Context Window | Structural | Finite dependency horizon | Memory equivalence claim | H2 | Length does not imply persistence or faithful recall |
| Generalization | Operational | Performance outside training samples | Semantic abstraction inference | H1,H2,H3 | Often regime-specific |
The twelve fractures already defeat the strong semantic manifold hypothesis as an operational account of transformer inference. The obstruction view strengthens the result.
H1 says the local patch fails. H2 says the patches do not glue. H3 says no global completion exists.
The manifold story is therefore not merely approximate. In its strong semantic form, it is structurally unavailable.