Phase 1.2 — Transformer Operations Specification

Status: Not started — Target: Weeks 3–8

Objective

Describe the transformer architecture precisely enough to evaluate each Lacanian property from Phase 1.1. The description must be technically accurate (able to withstand review by ML researchers) while remaining accessible to readers from the humanities.

Primary Sources

Vaswani et al., “Attention Is All You Need” (2017)
Elhage et al., “A Mathematical Framework for Transformer Circuits” (2021, Anthropic)
Olsson et al., “In-Context Learning and Induction Heads” (2022, Anthropic)
Templeton et al., “Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet” (2024, Anthropic)
Geva et al., “Transformer Feed-Forward Layers Are Key-Value Memories” (2021)
nostalgebraist, “interpreting GPT: the logit lens” (2020)
Belrose et al., “Eliciting Latent Predictions from Transformers with the Tuned Lens” (2023)

Deliverable: Computational Correlate Analysis

For each of the 10–15 Lacanian properties identified in Phase 1.1, provide:

(a) Most plausible computational correlate — the transformer mechanism that most closely parallels the Lacanian operation
(b) Most plausible point of disanalogy — where the parallel breaks down or the mechanisms diverge
(c) Confirming/disconfirming evidence — what empirical findings would strengthen or weaken the correspondence
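
One way to keep the three-part analysis uniform across properties is to record each entry in a small structure. The sketch below is illustrative only: the class name `CorrelateAnalysis`, its field names, and the `is_complete` helper are hypothetical and not part of the project plan. It simply mirrors items (a) to (c) above and tracks which entries are still unfilled.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CorrelateAnalysis:
    """One entry of the computational correlate analysis (hypothetical schema)."""

    number: int                   # property number as listed in this phase
    lacanian_property: str        # e.g. "Overdetermination"
    candidate_mechanism: str      # e.g. "Superposition"
    correlate: str = ""           # (a) most plausible computational correlate
    disanalogy: str = ""          # (b) most plausible point of disanalogy
    evidence_criteria: List[str] = field(default_factory=list)  # (c)

    def is_complete(self) -> bool:
        """An entry counts as complete only when (a), (b), and (c) are all filled in."""
        return bool(
            self.correlate.strip()
            and self.disanalogy.strip()
            and self.evidence_criteria
        )


# Two entries taken from the section headings; all fields (a) to (c) are still empty.
entries = [
    CorrelateAnalysis(1, "Two Fundamental Axes", "Attention vs. Feedforward?"),
    CorrelateAnalysis(4, "Overdetermination", "Superposition"),
]

# List the property numbers that still need work.
pending = [e.number for e in entries if not e.is_complete()]
print(pending)
```

Keeping the evidence criteria as a list rather than free text is a design choice of this sketch: it makes it easier to check each criterion separately against later empirical findings.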

Property 1: Two Fundamental Axes → Attention vs. Feedforward?

Computational correlate:

Disanalogy:

Evidence criteria:

Property 2: Retroactive Meaning → Layer-wise Representation Change

Computational correlate:

Disanalogy:

Evidence criteria:

Property 3: Signifier-to-Signifier → Token Embedding Relations

Computational correlate:

Disanalogy:

Evidence criteria:

Property 4: Overdetermination → Superposition

Computational correlate:

Disanalogy:

Evidence criteria:

Property 5: Inaccessibility → No Self-Introspection

Computational correlate:

Disanalogy:

Evidence criteria:

Property 6: Constitutive Lack → Autoregressive Drive

Computational correlate:

Disanalogy:

Evidence criteria:

Property 7: Quilting Points → Phase Transitions Across Layers

Computational correlate:

Disanalogy:

Evidence criteria:

Property 8: Formations of the Unconscious → Hallucinations/Anomalous Outputs

Computational correlate:

Disanalogy:

Evidence criteria:

Properties 9–15

To be completed in parallel with Phase 1.1.

Technical Notes

Space for notes on transformer mechanisms encountered during source reading.