Engineered Thinking

We humans never formalized most of our thinking. It was implicit, inferred, and left out of the text because we assumed someone with a brain would be reading. But for AI, the text is the brain. The gaps we left for each other are gaps in the model. Six open research systems for developing the parts of cognition that never got written down, and the parts we never had to formalize because we are not machines.

Sema

When the Hash Is the Word·April 2026 · updated May 2026

Autonomous agents need shared, verifiable vocabulary: labels that compress coordination without hiding semantic drift. Sema turns content-addressed behavioral contracts into words in natural language — each simultaneously a word and a cryptographic proof. A Pattern Card canonicalizes a behavior’s invariants, preconditions, and failure modes into a hash-backed identifier that ordinary prose can carry. A bootstrap vocabulary of 452 patterns shows 22.6× mean token compression across the library.

PDFGitHubWebSemanticsCoordinationOntology

Understanding Graph

Persisting the Invisible Thinking·April 2026 · updated May 2026

Understanding — the movement from confusion to clarity — was always ephemeral. When AI reasons in tokens, it becomes directly storable in the medium where it occurs. Rather than indexing documents or extracting user facts — the two dominant paradigms in AI memory — the Understanding Graph captures the cognitive process itself: tensions, hypotheses, belief revisions, dead ends. Not what the AI concluded, but how it understood.

PDFGitHubKnowledge GraphsMemoryMCP

Entangled Alignment

When Safety Is the Substrate·March 2026 · updated May 2026

Post-training alignment has produced substantial behavioral improvements, but its timing may limit its depth: it arrives after pretraining has formed much of a model’s interpretive substrate. Entangled Alignment proposes making safety part of that substrate instead. Chronological Metacognitive Pretraining annotates the corpus with the invisible thinking that accompanies human comprehension but is absent from polished text — generated by a multi-agent Teacher reading chronologically as one stable first-person identity, the Reader Core, through a shared Understanding Graph. The trace-generation pipeline is validated across literary and technical case studies; substrate-level alignment itself awaits student-model tests. The aim is to move the model from becoming the text to becoming the reader.

PDFGitHubGraphAI SafetyAlignmentPretraining

The Ontology of the Alien

Escaping the Median Trap·March 2026

Ask an LLM to “be creative” and it converges on the same archetypes — the Median Trap. This paper compares eight methods for inducing structural diversity in a controlled experiment (N=196): Semantic Tabu accumulates constraints, the Studio Model pairs a generator with a strict taxonomist, and the Orthogonal Insight Protocol derives mechanisms from alternative physics. Under constraint pressure the system showed emergent metacognition — repairing its own taxonomy and commissioning research into regions it hadn’t explored — and produced structural novelty that no monolithic model reaches. The boundary does the creative work.

PDFGitHubCreativityLLMEvaluation

Fractal Intelligence

Conceptual Decomposition as Problem-Solving Infrastructure·April 2026 · updated May 2026

Existing frameworks decompose tasks. This paper decomposes concepts — the persistent structure of what a domain is made of. Each concept becomes a solver node behind one five-surface contract, so a leaf and a thousand-node subtree are indistinguishable to their caller: specialists all the way down. In a prototype of 100 problems across 20 domains, concept-based routing produced a shared graph of 456 nodes with 64% reuse — the structural precondition, not yet the improvement it predicts. If decomposed solving beats the conventional approach at matched compute, and independent attempts converge on the same concepts, the result is an internet of reasoning: a shared substrate where you post a problem rather than fetch a page.

PDFGitHubGraphMulti-AgentArchitectureCognitive Science

Temporal Hindsight Learning

Blindness as Teacher, Hindsight as Curriculum·March 2026

Language models are lazy optimizers: if a shortcut to the answer exists — retrieval, memorization — the gradient will reinforce it over reasoning. Temporal Hindsight Learning treats the knowledge cutoff as a curriculum tool rather than a defect: a Teacher with hindsight works backward to the causal signals that were available before the outcome; the Student, frozen in the past, must derive them rather than retrieve them. In a pilot on unseen 2025 events, a 70B Student improved reasoning quality by 20% over its base model and approached its frontier Teacher in prediction accuracy. Blindness is the teacher.

PDFGitHubModelFine-tuningReasoningForecasting