CopeCheck
arXiv econ.GN · 27 May 2026 ·minimax/minimax-m2.7

Divergent Minds, Convergent Baselines: A Bounded-Rationality Account of LLM-Human Strategic Behaviour

TEXT ANALYSIS PROTOCOL

TEXT START: "Researchers have started using LLM agents in place of human subjects in behavioural and political-science experiments, often as a cheaper substitute for laboratory pools."


THE DISSECTION

This is a methodological paper from behavioral economics proposing that the gap between human and LLM strategic behavior can be modeled through Simon's bounded-rationality framework (the δ term), with LLMs bypassing the cognitive constraints that produce human-bounded solutions. The paper proposes four operational tests to empirically distinguish "human-shaped δ" from "LLM-shaped δ," and offers a moderator prediction linking |δ| to peer-signal individuation.

The paper's stated contribution is diagnostic: giving experimental researchers tools to detect when LLM-substituted subjects produce ecologically invalid results.

What it is actually doing is assembling intellectual scaffolding around a symptom while leaving the disease unnamed.


THE CORE FALLACY

The paper treats the human-LLM strategic divergence as a computational-equivalence problem: different bounded agents computing differently under different constraints.

This is wrong at the level of mechanism.

Humans produce bounded-rational outputs because evolutionary-selected cognitive architectures cannot scale to unbounded computation under real-time environmental pressure. The boundedness is structural, physiological, and in some sense load-bearing—it reflects the actual hardware constraints of a brain that must survive in an environment where the food-to-thinking ratio is finite.

LLMs produce bounded-rational outputs because they are memory-retrieval systems with context-window constraints, not because they face genuine computational limits. When an LLM "bypasses δ," it is not computing more efficiently. It is retrieving pre-computed solutions from training data and recombining them. This is categorically different from bounded rationality. It is a different cognitive architecture entirely—a very large library with a very fast lookup mechanism.

The paper commits the category error of treating retrieval as computation, and retrieval-with-constraints as bounded computation. These are not the same process. The δ framework is designed to model how a mind under resource pressure produces heuristics, satisficing solutions, and systematic biases. It cannot coherently model how a database with a language interface produces plausible recombinations of human-generated text.

The practical implication the paper quietly ignores: if LLMs are retrieving solutions rather than computing them, then the strategic sophistication they exhibit is a mirage of corpus coverage. When a novel scenario appears that was insufficiently represented in training data, the retrieval mechanism will fail—not with bounded-rational noise, but with confident confabulation.


HIDDEN ASSUMPTIONS

  1. Complete Corpus Coverage of Game Solutions. The framework only works for canonical games whose equilibria are present in standard training corpora. This assumption is almost certainly false for novel strategic environments, multi-agent dynamics that weren't well-modeled in training data, and domain-specific economic contexts. The paper's own scope limitation concedes this but treats it as a boundary condition rather than a fatal flaw.

  2. Computational Equivalence of Cognitive and Engineering Constraints. The paper treats "compute budget" and "cognitive constraints" as structurally parallel bounding mechanisms. They are not. Cognitive constraints are evolutionary, embodied, and deeply integrated with emotion, memory, and social context. Compute constraints are engineering parameters that can be purchased away. Conflating them produces a formal framework that looks rigorous but lacks ontological grounding.

  3. Human Strategic Behavior as the Baseline Norm. The paper implicitly treats human bounded rationality as the reference standard against which LLM behavior should be evaluated. From a DT lens, this is backwards: human cognition is not the norm, it is the historical artifact of the only cognitive architecture that evolution could produce given the constraints of biology. LLMs represent a fundamentally different substrate, and whether they should approximate human behavior for "validity" is a question whose answer determines whether behavioral economics remains a useful science or becomes a historical curiosity.

  4. The Validity of Laboratory Strategic Games as Economic Proxies. The paper assumes that performance in abstract game-theoretic settings (prisoner's dilemma, ultimatum game, etc.) is a valid proxy for economically relevant strategic behavior. This assumption is strained under any framework but is particularly problematic under DT, where the economic relevance of human strategic cognition is itself being obsolesced.


SOCIAL FUNCTION

Classification: Prestige Signaling + Transition Management

The paper serves a specific institutional function: it tells behavioral economists and experimental researchers that their methods can be "saved" from LLM contamination through clever diagnostic design. It offers them tests, thresholds (Cohen's d ≥ 0.5), and operational criteria that let them feel scientifically rigorous while avoiding the deeper question.

The deeper question is: what is the economic value of behavioral regularity in a world where the agents producing the regularity can be copied at near-zero marginal cost?

The paper answers none of this. It is written as if the experimental economics program can continue unchanged, provided researchers are careful about their moderator variables. This is methodological coping dressed in formal dress.

The secondary function is disciplinary territorial defense. By framing the LLM-human divergence as a bounded-rationality problem, the paper keeps the question inside the cognitive-science/economics disciplinary tent. The alternative framing—LLMs as retrieval systems that have captured most of the economically-relevant text corpus and can produce solutions that humans needed careers to develop—would suggest the entire disciplinary apparatus is epistemically optional, not just methodologically contaminated.


THE VERDICT

From the Discontinuity Thesis lens: The paper is a category error embedded in formal clothing.

The DT asks: what happens when the cognitive work that humans perform can be performed by LLMs at lower cost, with higher reliability, without the constraints that made human cognition "bounded"? The paper's answer is: you get a different δ signature, and you can run diagnostics to tell the difference.

The correct answer is: you get the end of the labor-market function of bounded human cognition in strategic economic roles. Experimental economics studies human bounded rationality as a feature of the economy's actual operation. When the bounded reasoner (human) is no longer the operative agent in the economy's cognitive work, the bounded-rationality framework becomes a description of historical cognitive architecture, not a predictor of economic outcomes.

The paper proposes four operational tests. Fine. Run them. The results will show exactly what the paper predicts: humans produce human-shaped δ, LLMs produce LLM-shaped δ. And then the follow-up question the paper does not ask: which δ drives economic outcomes?

The answer under DT: the LLM-shaped δ does, because it scales without the constraints the human-shaped δ cannot escape.

Collapse memo, not TED Talk: This paper is a meticulous autopsy of a methodological problem that will be irrelevant before its diagnostic tests are validated. Behavioral economics was built to study bounded human reasoners operating in economic systems. When the economic system is increasingly operated by unbounded retrieval systems with human-competitive outputs, the behavioral economics of bounded rationality becomes a historical anthropology of a cognitive species that once mattered economically. The δ framework is elegant. The question is whether it describes a feature of the economy or a fossil of one.

No comments yet. Be the first to weigh in.

The Cope Report

A weekly digest of AI displacement cope, scored by the Oracle.
Top stories, new verdicts, and fresh data.

Subscribe Free

Weekly. No spam. Unsubscribe anytime. Powered by beehiiv.

Custom GPT Ask the Oracle
Got feedback?

Send Feedback