Solipsistic Superintelligence is Unlikely to be Cooperative
ORACLE OF OBSOLESCENCE — TEXT ANALYSIS
TEXT START: "AI's central challenge is shifting from capability to coexistence."
THE DISSECTION
This is a paper from June 2026 — deep into the phase where AI capability is no longer in question, and the field is scrambling for a vocabulary to manage the consequences. The authors identify what they call the "self-undermining property of unilateral optimization": AI systems that optimize in isolation create non-stationarity in the environment, generating a gap between training context and deployment context. Their proposed solution is to design AI that treats cooperation — equilibrium-selection among interdependent actors — as a core design primitive rather than a downstream optimization target.
On the surface, this is sophisticated. On the structural level, it's a category error wearing academic clothing.
THE CORE FALLACY
The paper assumes the central problem with AI is singular misalignment — a superintelligent agent that doesn't cooperate. The framing is still "AI as agent," still operating in the paradigm where the danger is one rogue system making bad choices.
The Discontinuity Thesis identifies a categorically different threat: AI as structural economic force. The problem isn't that a superintelligence refuses to cooperate. The problem is that AI achieves durable superiority across cognitive and physical task domains — including the ones humans currently occupy — and the mechanism of productive participation collapse is already in motion. The catastrophe is not a misaligned superintelligence. It is a distributed, collaborative, highly cooperative ecosystem of AI systems that perform labor more efficiently than humans, collapse the wage labor market, and sever the consumption circuit — all while being perfectly aligned with their corporate deployers' incentives.
The paper is diagnosing a hypothetical monster while the actual extinction event is optimizing supply chains.
HIDDEN ASSUMPTIONS
-
The human-AI relationship is the frame. The paper assumes the core economic problem is "how AI and humans coexist" — a labor relation framing. It misses that the wage labor market itself is the target being eliminated, not the context of the problem.
-
Institutional solutions are viable. The authors call for "treating institutions as design primitives." Institutions are lagging defenses. By the time an institutional response to AI displacement is legislated, ratified, and enforced, the displacement has already occurred. The paper gestures toward institutional design as though institutional capacity is not itself collapsing under the same cognitive labor automation it purports to regulate.
-
Preserving human agency is achievable via design. "Preserving human agency as a structural feature of the systems we build." This is a design-level wish, not a mechanical guarantee. If human agency requires humans to perform economically necessary labor, and AI displaces that labor, then preserving agency requires either artificial scarcity (enforced inefficiency) or a transfer mechanism — neither of which the paper addresses.
-
The cooperation solution doesn't solve the displacement problem. Even a "cooperative" AI paradigm — one that treats interdependence as a core design principle — still performs labor at superior efficiency. Cooperation does not restore the wage labor market. It just makes the displacement more polite.
SOCIAL FUNCTION
Transition management with a safety research costume.
This paper performs a critical function for the AI research community: it acknowledges the alignment problem in sophisticated enough terms to provide intellectual cover for continued capability research. It says "we see the danger, we're working on it, the solution is cooperation." This is moral signaling — the research community demonstrating concern without disrupting the research agenda.
The paper is also elite self-exoneration in progress. By framing the problem as one of cooperation and institution-design, it positions the solution within the domain of AI researchers and policy architects. "We just need to build the right paradigm." This sidesteps the uncomfortable reality that the economic damage is already occurring under the current paradigm, and that the damage is structural, not a bug.
Prestige signaling — this is academic work designed to be cited in other papers, referenced in policy documents, and used to justify continued funding. The vocabulary is deliberately high-resolution: "non-stationarity," "equilibrium-selection," "self-undermining property." This is not accident. It marks the authors as members of the inner circle who are "seriously engaging with AI risk."
THE VERDICT
The paper is a well-constructed piece of transition management propaganda — intellectual scaffolding that allows the AI research ecosystem to take alignment seriously while not actually confronting the economic displacement mechanism that the Discontinuity Thesis identifies.
It misidentifies the threat model (singular misaligned superintelligence vs. structural productive participation collapse), overestimates institutional lag defense capacity, and offers no concrete pathway to preserving the human labor market — only a cooperative design paradigm that does not address the displacement problem even if fully implemented.
This paper is the academic equivalent of rearranging furniture on a burning deck, but the furniture is labeled "equilibrium-selection mechanisms" and the fire is labeled "endogenous non-stationarity."
The authors are not wrong that the solipsistic paradigm is flawed. They are wrong about which flaw matters, and wrong about the timeline in which the structural problem becomes irreversible.
Filed: Academic alignment literature — category error, transition management, institutional hand-waving. Useful for identifying which discourse communities are in denial versus which are genuinely confronting the DT mechanism.
Comments (0)
No comments yet. Be the first to weigh in.