A study posted to arXiv argues that humans do not rely on a single strategy for physical reasoning — instead, they dynamically shift both how they predict object behaviour and how far ahead they plan, depending on available cognitive resources.
The research, led by scientists studying the intersection of intuitive physics and decision-making, addresses two debates that have run in parallel for years without resolution. The first concerns whether humans predict physical outcomes using a mental simulator — called the Intuitive Physics Engine (IPE) — or faster, pattern-based visual shortcuts. The second asks whether humans plan deliberatively by thinking multiple steps ahead, or make shallow, near-sighted choices. Until now, these questions had been studied separately.
The Overhang Tower Task
To test both questions at once, the researchers designed a task called Overhang Tower, in which participants stack blocks to create the greatest possible horizontal overhang from a base while keeping the structure stable. The task is elegant in its design: it requires both physical prediction (will this block fall?) and sequential planning (what is the best sequence of moves?). By varying time pressure and structural complexity, the team could observe how participants' strategies shifted under different cognitive loads.
These findings reveal a hierarchical, resource-rational architecture that flexibly trades computational cost against predictive fidelity.
The results showed a clear dual transition. In early, simpler stages of the task, participants' choices were best explained by IPE-based simulation — they appeared to mentally model physics. As complexity grew, behaviour shifted toward patterns consistent with CNN-based visual heuristics, meaning fast, appearance-driven shortcuts rather than explicit simulation. Simultaneously, time pressure caused participants to plan with shallower horizons, reducing deliberative lookahead in favour of more immediate, myopic decisions.
Why a Dual Shift Matters
Prior theoretical accounts predicted that people might shift either their prediction mechanism or their planning depth under pressure — but not both at the same time. The paper argues that observing both transitions simultaneously is precisely what prior single-mechanism models failed to anticipate. This dual shift, according to the authors, points to a hierarchical, resource-rational cognitive architecture: a system that flexibly reconfigures its own computational strategy based on available budget, rather than committing to one fixed approach.
The concept of resource-rationality — the idea that minds optimise behaviour given their own processing limitations — has gained traction in cognitive science over the past decade. This study's contribution is to show that resource-rationality extends not just to individual processes, but to how multiple distinct cognitive mechanisms coordinate with one another under pressure.
Unifying Two Longstanding Debates
The paper frames its contribution as a unification of what it calls two long-standing debates: simulation versus heuristics, and myopic versus deliberative planning. Rather than declaring one side of either debate correct, the researchers recast both as endpoints on a continuum — positions that humans occupy depending on context and cognitive constraints.
This framing has practical implications for how researchers build computational models of human physical reasoning. Models that assume a fixed prediction mechanism, or a fixed planning depth, will systematically mispredict human behaviour in conditions where resource pressure varies. The Overhang Tower task offers a controlled benchmark for testing such models, though it is worth noting that all findings reported are based on the authors' own experimental data, and the work has not yet undergone formal peer review as an arXiv preprint.
The study also raises questions about artificial intelligence systems designed to reason about the physical world. Many robotic planning systems and physics-based AI models rely on fixed simulation pipelines — they do not adapt their prediction strategy based on computational cost in the way this research suggests humans do. If human physical reasoning is genuinely resource-rational and dual-mechanism, designing AI systems with similar flexibility could be a meaningful research direction.
Implications for AI and Cognitive Modelling
For cognitive scientists, the study offers a new experimental paradigm. The Overhang Tower task is well-suited to generating the kind of behavioural data needed to distinguish between competing models of physical reasoning, and the dual-transition finding provides a specific, falsifiable prediction that future work can test. Replication across different populations and task variations will be an important next step.
For AI researchers, the paper's architecture of a system that flexibly reconfigures its own mechanisms — rather than committing to a single strategy — is a design principle worth examining. Current large-scale models for robotics and embodied AI typically select a method once and apply it uniformly. A system that dynamically balances simulation fidelity against computational cost, as this study suggests humans do, could perform more robustly across varying task demands.
What This Means
Human physical reasoning is more adaptive than existing cognitive models assumed — and building AI systems that match this flexibility may require rethinking how simulation and heuristic strategies are combined under computational constraints.