Hierarchical Reasoning Model

Source

Raw Markdown: paper_hierarchical-reasoning-model-2025.md
PDF: paper_hierarchical-reasoning-model-2025.pdf
Preprint: arXiv 2506.21734
Official code: sapientinc/HRM

Core Claim

HRM uses interdependent high-level and low-level recurrent modules operating at different timescales to solve reasoning tasks in a single forward pass without explicit chain-of-thought supervision.

Relevance To This Wiki

HRM is a landmark recursive-reasoning branch adjacent to UT and looped Transformers: it shows that tiny recurrent systems can solve hard puzzle tasks without large language-model pretraining.

Limitations

The impressive evidence is concentrated in puzzle and ARC-style domains. Transfer to temporal world models requires a separate observation, action, and state contract.

Foundation TSFM Relevance

Important for hierarchical latent-state updates and fast/slow processing, but still architecture background rather than direct multivariate time-series evidence.

Links Into The Wiki

Open Questions

What matched-budget baseline should this source be compared against: unique-depth Transformer layers, recurrent state, explicit memory, or extra inference steps?
Which claims transfer from token-sequence reasoning to multivariate time-series state tracking, event streams, or action-conditioned world models?
Which part of the result is hierarchy, deep supervision, recurrence, or task-specific puzzle augmentation?

Alex Open Research Wiki

Explorer

Hierarchical Reasoning Model

Hierarchical Reasoning Model

Source

Core Claim

Relevance To This Wiki

Limitations

Foundation TSFM Relevance

Links Into The Wiki

Open Questions

Graph View

Table of Contents

Backlinks