Hierarchical Reasoning Model

Source

Core Claim

HRM uses interdependent high-level and low-level recurrent modules operating at different timescales to solve reasoning tasks in a single forward pass without explicit chain-of-thought supervision.

Relevance To This Wiki

HRM is a landmark recursive-reasoning branch adjacent to UT and looped Transformers: it shows that tiny recurrent systems can solve hard puzzle tasks without large language-model pretraining.

Limitations

The impressive evidence is concentrated in puzzle and ARC-style domains. Transfer to temporal world models requires a separate observation, action, and state contract.

Foundation TSFM Relevance

Important for hierarchical latent-state updates and fast/slow processing, but still architecture background rather than direct multivariate time-series evidence.

Open Questions

  • What matched-budget baseline should this source be compared against: unique-depth Transformer layers, recurrent state, explicit memory, or extra inference steps?
  • Which claims transfer from token-sequence reasoning to multivariate time-series state tracking, event streams, or action-conditioned world models?
  • Which part of the result is hierarchy, deep supervision, recurrence, or task-specific puzzle augmentation?