A Path Towards Autonomous Machine Intelligence

Source

Raw Markdown: paper_lecun-autonomous-machine-intelligence-2022.md
PDF: paper_lecun-autonomous-machine-intelligence-2022.pdf

Core Claim

LeCun proposes an autonomous intelligence architecture built from configurable predictive world models, intrinsic objectives, hierarchical planning, and joint embedding architectures trained by self-supervised learning.

Key Contributions

Frames world models as the missing substrate for human-like sample efficiency, reasoning, and planning.
Argues for prediction in representation space rather than direct pixel-level prediction.
Connects intrinsic motivation, actor modules, cost modules, and latent variables into one agent architecture.

Method Notes

This is a position paper rather than a narrow empirical result. It provides the conceptual root for JEPA, Energy-Based Models, and World Models in this wiki.

Evidence And Results

The evidence is architectural and argumentative: the paper compares limits of supervised learning, reinforcement learning, and generative modeling, then motivates hierarchical predictive representations.

Limitations

The proposal is broad and leaves many training details unresolved; later sources such as LeJEPA and LeWorldModel instantiate pieces of it.

Foundation TSFM Relevance

Agenda slot	Verdict	Evidence	Missing pieces
Control and counterfactuals	adjacent	Proposes actor-generated action sequences evaluated by a predictive world model and cost module before executing the first action.	Position paper only; no time-series benchmark, training recipe, or empirical action-conditioned rollout evidence.
Multi-modal future distributions	adjacent	Uses latent variables and energy-based inference to represent multiple plausible future world states.	Does not instantiate calibrated future distributions for numeric time-series systems.
Streaming state, long context, and constant updates	adjacent	Short-term memory stores past, current, and predicted world states while the world model predicts future and missing state.	Operational update costs, retained memory policies, and always-on stream serving are unspecified.
Representation quality: semantic state vs dense detail	partially closes	Argues for prediction in abstract representation space and hierarchical JEPA so long-horizon prediction can ignore unpredictable details.	The abstraction/fidelity tradeoff is conceptual and not tested for generation, editing, or observability data.

Links Into The Wiki

Open Questions

Which parts of the proposed architecture are necessary versus optional?
How should hierarchical prediction be trained at large scale without collapse?

Alex Open Research Wiki

Explorer

A Path Towards Autonomous Machine Intelligence

A Path Towards Autonomous Machine Intelligence

Source

Core Claim

Key Contributions

Method Notes

Evidence And Results

Limitations

Foundation TSFM Relevance

Links Into The Wiki

Open Questions

Graph View

Table of Contents

Backlinks