A Path Towards Autonomous Machine Intelligence
Source
- Raw Markdown: paper_lecun-autonomous-machine-intelligence-2022.md
- PDF: paper_lecun-autonomous-machine-intelligence-2022.pdf
Core Claim
LeCun proposes an autonomous intelligence architecture built from configurable predictive world models, intrinsic objectives, hierarchical planning, and joint embedding architectures trained by self-supervised learning.
Key Contributions
- Frames world models as the missing substrate for human-like sample efficiency, reasoning, and planning.
- Argues for prediction in representation space rather than direct pixel-level prediction.
- Connects intrinsic motivation, actor modules, cost modules, and latent variables into one agent architecture.
Method Notes
This is a position paper rather than a narrow empirical result. It provides the conceptual root for JEPA, Energy-Based Models, and World Models in this wiki.
Evidence And Results
The evidence is architectural and argumentative: the paper compares limits of supervised learning, reinforcement learning, and generative modeling, then motivates hierarchical predictive representations.
Limitations
The proposal is broad and leaves many training details unresolved; later sources such as LeJEPA and LeWorldModel instantiate pieces of it.
Foundation TSFM Relevance
| Agenda slot | Verdict | Evidence | Missing pieces |
|---|---|---|---|
| Control and counterfactuals | adjacent | Proposes actor-generated action sequences evaluated by a predictive world model and cost module before executing the first action. | Position paper only; no time-series benchmark, training recipe, or empirical action-conditioned rollout evidence. |
| Multi-modal future distributions | adjacent | Uses latent variables and energy-based inference to represent multiple plausible future world states. | Does not instantiate calibrated future distributions for numeric time-series systems. |
| Streaming state, long context, and constant updates | adjacent | Short-term memory stores past, current, and predicted world states while the world model predicts future and missing state. | Operational update costs, retained memory policies, and always-on stream serving are unspecified. |
| Representation quality: semantic state vs dense detail | partially closes | Argues for prediction in abstract representation space and hierarchical JEPA so long-horizon prediction can ignore unpredictable details. | The abstraction/fidelity tradeoff is conceptual and not tested for generation, editing, or observability data. |
Links Into The Wiki
- Foundation Time-Series Model Research Agenda
- JEPA
- Energy-Based Models
- World Models
- Self-Supervised Representation Learning
Open Questions
- Which parts of the proposed architecture are necessary versus optional?
- How should hierarchical prediction be trained at large scale without collapse?