Gated DeltaNet-2

Summary

Gated DeltaNet-2 is the linear recurrent attention method introduced by Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention. It replaces the scalar delta-rule update gate used by Gated DeltaNet and KDA with separate channel-wise key-side erase and value-side write gates.

Role In The Wiki

Use this page as the object card for the Gated DeltaNet-2 method. The source page carries the detailed evidence, limitations, and agenda mapping.

Gated DeltaNet-2 is relevant to the wiki as upstream architecture evidence for compact recurrent state and memory editing. It is not direct evidence for numeric time-series modeling, event streams, or action-conditioned world models.

Relation To Foundation TSFM Agenda

Use the source-level agenda mapping in gated-deltanet-2-2026 rather than duplicating verdict rows here. At the entity level, the important transfer is the state-update pattern: erasing stale associations and writing new information may need separate gates when a fixed-size latent state must preserve long-context information under interference.

Evidence

Official Artifacts