Alex Open Research Wiki
Tag: language-modeling
9 items with this tag.
Jun 02, 2026
ELF: Embedded Language Flows
language-modeling
diffusion
flow-matching
continuous-embeddings
multimodal
time-series-adjacent
May 18, 2026
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
recurrent-depth
latent-reasoning
test-time-compute
language-modeling
May 18, 2026
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation
looped-transformers
elastic-depth
test-time-compute
language-modeling
May 18, 2026
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
sequence-models
state-space-models
structured-matrices
recurrent-models
language-modeling
May 18, 2026
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
sequence-models
state-space-models
selective-ssms
recurrent-models
language-modeling
May 18, 2026
Mamba-3: Improved Sequence Modeling using State Space Principles
sequence-models
state-space-models
recurrent-models
inference-efficiency
language-modeling
May 18, 2026
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
test-time-training
online-optimization
recurrent-models
language-modeling
May 18, 2026
ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models
sequence-models
recurrent-models
nonlinear-rnns
language-modeling
parallel-training
May 18, 2026
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding
recurrent-depth
transformers
efficient-decoding
language-modeling