Alex Open Research Wiki

Tag: language-modeling

9 items with this tag.

  • Jun 02, 2026

    ELF: Embedded Language Flows

    • language-modeling
    • diffusion
    • flow-matching
    • continuous-embeddings
    • multimodal
    • time-series-adjacent
  • May 18, 2026

    Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

    • recurrent-depth
    • latent-reasoning
    • test-time-compute
    • language-modeling
  • May 18, 2026

    LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation

    • looped-transformers
    • elastic-depth
    • test-time-compute
    • language-modeling
  • May 18, 2026

    Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

    • sequence-models
    • state-space-models
    • structured-matrices
    • recurrent-models
    • language-modeling
  • May 18, 2026

    Mamba: Linear-Time Sequence Modeling with Selective State Spaces

    • sequence-models
    • state-space-models
    • selective-ssms
    • recurrent-models
    • language-modeling
  • May 18, 2026

    Mamba-3: Improved Sequence Modeling using State Space Principles

    • sequence-models
    • state-space-models
    • recurrent-models
    • inference-efficiency
    • language-modeling
  • May 18, 2026

    MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

    • test-time-training
    • online-optimization
    • recurrent-models
    • language-modeling
  • May 18, 2026

    ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models

    • sequence-models
    • recurrent-models
    • nonlinear-rnns
    • language-modeling
    • parallel-training
  • May 18, 2026

    The Recurrent Transformer: Greater Effective Depth and Efficient Decoding

    • recurrent-depth
    • transformers
    • efficient-decoding
    • language-modeling
