Alex Open Research Wiki

Tag: reinforcement-learning

8 items with this tag.

  • Jun 04, 2026

    On Training in Imagination

    • world-models
    • reinforcement-learning
    • reward-models
    • scaling-laws
    • data-economics
  • May 28, 2026

    LLM Post-Training

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • evolution-strategies
    • continual-learning
    • private-adaptation
  • May 24, 2026

    CWM: An Open-Weights LLM for Research on Code Generation with World Models

    • world-models
    • code-generation
    • llm-agents
    • software-engineering
    • reinforcement-learning
  • May 23, 2026

    Evolution Strategies

    • evolution-strategies
    • reinforcement-learning
    • post-training
  • May 18, 2026

    Learning, Fast and Slow: Towards LLMs That Adapt Continually

    • llm-post-training
    • continual-learning
    • reinforcement-learning
    • prompt-optimization
    • plasticity
  • May 15, 2026

    On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • reward-rectification
    • weight-updates
  • May 15, 2026

    Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

    • evolution-strategies
    • llm-fine-tuning
    • reinforcement-learning
    • post-training
  • May 15, 2026

    Evolution Strategies as a Scalable Alternative to Reinforcement Learning

    • evolution-strategies
    • reinforcement-learning
    • black-box-optimization

Created with Quartz v4.5.2 © 2026