Alex Open Research Wiki

Tag: llm-post-training

4 items with this tag.

  • May 28, 2026

    DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

    • block-wise-training
    • diffusion
    • memory-efficient-training
    • training-dynamics
    • recurrent-depth
    • llm-post-training
    • private-adaptation
  • May 28, 2026

    LLM Post-Training

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • evolution-strategies
    • continual-learning
    • private-adaptation
  • May 18, 2026

    Learning, Fast and Slow: Towards LLMs That Adapt Continually

    • llm-post-training
    • continual-learning
    • reinforcement-learning
    • prompt-optimization
    • plasticity
  • May 15, 2026

    On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • reward-rectification
    • weight-updates

Created with Quartz v4.5.2 © 2026