Alex Open Research Wiki

Tag: supervised-fine-tuning

2 items with this tag.

  • May 28, 2026

    LLM Post-Training

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • evolution-strategies
    • continual-learning
    • private-adaptation
  • May 15, 2026

    On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • reward-rectification
    • weight-updates

Created with Quartz v4.5.2 © 2026