Alex Open Research Wiki

Tag: autoscaling

2 items with this tag.

  • Jun 19, 2026

    SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling

    • llm-serving
    • gpu-inference
    • autoscaling
    • forecasting
    • scheduling
    • cloud-infrastructure
  • Jun 19, 2026

    GPU Inference Optimization

    • gpu-inference
    • llm-serving
    • simulation
    • emulation
    • autoscaling
    • scheduling
    • systems

Created with Quartz v4.5.2 © 2026