Learning to run a Power Network Challenge: a Retrospective Analysis

Source

Raw Markdown: l2rpn-challenge-retrospective-2021
Rendered / retrieved PDF: paper_l2rpn-challenge-retrospective-2021.pdf
External source: https://arxiv.org/abs/2103.03104
Additional source: http://proceedings.mlr.press/v133/marot21a.html
Official L2RPN reference list: https://l2rpn.chalearn.org/papers-references

Publication And Credibility

Paper date: 2021-03-02; arXiv v2 on 2021-10-21.
Venue/status: PMLR NeurIPS 2020 Competition and Demonstration Track, 2021.
Credibility: Peer-reviewed competition-track paper from the L2RPN organizers at RTE, ChaLearn, Google Research, EPRI, TenneT, and collaborators. It is older than one year, so it is used as the canonical benchmark description, not as current algorithmic SOTA.

Core Claim

The paper introduces the NeurIPS 2020 L2RPN competition and the reusable Grid2Op benchmark framework for realistic sequential power-network operation scenarios.

L2RPN / Grid2Op Notes

Grid2Op exposes multivariate grid observations, topology actions, redispatching, curtailment, storage controls, cooldown constraints, exogenous load/generation variation, maintenance events, and contingent line disconnections. The competition uses week-long episodes at 5-minute resolution on IEEE-118-style networks and evaluates agents by survival, constraints, and operation cost.

Action-Time-Series Notes

This source is useful when Grid2Op is treated as an action-conditioned graph time-series environment:

power-grid observations + topology / redispatch / storage control input + scenario context
  -> next grid observations + safety/cost outcome

The terminology distinction matters. Topology changes, redispatching, curtailment, and storage commands are actions or control inputs when an agent chooses them. Line failures, maintenance outages, weather-driven renewable shifts, and demand variation are events or exogenous variables unless they are deliberately controlled by the experimenter.

Foundation TSFM Relevance

Agenda slot	Verdict	Evidence	Missing pieces
Causal structure, counterfactuals, and control	partially closes	Strong non-vision benchmark for action-conditioned graph time series and energy-control world-model experiments.	The benchmark remains simulator-specific and requires pinned environment versions, scenario generation, reward functions, and action masks before TSFM comparisons are meaningful.
Context interface: topology and channel context	partially closes	Power-grid state is naturally graph-structured and tied to physical assets, limits, and scenario metadata.	Needs a reusable schema that a general TSFM can consume across grids and non-grid operational systems.
Benchmark level	adjacent	L2RPN/Grid2Op provides simulator-backed trajectories with explicit controls and outcomes.	TSFM-ready comparisons require pinned environment versions, action sets, reward definitions, and train/test scenario splits.

Alex Open Research Wiki

Explorer

Learning to run a Power Network Challenge: a Retrospective Analysis

Learning to run a Power Network Challenge: a Retrospective Analysis

Source

Publication And Credibility

Core Claim

L2RPN / Grid2Op Notes

Action-Time-Series Notes

Foundation TSFM Relevance

Links Into The Wiki

Graph View

Table of Contents

Backlinks