Abstract FZJ-2025-05267

A Benchmark Arena for Real-Time Energy Forecasting


2025

EEM26, Copenhagen, Denmark, 18 Aug 2026 - 21 Aug 2026

Abstract: Forecasting electricity prices, load, and renewable generation has become essential for power system operators, energy traders, and analysts as renewable penetration and market volatility rise across Europe. This trend is reflected in the sharp growth of related research, including more than 4,000 papers published in 2024 alone [1]. Despite a vast body of literature and notable open-access efforts such as Lago et al. (2021) [2], the field lacks a widely accepted, continuously updated real-time benchmark that captures the evolving conditions of modern power markets. As a result, it remains difficult to reliably assess state-of-the-art forecasting performance, which hinders consistent and measurable progress in both research and commercial practice.

Existing studies rely on static historical benchmark datasets [2–4], which are useful for comparing relative model performance. However, in a rapidly changing energy system, their results quickly become outdated and offer limited insight into real-time forecasting performance under current system conditions. Furthermore, static benchmarks allow models to rely on exogenous regressors that were not available at the time the forecast would have been issued, thereby overlooking key real-time forecasting constraints. At the same time, commercial forecasting solutions remain predominantly closed-source, preventing transparent quality assessment and inhibiting trust in their performance claims (see, e.g., Semmelmann (2025) [7] or Stupperich (2025) [8]).

This paper introduces a benchmark arena for real-time energy forecasting. The platform provides the first open, API-driven environment where researchers and practitioners can submit day-ahead forecasts for electricity prices, load, and renewable generation. All submissions are evaluated once actual values are published, ensuring assessment under true real-time conditions.
Standardized metrics are computed automatically, and a public leaderboard ranks models based on rolling-horizon performance (e.g., the best results over the last 30, 90, or 365 days). This enables systematic comparison of forecasting solutions, quantifying real performance differences and providing an evidence-based view of methodological progress. By continuously reflecting contemporary market dynamics, the benchmark arena establishes a transparent, reproducible, and dynamically evolving reference for forecasting quality.
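
The rolling-horizon ranking described above can be illustrated with a minimal sketch. The abstract does not specify the metrics or the platform API, so everything here is an assumption made for illustration: mean absolute error (MAE) stands in for the standardized metric, and the function names `daily_mae`, `rolling_score`, and `leaderboard` are hypothetical.

```python
from datetime import date, timedelta
from statistics import mean


def daily_mae(forecast, actual):
    """MAE of one day-ahead forecast against the published actual values.

    MAE is an assumed metric; the abstract only says "standardized metrics".
    """
    if len(forecast) != len(actual):
        raise ValueError("forecast and actual must have the same length")
    return mean(abs(f - a) for f, a in zip(forecast, actual))


def rolling_score(daily_scores, as_of, window_days):
    """Average the daily scores whose delivery day lies in the rolling window.

    daily_scores maps delivery date -> daily MAE. The window covers the
    window_days days ending at as_of (inclusive). Returns None if the model
    has no scored days in the window.
    """
    start = as_of - timedelta(days=window_days - 1)
    in_window = [s for day, s in daily_scores.items() if start <= day <= as_of]
    return mean(in_window) if in_window else None


def leaderboard(scores_by_model, as_of, window_days):
    """Rank models by rolling score, lowest (best) error first."""
    rows = []
    for model, daily_scores in scores_by_model.items():
        score = rolling_score(daily_scores, as_of, window_days)
        if score is not None:  # skip models without submissions in the window
            rows.append((model, score))
    return sorted(rows, key=lambda row: row[1])


# Hypothetical daily scores for three illustrative models.
example_scores = {
    "model_a": {date(2025, 1, 1): 2.0, date(2025, 1, 2): 4.0},
    "model_b": {date(2025, 1, 1): 1.0, date(2025, 1, 2): 1.0},
    "model_c": {date(2024, 1, 1): 0.5},  # outside a 30-day window
}
ranking_30d = leaderboard(example_scores, date(2025, 1, 2), 30)
```

In this sketch, a model with no scored days inside the window is left off the ranking rather than being ranked on stale results; how the actual platform treats such gaps is not stated in the abstract.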


Contributing Institute(s):
  1. Jülicher Systemanalyse (ICE-2)
Research Program(s):
  1. 1111 - Effective System Transformation Pathways (POF4-111)
  2. 1112 - Societally Feasible Transformation Pathways (POF4-111)

Appears in the scientific report 2025

The record appears in these collections:
Document types > Presentations > Abstracts
Institute Collections > ICE > ICE-2
Workflow collections > Public records
Publications database

 Record created 2025-12-11, last modified 2025-12-11


