Assess long-term dynamics
Background: The Impermanent benchmark evaluates models on evolving data streams, enabling analysis of performance stability and ranking dynamics over time.
Question / Future Work: The authors suggest using longer evaluation horizons in the live benchmark setting to better understand how model performance stability and the relative rankings of different forecasting models evolve over extended periods of temporal change.
Metadata & Links
- created_at
- 2026-03-27T14:08:19Z